HDR Gateway logo
HDR Gateway logo

Bookmarks

CCU037: Improving methods to minimise bias in ethnicity data for more representative and generalisable models, using CVD in COVID-19 as an example

Safe People

Organisation name

University of Oxford

Organisation sector

Academic Institute

Applicant name(s)

Sara Khalid

Sub-licence arrangements (if any)?

No

Safe Projects

Project ID

CCU037

Lay summary

Inequality in health has been made worse by the COVID-19 pandemic. People from minority ethnic backgrounds are more likely to become very sick or die from COVID-19. An example of inequality in health is technology for predicting a person’s future health risks. This involves routinely collected health information which is put into a computer model and then a health risk score for a patient is given. Doctors can use this to decide patient care. If there is bias in the data or bias in the model, the doctor can potentially make wrong decisions and patients can get the wrong care or no care. This could result in some groups of patients being incorrectly prioritised over others for booster vaccines, hospital beds, or life-saving treatments. This might affect patient and public trust, as well as cost the NHS. We are aiming to improve existing technology for predicting personalised future risk of health conditions, particularly those affecting overlooked groups of patients. We aim to do so by: a) improving the way recorded ethnicity is used in research, and b) improving the modelling process to build risk prediction models designed specifically to ethnicity groups and therefore more reliable.

Public benefit statement

We know that there are ethnicity biases for cardiovascular disease in COVID-19 patients. We are developing a calculator to predict cardiovascular disease in COVID-19 patients. We will use this as a first example and will then be able to use this approach across other health and disease areas. The calculator can be used by public to guide lifestyle choices, and by doctors to provide better care. This can also be used by researchers nationwide doing health research involving ethnicity. This work will be based on health information that represents almost everyone currently living in England and Wales, without being traced back to them. By extending to Northern Ireland and Scotland in future, we hope that this work will help to make health equal and fair for everyone in the UK.

Latest approval date

18/11/2021

Safe Data

Dataset(s) name

Intensive Care National Audit and Research Centre - Covid19 (ICCD)

Intensive Care National Audit and Research Centre (ICNC)

Data sensitivity level

De-Personalised

Safe Setting

Access type

TRE