Bookmarks

An HDRUK dataset: Linked Health and Air Quality Data for older adults

Population Size

3,175,299

People

Population Size statistic card

Years

2000 - 2023

Years statistic card

Associated BioSamples

None/not available

Associated BioSamples statistic card

Geographic coverage

https://www....

Geographic coverage statistic card

Lead time

1-2 months

Lead time statistic card

Summary

An HDRUK QQ2 longitudinal dataset of 3,175,299 admissions for patients aged 65+, linking DEFRA air pollution data to acute care presentations. Includes acuity, respiratory data, medications and outcomes to assess environmental health impacts.

Documentation

The environmental determinants of health are increasingly recognised with descriptions of higher hospital admissions and poorer outcomes for patients during periods of low air quality. Older adults are known to be especially prone to adverse health events during extremes of temperature but the acute impact of poor air quality is less well known.

PIONEER has curated a highly granular dataset of 3,175,299 admissions for patients aged 65 and over, linking DEFRA air pollution data to patients' registered addresses. It includes demographics, admission details, diagnostic codes (ICD-10 & SNOMED-CT), respiratory data, medications, and presenting complaints. This dataset enables analysis of the short- and long-term impacts of air quality on health outcomes, providing a unique resource for research in environmental health, epidemiology, and multidisciplinary studies. Covering admissions from 01-01-2000 to 31-08-2023, it offers opportunities to expand to other timelines or include all age groups if required.

Geography: The West Midlands (WM) has a population of 6 million & includes a diverse ethnic & socio-economic mix. UHB is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & > 120 ITU bed capacity. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”.

Data set availability: Data access is available via the PIONEER Hub for projects which will benefit the public or patients. This can be by developing a new understanding of disease, by providing insights into how to improve care, or by developing new models, tools, treatments, or care processes. Data access can be provided to NHS, academic, commercial, policy and third sector organisations. Applications from SMEs are welcome. There is a single data access process, with public oversight provided by our public review committee, the Data Trust Committee. Contact pioneer@uhb.nhs.uk or visit www.pioneerdatahub.co.uk for more details.

Available supplementary data: Matched controls; ambulance and community data. Unstructured data (images). We can provide the dataset in OMOP and other common data models and can build synthetic data to meet bespoke requirements.

Available supplementary support: Analytics, model build, validation & refinement; A.I. support. Data partner support for ETL (extract, transform & load) processes. Bespoke and “off the shelf” Trusted Research Environment (TRE) build and run. Consultancy with clinical, patient & end-user and purchaser access/ support. Support for regulatory requirements. Cohort discovery. Data-driven trials and “fast screen” services to assess population size.

Dataset type

Health and disease, Measurements/Tests, Lifestyle

Dataset sub-type

Not applicable

Dataset population size

3175299

Keywords

Observations

Observed Node

Disambiguating Description

Measured Value

Measured Property

Observation Date

Persons

3,175,299 patients from Jan 2000 to Sep 2023 aged 65+

3175299

Count

05 Dec 2024

Provenance

Purpose of dataset collection

Care

Source of data extraction

EPR

Collection source setting

Secondary care - Accident and Emergency, Secondary care - In-patients

Patient pathway description

Data is representative of the multi-ethnicity population within the West Midlands (42% non white). Data includes all patients admitted during this timeframe, with National data Opt Outs applied, and therefore is representative of admissions to secondary care. Data focuses on in-patient stay in hospital during the acute episode but can be supplemented on request to include previous and subsequent hospital contacts (including outpatient appointments) and ambulance, 111, 999 data.

Image contrast

Not stated

Biological sample availability

None/not available

Structural Metadata

Details

Publishing frequency

Quarterly

Version

1.0.0

Modified

17/12/2024

Distribution release date

17/12/2024

Citation Requirements

This publication uses data from PIONEER, an ethically approved database and analytical environment (East Midlands Derby Research Ethics 20/EM/0158)

Coverage

Start date

01/01/2000

End date

01/09/2023

Time lag

Less than 1 week

Minimum age range

65

Maximum age range

150

Follow-up

0 - 6 Months

Accessibility

Language

en

Alignment with standardised data models

LOCAL

Controlled vocabulary

LOCAL

Format

SQL, CSV, EXCEL

Data Access Request

Dataset pipeline status

Available

Time to dataset access

1-2 months

Access request cost

www.pioneerdatahub.co.uk/data/data-services-costs/

Access method category

TRE/SDE

Access service description

Trusted Research Environments (TRE) are built using Microsoft Azure services and hosted in the UK to provide research teams a safe, secure and agile environment which allows users to quickly analyse, interpret and form an enriched view of primary care information through a range of integrated datasets.

Health data collated from multiple sources is ingested into a secure data lake which will then allow subsets of data to be made available to research teams on approval of a data request. Once approved a customer specific TRE is made available with a standard set of leading analytical tools from Microsoft including Azure Databricks, Azure Machine Learning, Azure SQL and Azure Synapse (for large-scale data warehouses). Specific tools can be provided at an additional cost over the standard platform data access charge and the PIONEER team will work with you to determine your exact needs.

Access to the TRE is managed using the latest virtual desktop technology to provide a safe and secure end-user experience. By utilising leading edge design PIONEER are able to create TREs rapidly to enable us to service any customer requirement.

Jurisdiction

GB-ENG

Data use limitation

General research use

Data use requirements

Project-specific restrictions

Data Controller

University Hospitals Birmingham NHS Foundation Trust

Data Processor

NOT APPLICABLE

Dataset Types: Health and disease, Measurements/Tests, Lifestyle


Collection Sources: Secondary care - Accident and Emergency, Secondary care - In-patients

end of page