Bookmarks
A dataset of hospitalised patients with Sarcoma
Population Size
1,629
People
Years
2004 - 2021
Associated BioSamples
None/not available
Geographic coverage
United Kingdom
England
Lead time
1-2 months
Summary
Documentation
Background
Sarcomas are uncommon cancers that can affect any part of the body. There are many different types of sarcoma and subtypes can be grouped into soft tissue or bone sarcomas. About 15 people are diagnosed every day in the UK. 3 in every 200 people with cancer in the UK have sarcoma.
A highly granular dataset with a confirmed sarcoma event including hospital presentation, serial physiology, demography, treatment prescribed and administered, prescribed and administered drugs. The infographic includes data from 27/12/2004 to 31/12/2021 but data is available from the past 10 years+.
PIONEER geography: The West Midlands (WM) has a population of 5.9 million & includes a diverse ethnic & socio-economic mix.
EHR. UHB is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & an expanded 250 ITU bed capacity during COVID. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”.
Scope: All hospitalised patients from 2004 onwards, curated to focus on Sarcoma. Longitudinal & individually linked, so that the preceding & subsequent health journey can be mapped & healthcare utilisation prior to & after admission understood. The dataset includes highly granular patient demographics & co-morbidities taken from ICD-10 & SNOMED-CT codes. Serial, structured data pertaining to acute care process (timings, staff grades, specialty review, wards and triage). Along with presenting complaints, outpatients admissions, microbiology results, referrals, procedures, therapies, all physiology readings (pulse, blood pressure, respiratory rate, oxygen saturations and others), and all blood results (urea, albumin, platelets, white blood cells and others). Includes all prescribed & administered treatments and all outcomes. Linked images are also available (radiographs, CT scans, MRI).
Available supplementary data: Matched controls; ambulance, OMOP data, synthetic data.
Available supplementary support: Analytics, Model build, validation & refinement; A.I.; Data partner support for ETL (extract, transform & load) process, Clinical expertise, Patient & end-user access, Purchaser access, Regulatory requirements, Data-driven trials, “fast screen” services.
Dataset type
Dataset sub-type
Dataset population size
Keywords
Observations
Observed Node | Disambiguating Description | Measured Value | Measured Property | Observation Date |
---|---|---|---|---|
Persons | 1,629 spells with patients with diabetes between 27-12-2004 and 31-12-2021 | 1629 | Count | 02 Jan 2022 |
Provenance
Purpose of dataset collection
Source of data extraction
Collection source setting
Patient pathway description
Image contrast
Biological sample availability
Structural Metadata
Details
Publishing frequency
Version
Modified
08/10/2024
Distribution release date
20/01/2022
Citation Requirements
Coverage
Start date
27/12/2004
End date
31/12/2021
Time lag
Geographic coverage
Minimum age range
Maximum age range
Follow-up
Accessibility
Language
Alignment with standardised data models
Controlled vocabulary
Format
Data Access Request
Dataset pipeline status
Time to dataset access
Access request cost
Access method category
Access service description
Trusted Research Environments (TRE) are built using Microsoft Azure services and hosted in the UK to provide research teams a safe, secure and agile environment which allows users to quickly analyse, interpret and form an enriched view of primary care information through a range of integrated datasets.
Health data collated from multiple sources is ingested into a secure data lake which will then allow subsets of data to be made available to research teams on approval of a data request. Once approved a customer specific TRE is made available with a standard set of leading analytical tools from Microsoft including Azure Databricks, Azure Machine Learning, Azure SQL and Azure Synapse (for large-scale data warehouses). Specific tools can be provided at an additional cost over the standard platform data access charge and the PIONEER team will work with you to determine your exact needs.
Access to the TRE is managed using the latest virtual desktop technology to provide a safe and secure end-user experience. By utilising leading edge design PIONEER are able to create TREs rapidly to enable us to service any customer requirement.
Jurisdiction
Data use limitation
Data use requirements
Data Controller