HDR Gateway logo

King's College Hospital MedCAT NLP 2011-2019

Population Size

1,073,183

People

Years

2011 - 2019

Associated BioSamples

None/not available

Geographic coverage

United Kingdom

England

Lead time

Not applicable

Summary

SNOMED CT codes derived from free-text electronic health records (EHR) from 2011-2019, covering all inpatients at King's College Hospital (KCH), using MedCAT natural language processing (NLP). Research use of the dataset is governed by the KERRI committee and requires a KCH principal investigator.

Documentation

This dataset contains natural language processing (NLP) output from the MedCAT library, applied to the full-text content of the King's College Hospital (KCH) electronic health record available through CogStack. Documents were annotated with SNOMED CT codes and with meta-annotations for experiencer, negation and temporality.

Research use of the dataset is governed by the patient-led KERRI committee, and requires a KCH principal investigator.
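
As an illustration of how the meta-annotations described above might be used, the following is a minimal sketch in Python that keeps only annotations referring to the patient, not negated, and current. The record layout, the field names (document_id, snomed_code, meta) and the meta-annotation values are hypothetical assumptions made for this example; the dataset's actual schema is not documented here and should be confirmed with the data custodian.

```python
import json

# Hypothetical annotation records. The field names and the meta-annotation
# values are illustrative assumptions, not the dataset's documented schema.
# The codes are placeholders used only to tell the records apart.
SAMPLE_JSON = """
[
  {"document_id": "doc-1", "snomed_code": "38341003",
   "meta": {"experiencer": "patient", "negation": "affirmed", "temporality": "current"}},
  {"document_id": "doc-1", "snomed_code": "73211009",
   "meta": {"experiencer": "family", "negation": "affirmed", "temporality": "current"}},
  {"document_id": "doc-2", "snomed_code": "22298006",
   "meta": {"experiencer": "patient", "negation": "negated", "temporality": "current"}}
]
"""


def select_annotations(records, **required_meta):
    """Return the records whose meta-annotations match every required value."""
    return [
        record
        for record in records
        if all(
            record.get("meta", {}).get(name) == value
            for name, value in required_meta.items()
        )
    ]


records = json.loads(SAMPLE_JSON)

# Keep mentions that are about the patient, affirmed, and current.
current_patient = select_annotations(
    records,
    experiencer="patient",
    negation="affirmed",
    temporality="current",
)
codes = [record["snomed_code"] for record in current_patient]
print(codes)
```

Filtering on all three meta-annotations together is one possible design choice: a code mentioned for a family member or in a negated statement would otherwise be counted as a finding for the patient.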

Dataset type
Health and disease
Dataset sub-type
Not applicable
Dataset population size
1073183

Keywords

Electronic Health Record, NLP, SNOMED CT

Observations

Observed Node
Persons
Disambiguating Description
(none given)
Measured Value
1073183
Measured Property
count
Observation Date
31 Dec 2019

Provenance

Purpose of dataset collection
Care
Collection source setting
Secondary care - In-patients
Image contrast
Not stated
Biological sample availability
None/not available

Structural Metadata

Details

Publishing frequency
Continuous
Version
1.0.0
Modified

08/10/2024

Citation Requirements
King's College Hospital NHS Foundation Trust

Coverage

Start date

01/01/2011

End date

31/12/2019

Time lag
Not applicable
Geographic coverage
United Kingdom, England, London
Minimum age range
18
Maximum age range
100

Accessibility

Language
en
Controlled vocabulary
SNOMED CT
Format
text/json, text/csv

Data Access Request

Dataset pipeline status
Not available
Time to dataset access
Not applicable
Access method category
Varies based on project
Access service description
Research use of the dataset is governed by the patient-led KERRI committee, and requires a KCH principal investigator. We recommend making contact with a KCH principal investigator first to facilitate applications for approvals. The data will only be accessible in the KCH data environment within the NHS firewall and will not be transferred out of KCH.
Jurisdiction
GB-ENG
Data use limitation
Research use only
Data use requirements
User-specific restriction, Project-specific restrictions
Data Controller
King's College Hospital NHS Foundation Trust, with oversight by the Caldicott Guardian
Data Processor
N/A
