HDR UK Gateway
HDR Gateway logo

Bookmarks

Genomics England - Transcriptomics

Population Size

7,840

People

Population Size statistic card

Years

2023

Years statistic card

Associated BioSamples

RNA

Associated BioSamples statistic card

Geographic coverage

UK

Geographic coverage statistic card

Lead time

2-6 months

Lead time statistic card

Summary

The Genomics England 100kGP Transcriptomics Pilot and Extension comprises RNA-sequencing of a subset of rare disease probands from the 100,000 Genomes Project who did not receive a genetic diagnosis through the Genomics England Interpretation Pipeline.

Documentation

The Genomics England 100kGP Transcriptomics Pilot and Extension comprises RNA-sequencing of a subset of rare disease probands from the 100,000 Genomes Project who did not receive a genetic diagnosis through the Genomics England Interpretation Pipeline (7840 samples from 7829 probands: 5546 samples in the initial Pilot project, 2294 samples in the Extension). We prioritised probands who were found to carry variants of unknown significance.

Priorities were based on:

  • Variants highlighted through Splice AI
  • Autosomal recessive disorders with only a single pathogenic variant identified
  • GMC-selected VUS AND contribution to phenotype partial / unknown AND variant type likely to affect RNA processing
  • Based on outcome questionnaire and a call to clinicians
  • VUS with a high Exomiser score AND variant likely to results in detectable abnormal RNA processing
  • Disorder category ranking by Genomics England on the basis of likely monogenic cause (ranks 1-5) for participants from 1.1 AND no diagnosis in outcome questionnaire
  • Call to GMCs / clinicians to propose cases based on strong phenotype for a monogenic disorder with no lead from WGS
  • Review whether RNA sample is available or requirement for fresh RNA sample

Dataset type

Health and disease

Dataset population size

7840

Keywords

Observations

Observed Node

Disambiguating Description

Measured Value

Measured Property

Observation Date

Persons

A subset of rare disease probands from the 100,000 Genomes Project who did not receive a genetic diagnosis through the Genomics England Interpretation Pipeline. 7840 samples from 7829 probands: 5546 samples in the initial Pilot project, 2294 samples in the Extension

7840

RNA-Seq

25 Sep 2025

Provenance

Purpose of dataset collection

Care, Disease registry, Study

Source of data extraction

Machine generated

Collection source setting

Clinic

Patient pathway description

Linked datasets cover secondary care.

Image contrast

Not stated

Biological sample availability

RNA

Structural Metadata

Details

Publishing frequency

Quarterly

Version

19.0.2

Modified

04/02/2026

Distribution release date

11/09/2025

Citation Requirements

The 100,000 Genomes Project Protocol v3, Genomics England. doi:10.6084/m9.figshare.4530893.v3. 2017. Publications that use the Genomics England Database should include an author as Genomics England Research Consortium. Please see the publication policy.

Coverage

Start date

21/12/2023

Time lag

Other

Geographic coverage

UK

Maximum age range

150

Follow-up

Other

Accessibility

Language

en

Alignment with standardised data models

OTHER

Controlled vocabulary

OTHER

Format

DRAGEN output, RNA-Seq QC output

Data Access Request

Dataset pipeline status

Not available

Time to dataset access

2-6 months

Access request cost

Fees will be dependent on the type of access that is necessary. Raw data is not eligible for export. Summary-level data may be exported provided that it is approved through the Genomics England Airlock Process

Access service description

More information about the Genomics England Research Environment can be found here: https://www.genomicsengland.co.uk/research and https://re-docs.genomicsengland.co.uk/welcome/. Genomics England 100k participants have consented to longitudinal lifetime followup and recontact safely through our clinical network.

Data use limitation

General research use

Data use requirements

Ethics approval required,Project-specific restrictions,Publication moratorium

Data Controller

GENOMICS ENGLAND

Data Processor

GENOMICS ENGLAND

Dataset Types: Health and disease


Collection Sources: Clinic

Relationships: