tcga

is a Data Source.

The Cancer Genome Atlas (TCGA) is a landmark cancer genomics program that molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types. This joint effort between the National Cancer Institute (NCI) and the National Human Genome Research Institute began in 2006, bringing together researchers from diverse disciplines and multiple institutions. TCGA generated more than 2.5 petabytes of genomic, epigenomic, transcriptomic, and proteomic data. The data, which has led to improvements in the ability to diagnose, treat, and prevent cancer, remains publicly available through the Genomic Data Commons for anyone in the research community to use.

Domains

genomics

License

Warning: No license entered

Homepage

tcga

Repository

Unknown

Infores ID

infores:tcga

FAIRsharing ID

Unknown

Product Summary

Products

From this Resource
ID | Name | URL | Category | Format | Description
tcga.gdc_portal | GDC Data Portal | portal.gdc.cancer.gov | Product | http | Genomic Data Commons Data Portal prov...
tcga.gdc_api | GDC API | gdc-application-programming-interface-api | ProgrammingInterface | http | Genomic Data Commons Application Prog...
tcga.gdc_submission | GDC Data Submission Portal | submission | GraphicalInterface | http | Data Submission Portal for uploading ...

Details

The Cancer Genome Atlas

Overview

The Cancer Genome Atlas (TCGA) is a landmark cancer genomics program that molecularly characterized over 20,000 primary cancer and matched normal samples spanning 33 cancer types. This joint effort between the National Cancer Institute (NCI) and the National Human Genome Research Institute began in 2006, bringing together researchers from diverse disciplines and multiple institutions.

Data and Impact

Over twelve years, TCGA generated more than 2.5 petabytes of genomic, epigenomic, transcriptomic, and proteomic data. This comprehensive molecular characterization has led to significant improvements in our ability to diagnose, treat, and prevent cancer. All TCGA data remains publicly available through the Genomic Data Commons (GDC) for anyone in the research community to use.

Key Features

  • 33 Cancer Types: Comprehensive molecular characterization across diverse cancer types
  • 20,000+ Samples: Primary cancer and matched normal samples
  • Multi-omics Data: Genomic, epigenomic, transcriptomic, and proteomic measurements
  • Harmonized Data: Standardized clinical and genomic data for cross-analysis
  • Open Access: Data publicly available through the GDC Data Portal (some data types, such as raw sequencing reads, are controlled-access and require authorization)

Data Access

TCGA data is accessible through the Genomic Data Commons Data Portal, which provides:

  • Data Repository: Browse and download clinical, biospecimen, and genomic data
  • Cohort Builder: Create custom cohorts using clinical and biospecimen filters
  • Analysis Tools: Visualization tools for genomic alterations and clinical features
  • API Access: Programmatic access through the GDC API
  • Data Transfer Tool: Efficient download of large data files
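
As a rough illustration of the API access route listed above, the following Python sketch builds a request for the GDC API `projects` endpoint, filtered to the TCGA program. It is a minimal sketch rather than an official client: the base URL, the `program.name` filter field, and the chosen `fields` are assumptions based on the public GDC API and should be checked against the GDC documentation. The helper names `build_tcga_projects_url` and `fetch_tcga_projects` are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumption: public GDC API base URL; confirm against the GDC documentation.
GDC_API_BASE = "https://api.gdc.cancer.gov"


def build_tcga_projects_url(size=50):
    """Return a /projects URL whose filter keeps only TCGA projects."""
    # GDC filters are JSON objects with an operator and a content clause.
    filters = {
        "op": "in",
        "content": {"field": "program.name", "value": ["TCGA"]},
    }
    params = {
        "filters": json.dumps(filters),
        "fields": "project_id,name,primary_site",  # assumed field names
        "format": "JSON",
        "size": str(size),
    }
    return f"{GDC_API_BASE}/projects?{urllib.parse.urlencode(params)}"


def fetch_tcga_projects(size=50):
    """Send the request (needs network access) and return the list of hits."""
    with urllib.request.urlopen(build_tcga_projects_url(size), timeout=30) as resp:
        payload = json.load(resp)
    # Assumption: results are nested under data -> hits in the response.
    return payload["data"]["hits"]
```

Calling `fetch_tcga_projects()` should return one record per TCGA project, assuming the endpoint and fields behave as described. The same filter structure could be reused against other endpoints such as `cases` or `files`, with field names adjusted to that endpoint.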

Cancer Types Studied

TCGA selected 33 cancer types for molecular characterization based on public health impact, availability of samples, and potential for biological insights.

Information Resource ID

This resource has the Information Resource identifier: infores:tcga

Support

For questions about TCGA data or the GDC applications, contact the GDC Support team.


Created: November 08, 2025 | Last modified: November 08, 2025