is a Data Source.
GENCODE is a comprehensive and high-quality reference annotation of the human and mouse genomes, providing evidence-based gene annotations including protein-coding genes, long non-coding RNAs, small RNAs, pseudogenes, and other genomic features based on manual curation and computational analysis.
Warning: No license entered
Unknown
Unknown
Unknown
| ID | Name | URL | Category | Format | Description |
|---|---|---|---|---|---|
| gencode.human.gtf | GENCODE Human Annotations GTF | gencode.v49.annotation.gtf.gz (89.0 MB) | Product | gff | Current comprehensive GENCODE gene an... |
| gencode.mouse.gtf | GENCODE Mouse Annotations GTF | gencode.vM38.annotation.gtf.gz (35.9 MB) | Product | gff | Current comprehensive GENCODE gene an... |
| gencode.primary | GENCODE Primary Transcripts | gencode_primary | Product | gff | GENCODE Primary transcript set captur... |
| ID | Name | URL | Category | Format | Relation | Description |
|---|---|---|---|---|---|---|
| ubkg.neo4j | UBKG Neo4j Docker Distribution | ubkg-downloads.xconsortia.org | GraphProduct | ❔ | had primary source | Turnkey neo4j distributions that depl... |
| ubkg.csv | UBKG Ontology CSV Files | ubkg-downloads.xconsortia.org | GraphProduct | csv | had primary source | Ontology CSV files that can be import... |
| lncrnalyzr.graph | lncRNAlyzr Knowledge Graph | ❔ | GraphProduct | neo4j | had primary source | Neo4j knowledge graph containing lncR... |
| hrpimp.data | HuRI Protein-Protein Interaction Data | HuRI.tsv (1.6 MB) | Product | tsv | was informed by | Current HuRI TSV file containing 52,5... |
| hrpimp.huri.psi | HuRI PSI-MI Interaction Data | HuRI.psi (162.0 MB) | Product | psi_mi_mitab | was informed by | Current HuRI PSI-MI formatted interac... |
GENCODE (Encyclopedia of Genes and Gene Variants) is a scientific project aimed at identifying and classifying all gene features in the human and mouse genomes with high accuracy based on biological evidence. It provides comprehensive, evidence-based annotations that serve as a reference standard for genome interpretation and biomedical research.
The goal of the GENCODE project is to identify and classify all gene features in the human and mouse genomes with high accuracy based on biological evidence, and to release these annotations for the benefit of biomedical research and genome interpretation.
GENCODE annotations are integrated into:
GENCODE is supported by the National Human Genome Research Institute (NHGRI) of the US National Institutes of Health.
When using GENCODE data, please cite the GENCODE project and relevant publications describing the specific release used.
GENCODE data are freely available for research use. See EMBL-EBI Terms of Use for details.
Created: November 26, 2025 | Last modified: June 02, 2026