corum

is a Data Source.

CORUM (Comprehensive Resource of Mammalian Protein Complexes) is a curated database of experimentally characterized protein complexes from mammalian organisms, particularly human, mouse, and rat, with a focus on manually annotated information from scientific literature.

License

CC BY-NC 4.0

Homepage

corum

Infores ID

Unknown

FAIRsharing ID

Unknown

Product Summary

Products

From this Resource
ID Name URL Category Format Description
corum.all_complexes CORUM All Complexes download Product tsv Complete dataset of all curated prote...
corum.core_complexes CORUM Core Complexes download Product tsv Core dataset of manually curated, non...
corum.psi_mi CORUM PSI-MI download Product psi_mi_xml Dataset of all CORUM protein complexe...
corum.mitab CORUM MITAB download Product psi_mi_mitab Dataset of all CORUM protein complexe...
From other Resources
ID Name URL Category Format Relation Description
bioteque.embeddings Bioteque Embeddings embeddings Product had primary source Network embeddings of the Bioteque gr...
clinicalkg.graph CKG Graph Dump 1 GraphProduct mixed had primary source Neo4j database dump of the Clinical K...
cancer-genome-interpreter.clinicalkg.graph CKG Graph Dump 1 GraphProduct mixed had primary source Neo4j database dump of the Clinical K...
ckg.graph CKG Graph Database Dump 1 GraphProduct neo4j had primary source Graph database dump and additional re...
pathwaycommons.biopax Integrated BioPAX Model pc-biopax.owl.gz (1.6 GB) Product biopax was derived from PC v14 integrated BioPAX Level 3 unif...
harmonizome.downloads Harmonizome Downloads download Product mixed was derived from Harmonizome 3.0 processed dataset dow...
harmonizome.kg-neo4j Harmonizome Knowledge Graph Neo4j Database harmonizome-kg.maayanlab.cloud GraphProduct neo4j was derived from Neo4j knowledge graph serialization o...
pathwaycommons.downloads Pathway Commons Data Downloads v14 Product mixed was derived from Download directory for Pathway Common...
pathwaycommons.sif SIF Network Format pc-hgnc.sif.gz (9.4 MB) Product sif was derived from PC v14 Simple Interaction Format netw...
pathwaycommons.gmt GMT Gene Set Format pc-hgnc.gmt.gz (256.4 KB) Product was derived from PC v14 Gene Matrix Transposed gene se...
pathwaycommons.txt Extended SIF TXT Format pc-hgnc.txt.gz (110.3 MB) Product txt was derived from PC v14 tab-delimited extended SIF nod...
biobtree.api BioBTree REST API api ProgrammingInterface http had primary source REST API for searching identifiers an...

Details

CORUM - Comprehensive Resource of Mammalian Protein Complexes

CORUM is a manually curated database of experimentally characterized protein complexes from mammalian organisms. It serves as a comprehensive and high-quality resource for researchers studying protein-protein interactions, cellular functions, and disease mechanisms.

Overview

The CORUM database focuses on protein complexes from human, mouse, and rat, with a particular emphasis on human complexes. The database is distinguished by its commitment to manual curation from scientific literature, ensuring high data quality. Every entry in CORUM is based on experimental evidence from published research papers, providing reliable information about the composition and function of protein complexes.

As of the 2023 update, CORUM contains:

  • Over 4,700 protein complexes
  • More than 3,300 different genes
  • Approximately 20,000 protein-protein interactions

Key Features

CORUM provides comprehensive information about each protein complex, including:

  • Complex name and synonyms
  • Organism source
  • Protein composition (with UniProt IDs)
  • Associated diseases
  • Biological functions (GO terms)
  • Cellular localization
  • PubMed references to experimental evidence
  • Tissue distribution
  • Protein complex purification methods

Data Sources and Curation

All entries in CORUM are manually extracted from scientific literature by expert curators. This manual curation approach ensures high-quality data and allows for the inclusion of detailed information that may not be captured by automated text mining approaches. The curation process focuses on:

  1. Identifying experimentally verified protein complexes from peer-reviewed publications
  2. Mapping protein components to standardized identifiers (UniProt)
  3. Annotating complexes with functional information and disease associations
  4. Cross-referencing with other databases and ontologies

Applications

CORUM serves as a valuable resource for various research applications:

  • Proteomics: Interpretation of mass spectrometry data and analysis of protein-protein interactions
  • Systems Biology: Study of cellular pathways and network analysis
  • Disease Research: Investigation of disease mechanisms and identification of therapeutic targets
  • Computational Biology: Training and validation of protein interaction prediction algorithms
  • Functional Genomics: Functional annotation of genes and proteins

Data Access

CORUM data is freely available for academic and non-commercial use under a Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) license. The database can be accessed through:

Is this information incorrect or incomplete? Request an update.

Created: July 22, 2025 | Last modified: June 12, 2026