msigdb

is a Data Source.

The Molecular Signatures Database (MSigDB) is a comprehensive collection of tens of thousands of annotated gene sets for use with Gene Set Enrichment Analysis (GSEA) software. MSigDB includes curated gene sets from pathway databases, gene ontology annotations, hallmark gene sets, immunologic signatures, regulatory target sets, and cell type-specific signatures derived from single-cell sequencing studies. Available for both human and mouse.

Domains

genomics, systems biology, pathways

Homepage

msigdb

Repository

Unknown

Infores ID

infores:msigdb

FAIRsharing ID

Unknown

Product Summary

Products

From this Resource
ID Name URL Category Format Description
msigdb.browser MSigDB Web Browser msigdb GraphicalInterface http Web interface for browsing, searching...
msigdb.downloads.human MSigDB Human Gene Sets Downloads downloads.jsp#msigdb Product mixed Downloadable gene set files in GMT, X...
msigdb.downloads.mouse MSigDB Mouse Gene Sets Downloads downloads.jsp#msigdb Product mixed Downloadable gene set files for mouse...
msigdb.investigate MSigDB Investigate Tool annotate.jsp GraphicalInterface Interactive tool for computing overla...
msigdb.gene_families MSigDB Gene Families Tool gene_families.jsp GraphicalInterface Tool for categorizing gene set member...
From other Resources
ID Name URL Category Format Description
ubkg.neo4j UBKG Neo4j Docker Distribution ubkg-downloads.xconsortia.org GraphProduct Turnkey neo4j distributions that depl...
ubkg.csv UBKG Ontology CSV Files ubkg-downloads.xconsortia.org GraphProduct csv Ontology CSV files that can be import...
obo-db-ingest.msigdb.tsv msigdb Nodes TSV msigdb.tsv (3.0 MB) Product tsv msigdb Nodes TSV

Details

Molecular Signatures Database (MSigDB)

Overview

The Molecular Signatures Database (MSigDB) is a comprehensive resource of tens of thousands of annotated gene sets designed for use with Gene Set Enrichment Analysis (GSEA) software. Developed as a joint project between UC San Diego and the Broad Institute, MSigDB provides systematic gene set collections that enable researchers to interpret genome-wide expression profiles and identify coordinate changes in gene expression.

Collections

Human Collections (v2025.1.Hs)

  • Hallmark Gene Sets: Coherently expressed signatures representing well-defined biological states or processes
  • Curated Gene Sets: From pathway databases, PubMed publications, and domain experts
  • Regulatory Target Gene Sets: microRNA seed sequences and transcription factor binding sites
  • Computational Gene Sets: Mined from large cancer-oriented expression datasets
  • Gene Ontology Gene Sets: Genes annotated by the same ontology term
  • Oncogenic Signature Gene Sets: From cancer gene perturbations
  • Immunologic Signature Gene Sets: Cell states and perturbations in the immune system
  • Cell Type Signature Gene Sets: Cluster markers from single-cell sequencing studies
  • Positional Gene Sets: Chromosome cytogenetic band locations

Mouse Collections (v2025.1.Mm)

  • Mouse-ortholog versions of hallmark gene sets
  • Curated gene sets from pathways and literature
  • Gene ontology annotations
  • Immunologic signatures
  • Cell type signatures from single-cell studies
  • Regulatory target gene sets
  • Positional gene sets

Features

  • Browse and search gene sets by name, keyword, or collection
  • Examine individual gene set annotations and member genes
  • Compute overlaps between custom gene sets and MSigDB collections
  • Categorize genes by gene families
  • View expression profiles in public compendia
  • Integration with NDEx biological network repository
  • Download gene sets in multiple formats (GMT, XML, etc.)

Access

  • Free registration required for downloads and web tools
  • Used to track usage for funding agency reports
  • No charge for academic and non-commercial use

Use with GSEA

MSigDB gene sets are designed for direct use with GSEA (Gene Set Enrichment Analysis) software to identify whether predefined sets of genes show statistically significant, concordant differences between two biological states.

Funding

Currently funded by NCI’s Informatics Technology for Cancer Research (ITCR) program.

Community Contributions

MSigDB welcomes suggestions and contributions of new gene sets from the research community.

Is this information incorrect or incomplete? Request an update.

Created: October 30, 2025 | Last modified: October 30, 2025