oma

is a Data Source.

OMA (Orthologous MAtrix) is a database of orthologous genes among multiple species, providing a systematic classification of orthologs across complete genomes.

License

CC-BY-4.0

Homepage

oma

Repository

GitHub

Infores ID

Unknown

FAIRsharing ID

Unknown

Product Summary

Products

From this Resource
ID Name URL Category Format Description
oma.site OMA Browser oma GraphicalInterface http Web interface for exploring OMA data
oma.api.rest OMA REST API api ProgrammingInterface http RESTful API for accessing OMA data
oma.api.sparql OMA SPARQL sparql ProgrammingInterface http SPARQL endpoint for accessing OMA data
oma.groups.txt OMA Groups (Text) oma-groups.txt.gz Product txt OMA orthology groups in plain text fo...
oma.groups.orthoxml OMA Groups (OrthoXML) oma-groups.orthoXML.xml.gz Product xml OMA orthology groups in OrthoXML format
oma.hogs.orthoxml OMA HOGs (OrthoXML) oma-hogs.orthoXML.gz Product xml Hierarchical Orthologous Groups (HOGs...
oma.pairs.txt OMA Pairwise Orthologs oma-pairs.txt.gz Product tsv Pairwise orthologs in tab-separated t...
oma.proteins.fasta OMA Protein Sequences oma-seqs.fa.gz Product fasta Protein sequences from all genomes in...
oma.cdna.eukaryotes.fasta OMA cDNA Sequences (Eukaryotes) eukaryotes.cdna.fa.gz Product fasta cDNA sequences for Eukaryotic genomes...
oma.cdna.prokaryotes.fasta OMA cDNA Sequences (Prokaryotes) prokaryotes.cdna.fa.gz Product fasta cDNA sequences for Prokaryotic genome...
oma.cdna.viruses.fasta OMA cDNA Sequences (Viruses) viruses.cdna.fa.gz Product fasta cDNA sequences for viral genomes in F...
oma.proteins.seqxml OMA Protein Sequences (SeqXML) oma-seqs.seqxml.tgz Product xml Protein sequences in SeqXML format
oma.protein-annotations.txt OMA Protein Annotations oma-protein-annotations.txt.gz Product txt Protein annotations in text format
oma.hdf5 OMA Browser Database (HDF5) OmaServer.h5 Product hdf5 OMA Browser database in HDF5 format
oma.hdf5.index OMA Browser Database Index OmaServer.h5.idx Product mixed Suffix index for the OMA Browser HDF5...
oma.rdf OMA RDF Data oma-rdf-turtle.tgz Product ttl OMA data in RDF format (Turtle syntax)
oma.mapping.uniprot OMA to UniProt Mapping oma-uniprot.txt.gz MappingProduct tsv Mapping of OMA identifiers to UniProt...
oma.mapping.ensembl OMA to Ensembl Mapping oma-ensembl.txt.gz MappingProduct tsv Mapping of OMA identifiers to Ensembl...
oma.mapping.refseq OMA to RefSeq Mapping oma-refseq.txt.gz MappingProduct tsv Mapping of OMA identifiers to RefSeq ...
oma.mapping.entrez OMA to Entrez Gene Mapping oma-entrez.txt.gz MappingProduct tsv Mapping of OMA identifiers to Entrez ...
oma.mapping.go OMA to GO Mapping oma-go.txt.gz MappingProduct tsv Mapping of OMA identifiers to Gene On...
oma.species.txt OMA Species Metadata oma-species.txt Product txt Species metadata including taxonomy i...
oma.group-descriptions.txt OMA Group Descriptions group-descriptions.txt.gz Product txt Text descriptions for OMA groups

Details

OMA (Orthologous MAtrix) is a comprehensive database of orthologous gene relationships across multiple species. It provides a systematic and efficient approach to identifying orthologs among complete genomes, which are genes in different species that evolved from a common ancestor through speciation.

The OMA methodology employs an algorithm that starts with an all-against-all sequence alignment, followed by the identification of orthologous pairs and the construction of orthologous groups. The database includes:

  • Regular orthology inference across thousands of genomes
  • Hierarchical Orthologous Groups (HOGs) to capture evolutionary relationships at different taxonomic levels
  • Pairwise orthologous relationships between genes
  • Protein and DNA sequence data
  • Cross-references to major biological databases (UniProt, Ensembl, RefSeq, etc.)
  • Functional annotations and GO term mappings

OMA is maintained by the Dessimoz Lab at the University of Lausanne and the Swiss Institute of Bioinformatics (SIB), with biannual releases adding new genomes and improving data quality. The database is widely used for comparative genomics, phylogenetics, functional annotation transfer, and evolutionary studies.

Access to OMA data is provided through a user-friendly web interface, a REST API for programmatic access, and bulk downloads in various formats including text files, OrthoXML, FASTA, and RDF.

Is this information incorrect or incomplete? Request an update.

Created: May 07, 2025 | Last modified: June 01, 2026