is a Data Source.
A large-scale microbial genome resource with consistent annotation, species clustering, and downloadable representative genome and protein datasets.
Warning: No license entered
Unknown
Unknown
Unknown
| ID | Name | URL | Category | Format | Description |
|---|---|---|---|---|---|
| progenomes.portal | proGenomes Portal | progenomes.embl.de | GraphicalInterface | http | Main proGenomes web interface for exp... |
| progenomes.pg4.ncbi-taxonomy | proGenomes v4 NCBI Taxonomy Table | pg4_ncbi_taxonomy.tsv.gz (5.8 MB) | Product | tsv | NCBI taxonomy mapping table for proGe... |
| progenomes.pg4.rep-proteins | proGenomes v4 Representative Proteins | pg4_proteins_representatives.faa.gz (28.3 GB) | Product | fasta | Representative protein FASTA set for ... |
| ID | Name | URL | Category | Format | Relation | Description |
|---|---|---|---|---|---|---|
| string.protein.links | STRING Protein Links | protein.links.v12.0.txt.gz (128.7 GB) | GraphProduct | txt | had primary source | protein network data (full network, s... |
| string.protein.links.detailed | STRING Protein Links Detailed | protein.links.detailed.v12.0.txt.gz (189.6 GB) | GraphProduct | txt | had primary source | protein network data (full network, i... |
| string.protein.links.full | STRING Protein Links Full | protein.links.full.v12.0.txt.gz (199.6 GB) | GraphProduct | txt | had primary source | protein network data (full network, i... |
| string.protein.physical.links | STRING Protein Physical Links | protein.physical.links.v12.0.txt.gz (11.1 GB) | GraphProduct | txt | had primary source | protein network data (physical subnet... |
| string.protein.physical.links.detailed | STRING Protein Physical Links Detailed | protein.physical.links.detailed.v12.0.txt.gz (13.8 GB) | GraphProduct | txt | had primary source | protein network data (physical subnet... |
| string.protein.physical.links.full | STRING Protein Physical Links Full | protein.physical.links.full.v12.0.txt.gz (14.5 GB) | GraphProduct | txt | had primary source | protein network data (physical subnet... |
| string.cog.links | STRING COG Links | COG.links.v12.0.txt.gz (176.8 MB) | GraphProduct | txt | had primary source | association scores between orthologou... |
| string.cog.links.detailed | STRING COG Links Detailed | COG.links.detailed.v12.0.txt.gz (238.7 MB) | GraphProduct | txt | had primary source | association scores (incl. subscores p... |
| string.database | STRING Database Network Schema | network_schema.v12.0.sql.gz (262.2 GB) | GraphProduct | ❔ | had primary source | full database, part II: the networks ... |
| metatraits.traits | metaTraits Trait List | traits | Product | http | had primary source | Trait data table listing all 140+ har... |
| metatraits.gtdb2ncbi | GTDB to NCBI Taxonomy Mapping | GTDB2NCBI.tsv.gz (2.8 MB) | Product | mixed | was derived from | Taxonomy crosswalk from GTDB release ... |
| metatraits.ncbi2gtdb | NCBI to GTDB Taxonomy Mapping | NCBI2GTDB.tsv.gz (2.8 MB) | Product | mixed | was derived from | Taxonomy crosswalk from NCBI taxonomy... |
| metatraits.ncbi.family-summary | metaTraits NCBI Family Summary | ncbi_family_summary.jsonl.gz (1.5 MB) | Product | mixed | was derived from | Family-level harmonized trait annotat... |
| metatraits.ncbi.genus-summary | metaTraits NCBI Genus Summary | ncbi_genus_summary.jsonl.gz (5.2 MB) | Product | mixed | was derived from | Genus-level harmonized trait annotati... |
| metatraits.ncbi.species-summary | metaTraits NCBI Species Summary | ncbi_species_summary.jsonl.gz (32.5 MB) | Product | mixed | was derived from | Species-level harmonized trait annota... |
| metatraits.gtdb.family-summary | metaTraits GTDB Family Summary | gtdb_family_summary.jsonl.gz (4.4 MB) | Product | mixed | was derived from | Family-level harmonized trait annotat... |
| metatraits.gtdb.genus-summary | metaTraits GTDB Genus Summary | gtdb_genus_summary.jsonl.gz (15.6 MB) | Product | mixed | was derived from | Genus-level harmonized trait annotati... |
| metatraits.gtdb.species-summary | metaTraits GTDB Species Summary | gtdb_species_summary.jsonl.gz (45.9 MB) | Product | mixed | was derived from | Species-level harmonized trait annota... |
proGenomes is a large-scale microbial genome resource with consistent annotations, quality controls, representative sets, and downloadable genome-derived datasets.
Created: January 30, 2026 | Last modified: February 15, 2026