Catalogue of Life
Overview
The Catalogue of Life (CoL) is the most comprehensive global taxonomic resource, serving as the authoritative checklist of all known species on Earth. Built and maintained by a global consortium of 500+ taxonomic experts from 165 peer-reviewed taxonomic databases, it provides the foundational taxonomic backbone for biodiversity research, conservation, species identification, and environmental policy worldwide.
Since its creation in 2001 as a partnership between Species 2000 and ITIS, the Catalogue of Life has evolved into an essential infrastructure resource supporting major international biodiversity initiatives including IUCN Red List assessments, Convention on Biological Diversity implementation, and GBIF (Global Biodiversity Information Facility) operations. As of 2025, it documents 2.2+ million living species and 153,000 extinct species through integration of 59,668 global, regional, and specialist data sources.
Data Content and Scale
Species Coverage
Catalogue of Life 2025 (Latest Release):
- 2,238,246 living (extant) species - Comprehensive global inventory
- ~153,000 extinct species - Historical and paleontological data
- 5.9+ million scientific names - Including accepted names, synonyms, and common names
- 59,668 data sources - Global, regional, national, and specialized checklists
- 165 peer-reviewed taxonomic databases - From specialist institutions worldwide
Year-over-Year Growth:
- 2% increase in total names from previous year
- 48,766 newly accepted species names added in 2025 release
- Continuous expansion reflecting ongoing taxonomic discovery and research
Taxonomic Coverage by Kingdom
Catalogue of Life provides comprehensive coverage across all major organism groups:
Animalia (Animals):
- Vertebrates: fish, amphibians, reptiles, birds, mammals (~95% coverage)
- Arthropods: insects, arachnids, crustaceans
- Mollusks, echinoderms, cnidarians, sponges
- Other invertebrate phyla
- Coverage estimated at 95%+ for described vertebrates
Plantae (Plants):
- Vascular plants: flowering plants, ferns, gymnosperms (~95% coverage)
- Mosses, liverworts, hornworts
- Green and red algae
- Coverage estimated at 95%+ for described vascular plants
Fungi:
- Ascomycetes and basidiomycetes
- Other fungal groups
- Moderate coverage with ongoing expansion
Chromista & other protists:
- Algae-like organisms
- Marine protists
- Diatoms and other microalgae
Bacteria & Archaea:
- Prokaryotic organisms
- Lower coverage due to rapid discovery rates
- Expanding through specialist databases
Protozoa:
- Protist coverage expanding through integration of specialist resources
Coverage Quality Assessment
Well-Documented Groups (>95% coverage):
- Vertebrates (mammals, birds, reptiles, amphibians, fish)
- Vascular plants (flowering plants, ferns, gymnosperms)
- Major arthropod orders
- Well-studied invertebrate groups
Moderately Documented Groups:
- Many marine organisms
- Fungal species
- Microorganisms
Underrepresented Groups:
- Many arthropod groups (nematodes, mites, parasitoids)
- Microorganisms (bacteria, archaea)
- Deep-sea organisms
- Undiscovered and poorly-documented taxa
Data Sources and Integration
Integration Model
The Catalogue operates through ChecklistBank, a sophisticated data repository infrastructure that ingests diverse taxonomic data sources in multiple formats and standardizes them into consistent data models.
Contributing Databases
Peer-Reviewed Taxonomic Databases:
165 major global, regional, and thematic taxonomic databases representing specialist institutions worldwide, including:
- ITIS (Integrated Taxonomic Information System)
- World Flora Online
- Species Files Group databases
- World Register of Marine Species (WoRMS)
- Fauna Europaea
- Regional and national taxonomic authorities
Data Source Diversity:
- 59,668 total checklists and data sources
- Global databases for major organism groups
- Regional taxonomic compilations
- Specialist databases for specific taxa
- Museum specimen records
- Literature-derived taxonomic information
Data Integration Pipeline
Processing Workflow:
- Raw data ingestion from diverse sources in various formats
- Conversion and standardization to Catalogue of Life Data Package (ColDP) format
- Quality control and validation processes
- Expert verification for Base Release (annual)
- Programmatic integration for Extended Release
- Publication via ChecklistBank and COL portals
- DOI assignment and version control
Quality Assurance:
- Expert peer review for Base Release accuracy
- Automated validation and consistency checking
- Verification by 500+ global taxonomic specialists
- Citation tracking and reference linkage
- Version control and archival
Data Access Methods
Web Portal Interface
Catalogue of Life Website: https://www.catalogueoflife.org/
- Interactive hierarchical taxonomy browser starting from kingdom level
- Expandable taxonomy tree for visual exploration of classification
- Full-text search for species names, synonyms, and common names
- Multi-language support for common names
- Direct access to taxonomic relationships and parent-child linkages
- Free, public access without registration
Application Programming Interfaces
Primary API: https://api.catalogueoflife.org/
- RESTful JSON-based API
- Multiple endpoints for different data queries
- Programmatic species lookup
- Taxonomic hierarchy navigation
- Synonym and common name searching
- Authentication via GBIF user accounts
- Documentation and examples provided
ChecklistBank API: http://webservice.catalogueoflife.org/
- Access to underlying checklist data
- Complex query support
- JSON responses
- Integration-ready for external applications
Data Download Options
Download Portal: https://www.catalogueoflife.org/data/download
Supported Formats:
- Catalogue of Life Data Package (ColDP) - Recommended for data exchange
- Tabular CSV/TSV format with metadata
- Based on Frictionless Data principles
- Specific support for taxonomic relationships
- ZIP file structure
- Darwin Core Archive (DwC-A) - For compatibility with biodiversity systems
- Star schema with core table
- Metadata and EML documentation
- Integration with GBIF and other biodiversity portals
- Annual Checklist Exchange Format (ACEF) - Legacy format
- Text-based format for taxonomic data
- Supported for compatibility
- TextTree Format - Simple hierarchy representation
- Human-readable text format
- Parent-child relationships
- MySQL Database Dumps - For technical users
- Complete database structure
- For local deployment and analysis
Download Flexibility:
- Complete Catalogue or specific taxonomic subtrees
- Kingdom-specific extracts
- Region-specific subsets
- Partial downloads with root taxon specification
- Historical versions from 2019 onward
Release Management
Release Types and Schedules:
- Annual Releases:
- Full annual version released yearly (most recent: July 2025)
- Base Release: Expert-curated, non-overlapping checklists emphasizing accuracy
- Extended Release (COL XR): Broader coverage incorporating ~60,000 sources
- Long-term archival with permanent DOI
- Monthly Updates:
- Regular releases between annual versions
- Continuous improvements and updates
- Maintained in ChecklistBank for approximately one year
- Not guaranteed permanent archival
- Version Control:
- Unique DOI for every release (complete catalogue and individual checklists)
- Individual species names have unique identifiers
- Citation tracking and attribution
Catalogue of Life Data Package (ColDP)
Primary Exchange Format:
- Tabular CSV or tab-separated values (TSV)
- ZIP file structure with metadata
- Specifically designed for taxonomic data
- Based on Frictionless Data principles
- Overcomes limitations of Darwin Core Archive for taxonomy
ColDP Components:
- Three core object types: taxa, names, and references
- Support for accepted names and synonyms
- Nomenclatural status tracking
- Taxonomic rank designation
- Parent-child relationship definition
- Author and year attribution
- Literature citations (BibTeX and CSL-JSON)
- Common names in multiple languages
Data Fields:
- Taxon identifiers (taxonID)
- Scientific names
- Authorship information
- Nomenclatural status (accepted, synonym, etc.)
- Taxonomic rank (Kingdom through subspecies)
- Parent taxon reference
- Literature citations and references
- Common names (multiple languages)
Taxonomic Standards
Linnaean Classification:
- Hierarchical ranks from domain (or superkingdom) to infraspecific levels
- Standard ranks: Kingdom, Phylum, Class, Order, Family, Genus, Species
- Infraspecific ranks: Subspecies and other sub-specific categories
- Support for non-standard ranks where needed
Name Management:
- Standardized scientific nomenclature
- Synonym tracking and linking
- Common names in multiple languages
- Literature citations for all names
- Full nomenclatural history
Relationship to Darwin Core:
- ColDP developed to overcome DwC-A limitations for taxonomy
- Compatible with Darwin Core terminology where applicable
- More intuitive for taxonomic specialists
- Better support for complex relationships
Version Control and Citation
- Every release receives unique DOI identifier
- Every contributing checklist has its own DOI
- Individual species can be cited with specific identifiers
- Version history maintained and accessible
- Recommended for scholarly publications
Technical Architecture
Hosting and Infrastructure
Administrative Host: Global Biodiversity Information Facility (GBIF) Secretariat
- Primary institutional support as of January 2024
- Infrastructure and administrative services
- Long-term preservation and archival
Co-Chair Institutions:
- Naturalis Biodiversity Center (Leiden, Netherlands) - Operations and coordination
- Illinois Natural History Survey (Champaign-Urbana, USA) - Operations and technical support
Physical Infrastructure:
- Hosted on GBIF infrastructure
- Designed for global-scale access
- Scalable architecture supporting millions of species records
- High availability and reliability
Technical Stack
Backend Architecture:
- Framework: Dropwizard Java application
- Build System: Maven-based modular structure
- API Technology: Shaded JARs using Dropwizard framework
- Database: Support for various backend databases
- APIs: RESTful JSON-based web services
Software Components:
- ChecklistBank System:
- Central repository for taxonomic datasets
- Data standardization and validation
- Publication and discovery platform
- GitHub: github.com/CatalogueOfLife/checklistbank
- Backend Application:
- Core application logic
- API generation and exposure
- Data processing pipeline
- GitHub: github.com/CatalogueOfLife/backend
- Data Format Specification:
- ColDP format documentation
- Validation tools and schemas
- GitHub: github.com/CatalogueOfLife/coldp
Data Processing Pipeline:
- Raw checklist ingestion from diverse sources
- Format standardization to ColDP
- Quality control and validation
- Expert review (for Base Release)
- Programmatic integration (for Extended Release)
- Publication via APIs and download services
- DOI assignment and archival
Use Cases and Applications
Research Applications
Taxonomy and Systematics:
- Foundational reference for phylogenetic studies
- Standardized taxonomic nomenclature verification
- Species identification and classification
- Biodiversity inventory and assessment
- Comparative biology and systematics research
Biodiversity Modeling:
- Species richness estimation and comparison
- Habitat distribution prediction
- Ecological pattern analysis
- Extinction risk and threat modeling
- Biogeographic studies
Genomic and Molecular Research:
- Integration with DNA barcode databases (BOLD)
- Genomic data linking to taxonomic classification
- Molecular species identification
- Standardization for metabarcoding studies
Conservation Applications
IUCN Red List Integration:
- Taxonomic reference for 172,000+ assessed species
- Species nomenclature standardization
- Threat assessment consistency
- Conservation priority identification
Conservation Planning:
- Species distribution mapping
- Biodiversity hotspot identification
- Habitat assessment and monitoring
- Protected area planning and evaluation
- Invasive species tracking
Policy Implementation:
- Convention on Biological Diversity (CBD) support
- Convention on International Trade in Endangered Species (CITES) enforcement
- Convention on Migratory Species (CMS) compliance
- IPBES (Biodiversity and Ecosystem Services) assessments
- Kunming-Montreal Global Biodiversity Framework implementation
Data Integration Applications
Global Biodiversity Platforms:
- GBIF (Global Biodiversity Information Facility) - Backbone taxonomy
- Encyclopedia of Life (EOL) - Species information synthesis
- Biodiversity Heritage Library - Literature integration
- National and regional biodiversity portals
Collection Management:
- Museum specimen cataloging and standardization
- Herbarium organization and classification
- Taxonomic standards for institutional collections
Environmental Management:
- Government species regulation and enforcement
- Environmental impact assessments
- Biosecurity and quarantine applications
- Regulatory compliance with trade agreements
Educational and Citizen Science:
- Species identification tools and guides
- Biodiversity education and training
- Standardization for crowdsourced observations
- Public engagement in taxonomy and biodiversity
Organizational Structure
Governance
Core Leadership:
- Naturalis Biodiversity Center (Leiden, Netherlands) - Co-chair and operational coordination
- Illinois Natural History Survey (Champaign-Urbana, USA) - Co-chair and technical support
- GBIF Secretariat - Administrative host (as of 2024)
Expert Community:
- Over 500 global taxonomic experts
- Hundreds of specialist database contributors
- Expert committees for major organism groups
- Peer review for Base Release curation
Contributing Institutions:
- 165 peer-reviewed taxonomic database organizations
- Specialist institutions worldwide
- Regional taxonomic authorities
- University-based taxonomy programs
Founding Organization: Species 2000
History and Mission:
- Created in 1997 by Frank Bisby and colleagues (University of Reading, UK)
- Consortium of taxonomic database organizations
- Parent organization that created Catalogue of Life (2001)
- Oversees COL strategic direction and governance
Related Organizations:
- LifeWatch - European research infrastructure partner
- Smithsonian Institution - Supporting organization
- Chinese Academy of Sciences - Supporting organization
Maintenance and Development Status
Active Development and Updates
Release Cycle:
- Annual comprehensive releases (most recent: July 2025)
- Monthly intermediate releases throughout the year
- Continuous data updates and improvements
- Regular quality assurance processes
Current Version (2025):
- Released: July 9, 2025
- 2,238,246 living species + 153,000 extinct species
- 5.9+ million scientific names
- 48,766 newly accepted species added
Infrastructure Status:
- Fully operational since December 2020
- Actively maintained with regular updates
- High availability and global accessibility
- Scalable to support growing species databases
Future Development
- Continued transition toward open data licenses (CC-0, CC-BY)
- Expansion of API capabilities
- Increased integration with global biodiversity platforms
- Focus on completeness in underrepresented organism groups
- Development of advanced taxonomic name services
- Support for Kunming-Montreal Global Biodiversity Framework implementation
Standards and Interoperability
- Linnaean Taxonomy - Hierarchical classification standard
- Darwin Core - Biodiversity data standard
- Linked Data Principles - Semantic interoperability
- Open Data Standards - FAIR (Findable, Accessible, Interoperable, Reusable)
- Creative Commons Licensing - Open science principles
Citation and Usage
Catalogue of Life data and services are freely accessible with open licensing encouraging maximum reuse for research, conservation, and educational purposes.
Recommended Citation
For Complete Catalogue:
“Catalogue of Life (2025). Available at: https://www.catalogueoflife.org/”
With DOI:
“Catalogue of Life 2025 annual release [version], released 2025-07-09. Digital Object Identifier: [DOI]”
Three-Level Attribution:
- Credit to complete Catalogue of Life work
- Credit to contributing database/checklist
- Credit to taxonomic expert providing verification
Additional Resources
- Main Website: https://www.catalogueoflife.org/
- API Documentation: https://api.catalogueoflife.org/
- ChecklistBank Repository: https://checklistbank.org/
- Data Downloads: https://www.catalogueoflife.org/data/download
- GitHub Organization: https://github.com/CatalogueOfLife
- Citation Guidelines: https://www.catalogueoflife.org/howto/cite
- Data Usage Policy: https://www.catalogueoflife.org/about/colusage
- Naturalis Biodiversity Center: https://www.naturalis.nl/
- Illinois Natural History Survey: https://www.inhs.illinois.edu/
- GBIF Secretariat: https://www.gbif.org/
Catalogue of Life continues to serve as the global standard for species taxonomy and nomenclature, providing essential infrastructure for biodiversity research, conservation decision-making, and the implementation of international agreements to protect Earth’s biological diversity.
Automated Evaluation