glycocoo

is a Ontology.

The Glycoconjugate Ontology (GlycoCoO) is a standard semantic framework for describing and representing glycoproteomics and glycolipidomics data in RDF format. It provides a formal representation of glycoconjugate structures (glycoproteins and glycolipids), their associated metadata including publication information, biological source data, experimental evidence, and abundance ratios. GlycoCoO extends GlycoRDF by creating subclasses of ReferencedCompound including ReferencedGlycoconjugate, ReferencedProtein, and ReferencedLipid, enabling comprehensive annotation of glycan structures attached to proteins and lipids with contextual information such as glycosylation sites, disease associations, tissue and cell line sources, and analytical methods. The ontology supports both complete structural information and compositional data, can represent single glycans or glycoform mixtures at specific sites, and accommodates partially missing site information when no mapping is performed. It has been adopted by major glycoproteomics databases including UniCarbKB, GlyConnect, and GlycoNAVI, enabling federated queries across resources to retrieve integrated information about glycoconjugates from multiple publications and experimental contexts.

Domains

chemistry and biochemistry, biological systems, biomedical

License

Warning: No license entered

Homepage

glycocoo

Repository

Unknown

Infores ID

Unknown

FAIRsharing ID

Unknown

Product Summary

Products

From this Resource
ID Name URL Category Format Description
glycocoo.ontology GlycoCoO OWL Ontology glycocoo.owl (5.5 KB) OntologyProduct owl The GlycoCoO OWL ontology file defini...
glycocoo.wiki GlycoCoO Wiki Documentation wiki DocumentationProduct http GitHub Wiki with developer informatio...
glycocoo.rdf-samples GlycoCoO RDF Sample Data RDF_Sample Product http Sample RDF data files demonstrating G...
glycocoo.sparql-examples GlycoCoO SPARQL Query Examples SPARQL_Query.md (2.4 KB) Product http Example SPARQL queries for querying g...
From other Resources
ID Name URL Category Format Description
ubkg.neo4j UBKG Neo4j Docker Distribution ubkg-downloads.xconsortia.org GraphProduct Turnkey neo4j distributions that depl...
ubkg.csv UBKG Ontology CSV Files ubkg-downloads.xconsortia.org GraphProduct csv Ontology CSV files that can be import...

Details

GlycoConjugate Ontology (GlycoCoO)

Overview

The Glycoconjugate Ontology (GlycoCoO) is a standard semantic framework developed to address the need for digital standards in representing glycoproteins and glycolipids. It provides a comprehensive ontology for describing glycoconjugate structures and their functions, enabling the integration of glycoproteomics and glycolipidomics data within the Semantic Web ecosystem.

Motivation and Background

Recent advances in glycoproteomics protocols have led to a sustainable increase in reporting proteins with their attached glycans and glycosylation sites. However, very few of these reports are deposited into databases due to the absence of digital standards. GlycoCoO addresses this challenge by providing a framework that can:

  • Represent glycans as complete structures or compositions
  • Store single glycans or represent glycoforms on specific glycosylation sites
  • Handle partially missing site information when no site mapping is performed
  • Store abundances or ratios of glycans within a glycoform of a specific site

Ontology Structure

GlycoCoO extends the GlycoRDF ontology by creating specialized subclasses:

Core Classes

  • ReferencedGlycoconjugate: Represents glycoconjugates (glycoproteins and glycolipids) with their associated metadata
  • ReferencedProtein: Describes proteins with glycosylation site information using FALDO (Feature Annotation Location Description Ontology)
  • ReferencedLipid: Represents lipid components of glycolipids
  • Compound: The superclass encompassing glycan structures, typically pointing to GlyTouCan IDs
  • Citation: Publication and bibliographic information using Dublin Core and Bibliographic Ontology
  • Source: Biological source information including tissue, cell line, and taxonomy using Uberon and Cell Ontology
  • Evidence: Experimental evidence with subclasses for analytical methods (NMR, LC, MS)

Relationship to GlycoRDF

GlycoCoO reuses and extends GlycoRDF concepts by maintaining the ReferencedCompound mechanism for linking compounds with their evidence, source, and citation information. This approach allows the same glycan to be described with different metadata sets from different publications or experimental contexts.

Implementation and Adoption

GlycoCoO has been adopted by major glycoproteomics databases:

UniCarbKB

  • Provides approximately 1,530 glycoprotein entries
  • Contains over 4,000 annotated glycosylation sites
  • Includes 4,000 glycan structures (partial and fully defined)
  • SPARQL endpoint: http://sparql.unicarbkb.org

GlyConnect

  • Features 22,600 glycosylation sites on roughly 2,200 UniProtKB-referenced glycoproteins
  • Contains almost 4,000 glycans and 3,400 glycosylation sites
  • Supported by 900 articles
  • Includes 3,300 human N- and O-glycopeptides
  • SPARQL endpoint: https://glyconnect.expasy.org/rdf

GlycoNAVI

  • Stores information on glycan abundance ratios of glycoforms
  • Contains 1,297 glycans, 178 abundance ratio data points
  • Includes 102 disease states, 9 tissues, and 178 articles
  • SPARQL endpoint: https://sparql.glyconavi.org/sparql

Applications and Benefits

The adoption of GlycoCoO enables:

  1. Federated Queries: Single SPARQL queries can retrieve integrated information from multiple databases simultaneously
  2. Data Integration: Common glycans and metadata can be identified across different resources
  3. Disease Association Analysis: Disease annotations and their citations can be accumulated from diverse sources
  4. Source Tracking: Biological source information including tissues, cell lines, and organisms can be traced across databases
  5. Publication Integration: Overlapping and unique publications can be identified across resources

Technical Features

  • Semantic Web Compatibility: Uses RDF triples for data representation enabling SPARQL queries
  • Ontology Reuse: Integrates established ontologies including UniProt Core, Bibliographic Ontology, DCMI, HUPO-PSI-MS, Uberon, Cell Ontology, Gene Ontology, Disease Ontology, and Cellosaurus
  • Flexible Data Model: Accommodates both complete structural information and partial/compositional data
  • Site-Specific Annotation: Supports detailed annotation of glycosylation sites using FALDO for precise position information
  • Abundance Data: Can represent quantitative information about glycan abundance and ratios

Development and Maintenance

GlycoCoO was developed through international collaboration:

  • Originated at the Japanese BioHackathon 2012
  • Further developed at the Glyco-BioHackathon China 2013
  • Maintained on GitHub with contributions from multiple developers
  • Available through BioPortal for ontology browsing
  • Version controlled using Semantic Versioning

The ontology is continuously updated and maintained by a consortium of glycoinformatics researchers from institutions worldwide, including the National Bioscience Database Center (NBDC) of JST, Glycoinformatics Consortium (GLIC), Institute for Glycomics (Griffith University), and Swiss Institute of Bioinformatics.

Standardization and Prefixes

GlycoCoO uses standardized prefixes for consistent data representation:

  • gco: - http://purl.jp/bio/12/glyco/conjugate#
  • glycan: - http://purl.jp/bio/12/glyco/glycan#
  • faldo: - http://biohackathon.org/resource/faldo#
  • sio: - http://semanticscience.org/resource/
  • codao: - http://purl.glycoinfo.org/ontology/codao#

Future Directions

The GlycoCoO framework continues to evolve with plans for:

  • Integration with the Protein Ontology (PRO) to capture glycoproteoform complexity
  • Expansion to lipid ontologies for comprehensive glycolipid representation
  • Development of a glycoconjugate repository with unique accession numbers
  • Enhanced support for heterogeneity in site-specific protein glycosylation
  • Integration with GlyGen for comprehensive glycoscience data resources

Is this information incorrect or incomplete? Request an update.

Created: October 29, 2025 | Last modified: October 29, 2025