COHD is a Data Source.
The Columbia Open Health Data (COHD) API provides access to observed clinical frequencies and co-occurrence frequencies derived from electronic health records at Columbia University Medical Center. The database contains counts and frequencies of conditions, procedures, drug exposures, and patient demographics, coded under the OHDSI common data model, along with statistical associations between clinical concepts. To protect patient privacy, all concepts with a count ≤ 10 were excluded, and the remaining counts were randomized using a Poisson distribution. COHD offers multiple datasets, including 5-year (2013-2017) and lifetime data, both in hierarchical and non-hierarchical forms, plus beta temporal co-occurrence data.
clinical, health, public health, precision medicine
Warning: No license entered
infores:cohd
Unknown
| ID | Name | URL | Category | Format | Description |
|---|---|---|---|---|---|
| cohd.api | COHD API | api | ProgrammingInterface | http | RESTful API providing programmatic ac... |
| cohd.portal | COHD Web Interface | cohd.io | GraphicalInterface | http | Interactive web interface for explori... |
| cohd.notebooks | COHD API Examples | COHD_API_Example.ipynb (2.7 MB) | ProcessProduct | http | Python Jupyter notebooks demonstratin... |
| cohd.docs ⚠ | COHD Documentation | api | DocumentationProduct | http | API documentation covering endpoint d... |
| ID | Name | URL | Category | Format | Description |
|---|---|---|---|---|---|
| translator.cohd.graph | Translator COHD KGX Graph | latest | GraphProduct | kgx-jsonl | KGX JSONL graph package for Columbia ... |
| translator.translator_kg.graph | Translator Aggregate KGX Graph | latest | GraphProduct | kgx-jsonl | Aggregated KGX JSONL graph package co... |
The Columbia Open Health Data (COHD) project provides an API for accessing observed clinical frequencies and co-occurrence patterns from electronic health records at Columbia University Medical Center. The database is built on the OHDSI (Observational Health Data Sciences and Informatics) common data model and provides valuable real-world evidence about the prevalence of medical conditions, procedures, and drug exposures in clinical practice.
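As a rough illustration of programmatic access, the sketch below builds a request URL and, optionally, fetches the JSON response. The base URL `http://cohd.io/api`, the path `frequencies/singleConceptFreq`, and the parameters `dataset_id` and `q` are assumptions made for this example and are not confirmed by this page; the concept ID is purely illustrative. Consult the COHD API documentation for the actual endpoints.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: base URL inferred from the cohd.io entry listed on this page.
BASE_URL = "http://cohd.io/api"


def build_cohd_url(path, **params):
    """Join the base URL, an endpoint path, and URL-encoded query parameters."""
    url = f"{BASE_URL}/{path.strip('/')}"
    query = urlencode(params)
    return f"{url}?{query}" if query else url


def fetch_json(url, timeout=30):
    """Send a GET request and decode the JSON body (requires network access)."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


# Hypothetical endpoint path, parameters, and concept ID, for illustration only.
example_url = build_cohd_url(
    "frequencies/singleConceptFreq", dataset_id=1, q=4196636
)
```

Keeping URL construction separate from the network call makes the request easy to inspect or log before anything is sent.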
1. 5-Year Non-Hierarchical Dataset (2013-2017)
2. Lifetime Non-Hierarchical Dataset
3. 5-Year Hierarchical Dataset (2013-2017)
4. Temporal Co-occurrence Data (BETA)
COHD provides access to:
All clinical concepts are coded using standard concept IDs from the OMOP Common Data Model:
The API includes mapping functionality to translate between vocabularies and other ontologies using the EMBL-EBI Ontology Xref Service (OxO).
To protect patient privacy:
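The privacy measures described for COHD (excluding concepts with a count ≤ 10 and randomizing counts with a Poisson distribution) can be illustrated with a minimal sketch. This is not COHD's actual pipeline: using the true count as the Poisson mean, the function names, and the sample concept labels are all assumptions made for the example.

```python
import math
import random


def poisson_sample(lam, rng):
    """Draw one Poisson(lam) variate using Knuth's multiplication method.

    lam is processed in chunks of at most 500 so exp(-chunk) never underflows;
    a sum of independent Poisson draws is Poisson with the summed rate.
    """
    total = 0
    remaining = float(lam)
    while remaining > 0:
        chunk = min(remaining, 500.0)
        remaining -= chunk
        threshold = math.exp(-chunk)
        product = rng.random()
        while product > threshold:
            total += 1
            product *= rng.random()
    return total


def protect_counts(raw_counts, min_count=10, seed=None):
    """Drop concepts with count <= min_count, then perturb the remaining counts.

    Assumption: each released count is one draw from Poisson(true count).
    """
    rng = random.Random(seed)
    protected = {}
    for concept, count in raw_counts.items():
        if count <= min_count:
            continue  # too rare to release
        protected[concept] = poisson_sample(count, rng)
    return protected


if __name__ == "__main__":
    # Hypothetical concept labels and counts.
    raw = {"concept_a": 5, "concept_b": 10, "concept_c": 11, "concept_d": 2000}
    print(protect_counts(raw, seed=0))
```

Because the noise is random, released counts differ from the true counts by roughly the square root of the count, so relative error shrinks as concepts become more common.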
Dataset descriptions, concept counts, and database statistics
Common vocabulary for name and concept identifier mapping between standard terminologies
Access to counts and frequencies of:
Inferred associations between concepts using:
COHD was developed at the Columbia University Department of Biomedical Informatics through collaboration between:
This resource has the Information Resource identifier: infores:cohd
Created: November 04, 2025 | Last modified: November 04, 2025