is a Knowledge Graph.
A weighted heterogeneous knowledge graph containing four types of entities (Tumor, Biomarker, Drug, and ADR) extracted from MEDLINE corpus for adverse drug reaction discovery in antitumor drugs. TBKG uses a naive Bayesian model to explore correlations and provides explainable predictions through tumor-biomarker-drug pathways. The knowledge graph contains 1,179 tumors, 2,550 biomarkers, 1,806 drugs, and 756 ADRs with six types of relationships totaling 139,254 edges.
drug discovery, pharmacology, biomedical, clinical, systems biology
Unknown
Unknown
| ID | Name | URL | Category | Format | Description |
|---|---|---|---|---|---|
| tbkg.data | TBKG Knowledge Graph Data | full#supplementary-material | GraphProduct | mixed | Weighted heterogeneous knowledge grap... |
| tbkg.osimertinib_case_study | TBKG Osimertinib ADR Case Study Data | full#supplementary-material | Product | mixed | Clinical validation dataset with calc... |
TBKG (Tumor-Biomarker Knowledge Graph) is an explainable knowledge graph-based approach for discovering potential adverse drug reactions (ADRs) of antitumor drugs. The system extracts entities from biomedical literature (MEDLINE database with 22+ million citations) using the UMLS Metathesaurus 2020AA and Apache cTAKES natural language processing tool.
| Importance Measure: log(p(biomarker | tumor present)) - log(p(biomarker | tumor absent)) |
Osimertinib case study demonstrated:
Created: November 22, 2025 | Last modified: May 29, 2026