is a Knowledge Graph.
KG-Predict is a knowledge graph computational framework for drug repurposing that integrates multiple types of genotypic and phenotypic data. The framework constructs GP-KG (Genotype-Phenotype Knowledge Graph), containing 1,246,726 associations between 61,146 biomedical entities from various databases. KG-Predict uses graph embedding methods to learn low-dimensional representations of entities and relations, enabling inference of new drug-disease interactions. The system has been validated for identifying repositioned candidate drugs, particularly for Alzheimer's disease, achieving high performance metrics (AUROC = 0.981, AUPR = 0.409) and successfully prioritizing FDA-approved and clinical trial anti-AD drugs.
drug discovery, systems biology, biomedical, pharmacology, genomics, phenotype
Unknown
Unknown
Unknown
| ID | Name | URL | Category | Format | Description |
|---|---|---|---|---|---|
| kg-predict.gpkg | GP-KG Knowledge Graph Data | GP_KG.txt (46.2 MB) | GraphProduct | tsv | GP-KG tab-delimited knowledge graph c... |
| kg-predict.code ⚠ | KG-Predict Code | code | ProcessProduct | http | Python implementation of KG-Predict f... |
| kg-predict.raw_data | GP-KG Raw Data Documentation | Raw_Data.docx (17.0 KB) | DocumentationProduct | http | Source data documentation for the GP-... |
| kg-predict.ad_predictions | AD Drug Predictions | case_study_predict_results.csv (43.0 KB) | Product | csv | Alzheimer's disease case study drug r... |
| kg-predict.ad_nct_evidence | AD National Clinical Trial Evidence | ad_nct_evidence.csv (1.7 KB) | Product | csv | Alzheimer's disease National Clinical... |
KG-Predict is a computational framework designed for drug repurposing through knowledge graph embedding and inference. The system addresses the challenge of integrating and analyzing complex, heterogeneous biomedical data to identify new therapeutic applications for existing drugs.
The framework constructs GP-KG (Genotype-Phenotype Knowledge Graph), a comprehensive knowledge graph that integrates multiple types of entities and relations from various genotypic and phenotypic databases. GP-KG contains 1,246,726 associations between 61,146 biomedical entities, including drugs, diseases, genes, proteins, pathways, and phenotypes.
KG-Predict aggregates heterogeneous topological and semantic information from GP-KG to learn low-dimensional representations of entities and relations using advanced graph embedding techniques. These learned representations enable the inference of novel drug-disease interactions for drug repurposing applications.
In cross-validation experiments, KG-Predict demonstrated superior performance compared to other state-of-the-art graph embedding methods:
KG-Predict was applied to identify novel repositioned candidate drugs for Alzheimer’s disease (AD), achieving:
GP-KG integrates data from multiple genotypic and phenotypic databases, including information about:
The framework is implemented in Python and includes:
Created: November 22, 2025 | Last modified: June 02, 2026