surechembl

surechembl is a Data Source.

SureChEMBL is a freely available, large-scale resource of chemical compounds extracted from the patent literature through automated text- and image-mining. Hosted by EMBL-EBI, it links millions of annotated chemical structures to the patent documents in which they appear, enabling search and discovery of chemistry disclosed in patents. The resource is updated continuously and made available for both interactive search and bulk download.

License

CC-BY-4.0

Homepage

surechembl

Repository

Unknown

Infores ID

Unknown

FAIRsharing ID

Unknown

Product Summary

Products

From this Resource
ID | Name | URL | Category | Format | Description
surechembl.bulk-data | SureChEMBL Bulk Data | bulk_data | Product |  | Bulk download of the entire annotated...
surechembl.search | SureChEMBL Web Search Interface | www.surechembl.org | GraphicalInterface | http | Web-based search interface for SureCh...
surechembl.docs | SureChEMBL Documentation | surechembl | DocumentationProduct | http | Official SureChEMBL documentation des...
From other Resources
ID | Name | URL | Category | Format | Relation | Description
biobtree.api | BioBTree REST API | api | ProgrammingInterface | http | had primary source | REST API for searching identifiers an...
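
The two product listings above can be read as a small set of records. The following is a minimal Python sketch of that structure, not the registry's actual schema: the `Product` class, its field names, and the helper functions are hypothetical and simply mirror the column headers. It also assumes that a product ID is namespaced as `resource.product`, which is inferred from the IDs shown and not stated by the registry. URLs and descriptions are left out because the page shows them only as link text or in truncated form.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Product:
    """One row of the product listing; fields mirror the table columns."""
    id: str
    name: str
    category: str
    format: Optional[str] = None    # None where the listing shows no format
    relation: Optional[str] = None  # set only for products from other resources


# Values copied from the listings above. URLs and descriptions are omitted
# because the page shows them only as link text or truncated strings.
PRODUCTS = [
    Product("surechembl.bulk-data", "SureChEMBL Bulk Data", "Product"),
    Product("surechembl.search", "SureChEMBL Web Search Interface",
            "GraphicalInterface", "http"),
    Product("surechembl.docs", "SureChEMBL Documentation",
            "DocumentationProduct", "http"),
    Product("biobtree.api", "BioBTree REST API", "ProgrammingInterface",
            "http", relation="had primary source"),
]


def owned_by(products: List[Product], resource_id: str) -> List[Product]:
    """Return products whose ID sits under the given resource prefix.

    Assumes the 'resource.product' ID convention inferred from the listing.
    """
    prefix = resource_id + "."
    return [p for p in products if p.id.startswith(prefix)]


def from_other_resources(products: List[Product]) -> List[Product]:
    """Return products that carry a relation back to this resource."""
    return [p for p in products if p.relation is not None]


if __name__ == "__main__":
    for product in owned_by(PRODUCTS, "surechembl"):
        print(product.id, product.category)
```

Keeping the relation as an optional field lets one record type cover both listings, so the "From this Resource" and "From other Resources" views become two filters over the same list.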

Details

SureChEMBL

SureChEMBL is a freely available, large-scale resource of chemical compounds extracted from the patent literature. Compounds are identified automatically through text- and image-mining of patent documents, then chemically annotated and linked back to the patents in which they appear. The resource is hosted and maintained by EMBL-EBI.

In KG-Registry, the SureChEMBL products point to the live web search interface, the bulk Parquet data distributed through the EMBL-EBI FTP server, and the official documentation. The data are made available under a CC BY 4.0 license, though individual compounds may carry additional restrictions imposed by the original data owners.

Created: June 15, 2026 | Last modified: June 15, 2026