ChEMBL 36 is a dataset from Kaggle, likely containing bioactivity data for chemical compounds. The specific number of records, columns, and update date are unknown. It is inferred to be a snapshot of the ChEMBL database, a manually curated resource of bioactive molecules.
Use Cases
- Train a model to predict compound bioactivity (inferred from domain, verify after download)
- Perform similarity search for lead compound optimization (inferred from domain, verify after download)
- Analyze structure-activity relationships (SAR) (inferred from domain, verify after download)
Strengths
- Published on Kaggle.
- Based on the ChEMBL database, a major public resource for drug discovery.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
Provenance
- Source
- ChEMBL database