Loading...
Loading...
Drug-target interaction, molecular screening, ADMET, compound databases, pharmaceutical data
532 datasets
A text dataset from Kaggle, likely containing examples of sentiment spam and toxic content. The dataset's author, size, and specific collection period are unknown. Its intended purpose appears to be for analyzing or detecting manipulated sentiment and harmful language.
A Kaggle dataset focused on text analysis, likely containing examples of sentiment spam and toxic language. The dataset's title suggests it involves the linguistic phenomenon of double negation. The dataset's author, organization, and temporal coverage are unknown.
Pharmaceutical outlet locations in the Indian state of Punjab. The dataset is hosted on Kaggle, but its specific size, collection date, and creator are unknown. Columns likely contain identifiers and location details for retail pharmacies.
Punjab CDSL Pharmaceutical Outlets Punjab is a dataset published on Kaggle. The title suggests it contains information about drug stores or pharmacies in the Indian state of Punjab. The specific contents, scale, and origin of the data are not detailed in the provided metadata.
Presenting a Data Management and Sharing Plan (DMS) for a research project involving DelAqua Pharmaceuticals. It describes the scientific data to be generated and/or used and outlines a strategy for managing and sharing project data. The plan was authored by Konstantin Lukianov.
MulTaBench likely contains text data for toxicity classification tasks. The dataset is published on Kaggle, but specific details about its size, origin, and creation date are unknown. Columns suggest it includes text samples and corresponding classification labels.
Goliath Dataset French Toxicity is a text dataset hosted on HuggingFace by author ItsAxel. The dataset's content likely contains French language text annotated for toxicity, as suggested by its title. It was last updated on the HuggingFace platform on 2026-02-21 11:35:24.
A dataset from Kaggle describing an integrated microfluidic platform for high-throughput screening. The dataset likely contains experimental results from screening compounds targeting the BCL-2 protein family. The author, organization, and temporal coverage are unknown.
Two three-week toxicity tests were completed at Davis station in 2009/10 as part of STP project 3217. The dataset assesses the impact of sewage effluent on two key local invertebrate species, Paramoera walkeri and Skenella paludionoides, using standard bioassay protocols. Results provide a baseline for quantifying future changes to effluent discharge and determining safe dilution factors for the near-shore marine environment.
OECD Health Data provides international statistics on pharmaceutical markets. The data likely contains metrics on drug consumption, expenditure, and market structure across member countries. It is published by the Organisation for Economic Co-operation and Development (OECD).
Kaggle hosts a dataset titled 'toxicity'. The dataset likely contains measurements or classifications related to toxic substances. Its specific content, scale, and origin are not detailed in the provided metadata.
Featuring raw data from a manuscript investigating the protective effects of phosphatidylserine-based liposomes encapsulating DMX-5804 against doxorubicin-induced cardiotoxicity. The data was authored by Jessica Tetterton-Kellner and was last updated in February 2026. Specific details on rows, columns, and file formats are unavailable.
Dr. Duke's Phytochemical and Ethnobotanical Databases is a leading repository of plant chemical and usage data, evolving from USDA research and a published handbook. It facilitates searches for plant chemical profiles, biological activities, and ethnobotanical uses, with references to supporting scientific publications. The database is structured to support user-focused browsing and searching.
The Data Management and Sharing Plan for p16INK4a Expression, Chemotherapy Toxicity, and Aging in Women with Breast Cancer outlines the strategy for managing and sharing scientific data generated by the research project. Authored by Hy Muss and harvested by ODUM, the plan describes the intended data but specific details like row count, column names, and file formats are unavailable.
ProteinDrugDB is a research-grade synthetic dataset intended for machine learning-driven drug discovery. The dataset is hosted on Kaggle and is tagged with topics including ML Ethics, Healthcare, and Chemistry. Its specific size, format, and column details are unknown.
Northeast US weekly pharmaceutical sales data from 2018 to 2023. The dataset likely contains sales figures aggregated over time. The author and organization are unknown.
Exocarpium Citri Grandis compounds are analyzed for their potential anti-hyperlipidemia effects using network pharmacology methods. The dataset likely contains molecular interaction data, such as compound-target or pathway relationships, sourced from Kaggle. Its author, organization, and specific size are unknown.
Delivering molecular interaction data intended for virtual screening in drug discovery. The specific number of rows, columns, and data features is not provided in the input.
Encompassing information about certain drug types, intended for binary classification tasks. It is tagged for applications related to health, heart conditions, and drugs and medications.
Results from 11 bioassays conducted in 2014 assess the toxicity of five metals (copper, cadmium, lead, zinc, nickel) on two species of Antarctic marine microalgae. The dataset includes 7 tests with Phaeocystis antarctica and 4 tests with Cryothecomonas armigera, measuring growth rates, cell density, and other physiological parameters under controlled conditions. Data was aggregated by the Australian Antarctic Data Centre (AU_AADC) and published on NASA's Earthdata platform.