DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Chemistry Datasets | DataSalon

All Categories

🧪

Chemistry

Organic/inorganic chemistry, analytical chemistry, electrochemistry, molecular properties, chemical reactions

2,033 datasets

Chemistry

Wavelet-Based Lossless Compression for Computer Screen Images

Dr. P.S. Jagadeesh Kumar's paper discusses a wavelet sub-band coding method for lossless compression of compound images, such as computer screens. The work addresses the challenge of transmitting real-time screen video data, where an 800x600 true color frame is 1.44 MB and 85 frames per second produces over 100 MB of data. Implementation results show excellent visual quality for text in compressed images.

ImageCoding Social SciencesComputer ScienceLossless CompressionComputer VisionWavelet TransformArtificial IntelligenceComputer Graphics ImagesSoftwareLossless CodingComputer HardwareData Compression+1

0 views

Chemistry

ChEMBL Human Kinase QSAR Snapshot 2026: Bioactivity Records for Selectivity Analysis

Curated human kinase pChEMBL records provide a foundation for quantitative structure-activity relationship modeling. The data appears to be a snapshot from the ChEMBL database, a major bioactivity resource for medicinal chemistry. The specific source, size, and update details are not provided in the input.

TabularQsarChemblDrug DiscoveryHuman Kinase+1

0 views

Chemistry

Molecular Property Data for Chemical Compound Analysis

Molecular Property Data is a dataset hosted on Kaggle. Its title suggests it contains information on chemical compounds, likely including properties like solubility or toxicity. The specific content, scale, and origin require verification after download.

TabularChemistryComputational ChemistryMolecular Properties+1

0 views

Chemistry

High-Precision Isotopic Analysis Data for Orogenic Processes

A UK-based research project by the British Geological Survey aims to generate high-precision isotopic analyses of igneous and metamorphic phases. The data is intended to resolve timescales for metamorphic and magmatic processes using mass spectrometry techniques. Row count, column details, and specific data fields are currently unknown.

TectonicsNerc Ddc+1

0 views

Chemistry

Hourly Water and Air Temperature Data from 16 Mesocosms in 2023

From April 21 to November 7, 2023, this dataset contains hourly water and air temperature measurements from an experimental mesocosm facility. Sixteen 1-meter deep mesocosms were filled with water from Windermere, with water temperature recorded every five minutes and aggregated into hourly averages to study the effects of different N:P nutrient ratios.

Water TemperatureMesocosm ExperimentAquacosm PlusEnvironmental Monitoring Facilities+1

0 views

Chemistry

Countercurrent Chromatography Review: Applications in Natural Products and Pharmaceuticals

Alain Berthod from Université Claude Bernard Lyon 1 authored a review paper on countercurrent chromatography (CCC). The paper focuses on the theory, column designs, and applications of CCC in analytical chemistry. It covers studies from the last two decades, including applications in plant analysis, pharmaceutical separation, and food analysis.

TextLiquid ChromatographyCountercurrent ChromatographyEngineeringThermodynamicsCountercurrent ExchangeOrganic ChemistryAnalytical ChemistryHigh Performance Liquid ChromatographyLiquid PhaseHealthcareMechanical EngineeringChemical AnalysisChemistryPhase MatterSupercritical Fluid ChromatographyChromatographyStationary PhaseCentrifugal Force+1

0 views

Chemistry

Antibiogram and Biochemical Reactions of Biofield-Treated Staphylococcus aureus

92.86% of treated Staphylococcus aureus cells showed altered antimicrobial susceptibility, and 35.71% showed altered biochemical reactions compared to control. The dataset likely contains results from a study by Mahendra Kumar Trivedi investigating the effect of biofield treatment on S. aureus, including antimicrobial susceptibility, minimum inhibitory concentration, biochemical reactions, and genotyping data. The study analyzed samples at multiple time points using an automated MicroScan Walk-Away system.

TabularAntibioticsBacteriaMinimum Inhibitory ConcentrationGenotypingGeneticsBiologyMicrobiologyAntimicrobialGenotypeChemistryStaphylococcus AureusLarge ScaleBiochemistryGeneAntimicrobial ResistanceAntibiogramAntibiotic Resistance+1

0 views

Chemistry

GLORICH: Global River Chemistry Database with 1.27 Million Samples

GLORICH contains over 1.27 million hydrochemical samples from more than 18,000 locations worldwide. It combines water chemistry data with catchment characteristics like lithology, climate, and land cover. The database was created by Jens Hartmann and described in a 2014 overview article.

TabularGeospatialEnvironmental scienceComputer ScienceDatabaseCatchment CharacteristicsChemistryRiver ChemistryLarge ScaleHydrochemistry+1

0 views

Chemistry

Openr1 Math 220K: Mathematical Text Corpus for Language Models

Openr1 Math 220K is a text dataset published on Hugging Face by Neelectric. The title and platform tags suggest it contains mathematical content, likely intended for language model training or evaluation. The dataset was last updated on April 1, 2026.

TextOPTIMIZED-PARQUETParquetLibrarypolarsLibrarydaskModalitytextSize Categories100 Kn1 MMathematicsLibrarymlcroissantLibrarydatasetsLanguage ModelRegionusText Corpus+1

0 views

Chemistry

Zeolite Synthesis Data from Low-Temperature Hydrous Melts

Diffraction, NMR, and DSC data supporting the finding that zeolite can form in hydrous melts. The dataset was authored by Eric Breynaert and is hosted by Harvard Dataverse.

Chemistry+1

0 views

Chemistry

USGS Spectral Library Version 7 with Ultraviolet to Far Infrared Measurements

USGS Spectral Library Version 7 contains spectra measured with laboratory, field, and airborne spectrometers covering wavelengths from 0.2 to 200 microns. It includes samples of specific minerals, plants, chemical compounds, man-made materials, and physically-constructed mixtures. The data release is accompanied by a detailed publication describing the instruments, metadata, and possible artifacts in the spectral measurements.

ImagerybasemapsearthcoverImaging SpectroscopyHyperspectral ImagingCentral Minerals And Environmental Resources ScienAsiaGeoscientificinformationMRPMineralogyInfrared imagingAustraliaCmerscEuropeCggscChemical AnalysisAfricaMineral Resources ProgramBiotaEnvironmentCrustal Geophysics And Geochemistry Science CenterAVIRIS+1

0 views

Chemistry

USearch Molecules: 7 Billion Small Molecules with Pre-Built Search Indexes

7 billion small molecules are represented in SMILES notation, accompanied by 28 billion molecular fingerprints including MACCS, ECFP4, FCFP4, and PubChem types. The collection includes pre-constructed USearch indexes for efficient similarity search. It is hosted on AWS Open Data and published by Ash Vardanian under an Apache-2.0 license.

TabularChemical BiologyBiologyFingerprintsLife SciencesDrug DiscoveryMolecular StructuresCheminformaticsLarge ScalePharmaceutical+1

0 views

Chemistry

Bioanalytical Chemistry: Trends in the Postgenomic Era

A discussion paper highlighting trends in chemistry following the genomic revolution. The author, S.D. Varfolomeyev of Lomonosov Moscow State University, covers areas including combinatorial organic chemistry, protein design with unnatural amino acids, and novel nanoanalytical systems. The paper is sourced from the paperswithcode platform.

TextNanotechnologyBiologyBioanalytical ChemistryPostgenomic ChemistryHealthcareComputational BiologyChemistryCharacterization Materials ScienceBiochemistryBioanalysisChromatographyDrug DesignMaterials Science+1

0 views

Chemistry

Stock Index and Interest Rate Conversion Calculations

An Excel spreadsheet authored by Kevin Fergusson for converting raw financial data into stock index returns and continuously compounded interest rates. The dataset is part of the replication files for the research paper 'Stylized Properties of the Stock Index and the Interest Rate Term Structure under the Benchmark Approach'. It was last updated on March 25, 2026.

TabularMathematical SciencesFinanceStock IndexBusiness and ManagementFinancial CalculationsInterest Rates+1

0 views

Chemistry

ChEMBL 36: Bioactivity Data for Drug Discovery

ChEMBL 36 is a dataset from Kaggle, likely containing bioactivity data for chemical compounds. The specific number of records, columns, and update date are unknown. It is inferred to be a snapshot of the ChEMBL database, a manually curated resource of bioactive molecules.

TabularBioactivityMoleculesDrug DiscoveryChemistry+1

0 views

Chemistry

Heterocyclic Compounds Glossary of Organic Chemistry Classes

A glossary of terms used to denote classes of compounds, substituent groups, and reactive intermediates, compiled by G.P. Moss. The overwhelming majority of terms refer to organic compounds, with a few inorganic classes included for convenience. The principal criterion for inclusion is that the class be definable by structure.

TextClass PhilosophyContrast VisionChemical ClassesComputer ScienceMathematicsMineralogyOrganic CompoundsStereochemistryInclusion MineralArtificial IntelligenceChemistryGlossaryPhilosophyTerminologyPrincipal Computer SecurityLinguisticsSubstituent+1

0 views

Chemistry

Homogeneous Catalysis: Scientific Knowledge from AccessScience

AccessScience is an authoritative online resource containing educational material covering major scientific disciplines, including homogeneous catalysis. The dataset likely contains curated scientific explanations and reference information. It was authored by Denis Forster and aggregated via the paperswithcode platform.

TextTrustworthinessInternet PrivacyEpistemologyComputer ScienceKnowledge ManagementData ScienceScientific KnowledgeWorld Wide WebGateway Web PageHomogeneous CatalysisChemistryResource DisambiguationPhysicsPhilosophyCore Optical FiberCore KnowledgeSociology Of Scientific KnowledgeHomogeneousQuality Philosophy+1

0 views

Chemistry

Selectivity in Analytical Chemistry: A Terminological Discussion

Jörgen Vessman of AstraZeneca discusses the correct use of the term 'selectivity' and its distinction from 'specificity' in analytical chemistry. The work provides a definition for selectivity and recommends promoting its use while discouraging the use of 'specificity'. The dataset appears to be a textual discussion or paper sourced from the paperswithcode platform.

TextSelectivityOrganic ChemistryScientific DiscourseAnalytical ChemistryChemistryPhysicsTerminologyTerm Time+1

0 views

Chemistry

perch_v2_no_dft_v3: An ONNX Model File

perch_v2_no_dft_v3.onnx is an ONNX model file hosted on Kaggle. The dataset's content, authorship, and creation date are unknown. Its specific application and internal structure require inspection after download.

MultimodalMachine LearningOnnxModel Weights+1

0 views

Chemistry

Carnegie Mellon Pronouncing Dictionary: 111,308 Lowercased Word Entries

111,308 entries from the Carnegie Mellon Pronouncing Dictionary, originally a resource for phonetic information. The list has been processed to convert entries to lower case and remove duplicate pronunciations. The original source file includes a text header explaining the list's origin and purpose.

TextProgramming LanguageComputer ScienceDictionaryPhilosophyNatural Language ProcessingLinguisticsPronunciationText Corpus+1

0 views

PreviousPage 92 of 102Next