Organic/inorganic chemistry, analytical chemistry, electrochemistry, molecular properties, chemical reactions
2,033 datasets
Dr. P.S. Jagadeesh Kumar's paper discusses a wavelet sub-band coding method for lossless compression of compound images, such as computer screen captures. The work addresses the challenge of transmitting real-time screen video data, where an 800x600 true color frame is 1.44 MB and 85 frames per second produces over 100 MB of data per second. Implementation results show excellent visual quality for text in compressed images.
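The frame-size figures in this entry can be checked with a short calculation. This is a minimal sketch that assumes "true color" means 24 bits (3 bytes) per pixel and that MB means 10^6 bytes; the function names are illustrative and do not come from the paper.

```python
def frame_bytes(width: int, height: int, bytes_per_pixel: int = 3) -> int:
    """Uncompressed size of one frame (assumes 3 bytes per pixel for true color)."""
    return width * height * bytes_per_pixel


def stream_bytes_per_second(width: int, height: int, fps: int,
                            bytes_per_pixel: int = 3) -> int:
    """Uncompressed data rate for a stream of such frames."""
    return frame_bytes(width, height, bytes_per_pixel) * fps


frame = frame_bytes(800, 600)
rate = stream_bytes_per_second(800, 600, 85)
print(f"{frame / 1e6:.2f} MB per frame")    # 1.44 MB per frame
print(f"{rate / 1e6:.1f} MB per second")    # 122.4 MB per second
```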
Curated human kinase pChEMBL records provide a foundation for quantitative structure-activity relationship modeling. The data appears to be a snapshot from the ChEMBL database, a major bioactivity resource for medicinal chemistry. The specific source, size, and update details are not provided in the input.
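For readers unfamiliar with the scale: a pChEMBL value is the negative base-10 logarithm of a molar potency measurement such as IC50, Ki, or EC50. The conversion below is a minimal sketch; the function name and the choice of nanomolar input are illustrative assumptions and are not taken from this dataset's schema.

```python
import math


def pchembl_from_nanomolar(value_nm: float) -> float:
    """Convert a potency in nanomolar to the -log10(molar) scale.

    Hypothetical helper: the dataset's actual column names and units
    are not described in this entry.
    """
    if value_nm <= 0:
        raise ValueError("potency must be positive")
    molar = value_nm * 1e-9          # nM -> M
    return -math.log10(molar)


# A 10 nM inhibitor sits near 8 on this scale; a 1 uM inhibitor near 6.
print(round(pchembl_from_nanomolar(10.0), 2))
print(round(pchembl_from_nanomolar(1000.0), 2))
```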
Molecular Property Data is a dataset hosted on Kaggle. Its title suggests it contains information on chemical compounds, likely including properties like solubility or toxicity. The specific content, scale, and origin require verification after download.
A UK-based research project by the British Geological Survey aims to generate high-precision isotopic analyses of igneous and metamorphic phases. The data is intended to resolve timescales for metamorphic and magmatic processes using mass spectrometry techniques. Row count, column details, and specific data fields are currently unknown.
From April 21 to November 7, 2023, this dataset contains hourly water and air temperature measurements from an experimental mesocosm facility. Sixteen 1-meter deep mesocosms were filled with water from Windermere, with water temperature recorded every five minutes and aggregated into hourly averages to study the effects of different N:P nutrient ratios.
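The aggregation described here (five-minute readings averaged into hourly values) can be sketched as follows. This is an illustration only: the function name, the input layout, and the sample readings are assumptions, and the dataset's actual processing may differ, for example in how gaps are handled.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean


def hourly_means(readings):
    """Average (timestamp, temperature) pairs into one value per clock hour."""
    buckets = defaultdict(list)
    for ts, temp in readings:
        # Truncate each timestamp to the start of its hour.
        hour = ts.replace(minute=0, second=0, microsecond=0)
        buckets[hour].append(temp)
    return {hour: mean(vals) for hour, vals in sorted(buckets.items())}


# Hypothetical sample readings, not values from the dataset.
sample = [
    (datetime(2023, 4, 21, 10, 0), 10.0),
    (datetime(2023, 4, 21, 10, 5), 12.0),
    (datetime(2023, 4, 21, 11, 0), 14.0),
]
for hour, value in hourly_means(sample).items():
    print(hour.isoformat(), value)
```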
Alain Berthod from Université Claude Bernard Lyon 1 authored a review paper on countercurrent chromatography (CCC). The paper focuses on the theory, column designs, and applications of CCC in analytical chemistry. It covers studies from the last two decades, including applications in plant analysis, pharmaceutical separation, and food analysis.
92.86% of treated Staphylococcus aureus cells showed altered antimicrobial susceptibility, and 35.71% showed altered biochemical reactions compared to control. The dataset likely contains results from a study by Mahendra Kumar Trivedi investigating the effect of biofield treatment on S. aureus, including antimicrobial susceptibility, minimum inhibitory concentration, biochemical reactions, and genotyping data. The study analyzed samples at multiple time points using an automated MicroScan Walk-Away system.
GLORICH contains over 1.27 million hydrochemical samples from more than 18,000 locations worldwide. It combines water chemistry data with catchment characteristics like lithology, climate, and land cover. The database was created by Jens Hartmann and described in a 2014 overview article.
Openr1 Math 220K is a text dataset published on Hugging Face by Neelectric. The title and platform tags suggest it contains mathematical content, likely intended for language model training or evaluation. The dataset was last updated on April 1, 2026.
Diffraction, NMR, and DSC data supporting the finding that zeolite can form in hydrous melts. The dataset was authored by Eric Breynaert and is hosted by Harvard Dataverse.
USGS Spectral Library Version 7 contains spectra measured with laboratory, field, and airborne spectrometers covering wavelengths from 0.2 to 200 microns. It includes samples of specific minerals, plants, chemical compounds, man-made materials, and physically-constructed mixtures. The data release is accompanied by a detailed publication describing the instruments, metadata, and possible artifacts in the spectral measurements.
7 billion small molecules are represented in SMILES notation, accompanied by 28 billion molecular fingerprints including MACCS, ECFP4, FCFP4, and PubChem types. The collection includes pre-constructed USearch indexes for efficient similarity search. It is hosted on AWS Open Data and published by Ash Vardanian under an Apache-2.0 license.
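Similarity search over binary fingerprints such as MACCS or ECFP4 is commonly scored with the Tanimoto coefficient. The sketch below shows that calculation on fingerprints stored as sets of on-bit positions. It is a plain-Python illustration with made-up bit positions; it does not use USearch or any cheminformatics library, and this entry does not state which metric the prebuilt indexes use.

```python
def tanimoto(a: set, b: set) -> float:
    """Tanimoto coefficient: shared on-bits divided by total distinct on-bits."""
    union = len(a | b)
    if union == 0:
        # Both fingerprints empty; returning 1.0 is a convention chosen here.
        return 1.0
    return len(a & b) / union


# Hypothetical on-bit positions for two fingerprints.
fp_query = {1, 2, 3}
fp_candidate = {2, 3, 4}
print(tanimoto(fp_query, fp_candidate))  # 0.5
```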
A discussion paper highlighting trends in chemistry following the genomic revolution. The author, S.D. Varfolomeyev of Lomonosov Moscow State University, covers areas including combinatorial organic chemistry, protein design with unnatural amino acids, and novel nanoanalytical systems. The paper is sourced from the paperswithcode platform.
An Excel spreadsheet authored by Kevin Fergusson for converting raw financial data into stock index returns and continuously compounded interest rates. The dataset is part of the replication files for the research paper 'Stylized Properties of the Stock Index and the Interest Rate Term Structure under the Benchmark Approach'. It was last updated on March 25, 2026.
ChEMBL 36 is a dataset from Kaggle, likely containing bioactivity data for chemical compounds. The specific number of records, columns, and update date are unknown. It is inferred to be a snapshot of the ChEMBL database, a manually curated resource of bioactive molecules.
A glossary of terms used to denote classes of compounds, substituent groups, and reactive intermediates, compiled by G.P. Moss. The overwhelming majority of terms refer to organic compounds, with a few inorganic classes included for convenience. The principal criterion for inclusion is that the class be definable by structure.
AccessScience is an authoritative online resource containing educational material covering major scientific disciplines, including homogeneous catalysis. The dataset likely contains curated scientific explanations and reference information. It was authored by Denis Forster and aggregated via the paperswithcode platform.
Jörgen Vessman of AstraZeneca discusses the correct use of the term 'selectivity' and its distinction from 'specificity' in analytical chemistry. The work provides a definition for selectivity and recommends promoting its use while discouraging the use of 'specificity'. The dataset appears to be a textual discussion or paper sourced from the paperswithcode platform.
perch_v2_no_dft_v3.onnx is an ONNX model file hosted on Kaggle. The dataset's content, authorship, and creation date are unknown. Its specific application and internal structure require inspection after download.
111,308 entries from the Carnegie Mellon Pronouncing Dictionary, originally a resource for phonetic information. The list has been processed to convert entries to lower case and remove duplicate pronunciations. The original source file includes a text header explaining the list's origin and purpose.