Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
39,933 datasets
Benthic chamber measurements from the Australian Ocean Data Network quantify solute exchange rates between sediment and water in Port Phillip Bay. Data from the summers of 1994 and 1995 includes fluxes for oxygen, ammonium, nitrate, nitrite, phosphate, silicate, TCO2, and alkalinity. The dataset identifies four distinct bay regions and calculates that benthic recycling accounted for 63% and 72% of the annualized nitrogen and phosphorus inputs, respectively.
NCAR King Air aircraft collected detailed atmospheric boundary layer data during the FIFE experiments in 1987 and 1989. Measurements include vertical and horizontal wind gusts, humidity, potential temperature, and radiation parameters to study convective boundary layer processes. The dataset is managed by the National Aeronautics and Space Administration.
A dataset containing results from a novel thermodynamic integration scheme for calculating Gibbs free energies of crystalline solids. The data, authored by Karel L. K. De Witte and shared on figshare, compares the new NPT-based method with conventional approaches using case studies on ice polymorphs and CsPbI3. The dataset was last updated on June 4, 2026.
New South Wales regional water strategy areas define 14 regions for long-term water planning and management. The boundaries are based on surface water hydrology, statutory water plans, and socio-economic factors, and were developed in partnership with local councils, Aboriginal communities, and other stakeholders. The dataset is provided by the NSW Department of Climate Change, Energy, the Environment and Water and was last updated in May 2026.
A dataset from figshare details a series of rationally designed heat shock protein 90 (Hsp90) inhibitor-evodiamine conjugates for colon cancer therapy. The data includes specific in vitro antiproliferative activity (IC50 = 7.7 nM) and in vivo tumor growth inhibition results (TGI up to 72.9%) for a lead compound. It was authored by Yuhang Sun and last updated on 2026-05-28.
NASA/NOAA Suomi NPP VIIRS Water Reservoir Product Monthly Level 3 Global Version 2 provides monthly time series data for 164 global water bodies, including 151 man-made reservoirs and 13 regulated natural lakes. The dataset is derived from satellite remote sensing and models to estimate reservoir surface area, elevation, water storage capacity, evaporation rate, and evaporation volume. It synthesizes 8-day area classifications with meteorological data to produce monthly composites.
2,866 Plasmodium falciparum isolates from Tanzania's Kagera region were genotyped between 2021 and 2023 to track antimalarial drug resistance. Alfred Simkin published this dataset on figshare, showing the R561H mutation's regional prevalence rose from 5.5% to 6.9% over three years, with its first appearance in new districts indicating eastward spread. The data highlights notable variation in mutation prevalence and the co-occurrence of high-level resistance markers.
Lanzhou Freight Center in China provided 114 monthly observations of rail freight volume from 2013 to 2022. The dataset contains results from a prediction model that uses Fuzzy Information Granulation and an Improved Particle Swarm Optimization-Support Vector Machine (IPSO-SVM) algorithm. Author Yunbo Gao published the data on figshare under a CC-BY-4.0 license.
A 2026 quasi-experimental study by Sarah A. Laane on figshare, analyzing data from 370 matched adults (185 with and 185 without mental illness) who underwent 6 months of online brain health training. The dataset likely contains pre- and post-intervention measures of psychological distress, resilience, quality of life, engagement in meaningful activities, and a composite cognitive clarity score.
Daily Level 2 Solar-Induced Fluorescence (SIF) estimates from 1995-07-01 to 2003-06-22 provide a direct measure of chlorophyll activity. Data from the ESA's ERS-2 GOME instrument includes raw and bias-adjusted SIF on an orbital basis for land pixels at 40 km x 320 km resolution, along with quality control and ancillary data. The dataset contains both Version 1 and Version 2 files, with Version 2 adding fields like SIF uncertainties and longitude-latitude corners.
Report-RES-081 from the Ministry of Environment, Housing and Territorial Development contains water quality measurements for specific river basins and stretches in Colombia. The dataset includes sampling dates, geographic coordinates, and parameters like dissolved oxygen, BOD, COD, and fecal coliforms. It is published by www.datos.gov.co and was last updated on 2026-05-25.
A methodological dataset and R package for spatially aware likelihood-based inference on multivariate data. The work develops inside-out cross-covariance models, which are scalable and flexible alternatives to the linear model of coregionalization, and demonstrates performance on synthetic data and colorectal cancer proteomics data. The dataset was authored by Michele Peruzzi and last updated on June 4, 2026.
Jianqing Fan's research dataset provides statistical tests for analyzing structural breaks in large factor models. The dataset, last updated in 2026, includes three two-sample tests for evaluating principal eigenvalues, eigenvalue proportions, and eigenvectors. It demonstrates application using daily returns of S&P 500 stocks to analyze events like the 2008 financial crisis and the 2020 pandemic.
34 peer-reviewed articles published between 2001 and 2024 were synthesized in this scoping review. The dataset, created by Aoyi Li and shared on figshare, analyzes the integration of Virtual Reality and Music Therapy. It delineates technical frameworks, clinical outcomes, and neurobiological mechanisms across diverse patient populations.
A heterodimeric imaging agent synthesized by linking Vancomycin and desferrioxamine B for positron emission tomography (PET) detection of Staphylococcus aureus infections. The dataset likely contains results from biochemical validation, biodistribution studies, and in vivo imaging experiments. The data was authored by Collin E. Merrick and last updated on figshare in May 2026.
A 2026 dataset from figshare by Abilash Rosario Arockiyasamy documents a design and fabrication strategy for 3D-printed elastomeric foam lattices. The dataset includes experimental and simulation results for hierarchical grid-stiffened structures, such as a star-diamond interwoven lattice, achieving a 1.7-fold expansion and a density of 215 kg/m³. The described lattices demonstrated a 600% increase in stiffness and a 1100% increase in strength compared to non-foamed conditions.
August 20-28, 2000 ground measurements were collected at a 1 km by 1 km dry grassland site adjacent to Sua Pan, Botswana, during the SAFARI 2000 Dry Season Aircraft Campaign. The dataset contains leaf area index (LAI) and photosynthetically active radiation (PAR) readings from 135 sample points for LAI and 93 transect points for PAR, used to validate the MISR LAI/FPAR algorithm. Measurements were taken with LAI-2000 and Sunfleck PAR ceptometer instruments under predominantly clear sky conditions.
NASA's 2022 SHIFT campaign provides paired field measurements and airborne imagery for a California estuary. The dataset includes soil salinity samples, fractional cover estimates for four classes, and derived vegetation indices like NDVI and CRSI. Data formats include GeoTIFF, CSV, and shapefiles for analysis.
NASA/NOAA Suomi NPP VIIRS FILDA-2 Modified Combustion Efficiency (MCE) Version 2 swath product (VNP47IMG) provides high-resolution fire monitoring data. It contains 83 variables for fire detection, Fire Radiative Power (FRP), Visible Energy Fraction (VEF), and MCE retrieval at 375-meter resolution in 6-minute orbit segments. The algorithm leverages visible band observations at night to assess combustion efficiency and detect smaller, cooler fires.
Eight seismic profiles totaling about 2000 km and bathymetric data collected by the RAT "Rig Seismic" vessel in February 1992 provide coverage of the Christmas Island area. The Australian Ocean Data Network compiled this data to produce a new bathymetric map at a 1:1,000,000 scale, offering more detail than previous maps from the 1970s and 1980s. The dataset likely contains information on seabed morphology, sediment thickness, seamount distribution, and the structure of the Java Trench.