Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,989 datasets
Prather DilipKumar's dataset on figshare contains computational data exploring the energetics and structural modulation of dipeptides through interactions with the ionic liquid [EMIM][TFSI]. The dataset, last updated in May 2026, includes results for a set of neutral and ionic dipeptides and individual amino acids. It provides molecular-level insights into interaction profiles, including electrostatic and dispersion energy contributions.
Geoscience Australia Data created a global benthic seascape classification to support high seas marine protected area (MPA) network design. The classification uses a multivariate statistical method with six biophysical variables to define 11 seascape categories across 53,713 separate polygons. The study demonstrates how GIS analysis of these seascape and geomorphic heterogeneity maps can objectively identify MPA candidate sites.
A survey of 399 respondents characterizes municipal solid waste composition and resident perceptions in Onitsha, Nigeria. The dataset likely contains results from Principal Component Analysis on waste impacts and Analytical Hierarchy Process evaluations of five waste-to-energy technologies. Author Emmanuel Nnaemeka Obasi published the supporting file under a CC-BY-4.0 license in May 2026.
A land suitability map for commercial lettuce cultivation in the Nariño Department of Colombia, produced in December 2018. The dataset results from a land evaluation methodology applied at a 1:100,000 scale, incorporating biophysical, socioeconomic, and socio-ecosystem components. It was developed under contract 222 of 2017 between UPRA and the University of Nariño.
Hyperspectral and multi-spectral polarization measurements of above-water radiance were collected from a helicopter platform. The dataset covers 10 stations in Chesapeake Bay during August 2022, with observations taken at four distinct altitudes ranging from 60 to 750 meters. It was produced by NASA as part of a VIIRS satellite validation experiment, with complementary in-situ measurements from a boat.
Geoscience Australia completed the first phase of a national coastal geomorphology map. The dataset is a geodatabase containing state-wide feature datasets reclassified into a national scheme, utilizing pre-existing GIS data from government agencies. The data was last updated on 2026-06-05.
DORIS RINEX data provides precise orbit determination and ground beacon positioning using a dual-frequency Doppler system. The system, developed by CNES with French agency cooperation, has flown on missions including TOPEX/Poseidon, SPOT series, Envisat, and Jason satellites. Data is collected via an uplink system where ground beacons transmit to satellite receivers, with observations centralized for distribution.
NASA's Earth Radiation Budget dataset provides nearly 30 years of calibrated observations from the ERBE and CERES projects, starting in 1984. The data includes top-of-atmosphere and surface irradiance measurements, processed using established CERES algorithms to ensure consistency. These products are available from the Atmospheric Science Data Center.
NASA's Earth Radiation Budget Satellite and CERES missions provide a calibrated, long-term record of Earth's radiation budget at the top of the atmosphere and surface. The dataset spans nearly 30 years, from 1984 onward, and is produced by the National Aeronautics and Space Administration. It applies CERES algorithms to create consistent Level 3 products for analyzing climate variability.
June 15 to July 31, 2023, this dataset contains navigation and meteorological parameters recorded by the NASA ER-2 high-altitude research aircraft during the ALOFT field campaign. The campaign aimed to study terrestrial gamma-ray flashes, gamma-ray glows, and lightning in tropical convection. Data is provided by the National Aeronautics and Space Administration in ASCII format.
GOES-13 satellite images covering the Midlatitude Continental Convective Clouds Experiment (MC3E) campaign area. The dataset contains visible and infrared images in PNG format, captured at 15-minute intervals from May 6 to June 30, 2011. It was produced and archived in near real-time by NASA's Global Hydrology Research Center for use with the Real Time Mission Monitor.
Shinae Kim mapped potential energy surfaces for ammonia decomposition and diffusion on Pt(111) using an automated Pynta tool. The dataset includes identified adsorption sites, reaction pathways, and a constructed microkinetic model based on lowest-energy configurations. The work was last updated on 2026-05-28.
5.3 MB of computational chemistry data mapping potential energy surfaces for ammonia decomposition and diffusion on Pt(111). The dataset, created by Shinae Kim using the automated Pynta tool, includes identified adsorption sites, reaction pathways, diffusion rate coefficients, and a constructed microkinetic model. It was last updated on May 28, 2026.
Brazilian adolescents aged 11–17 are the focus of this psychometric dataset. It contains validation data for the 17-item SCOAC scale, developed from a sample of 642 participants. The dataset was created by Rosana Fanucci Silva Ramos and last updated in May 2026.
DORIS data provides long-term cumulative position and velocity solutions for a global network of ground beacons, derived from satellite-based dual-frequency Doppler measurements. The dataset is produced by the International DORIS Service (IDS) Analysis Centers and coordinated by NASA's Crustal Dynamics Data Information System (CDDIS). It is aligned to the current International Terrestrial Reference Frame (ITRF), with residuals highlighting non-linear station motions.
Luu Thi Viet Ha published data on defect-engineered Ce-doped ZnO/Fe₂O₃ heterojunctions for visible-light photocatalysis on figshare in 2026. The dataset includes performance metrics for the degradation of the antibiotic levofloxacin, with an optimized sample achieving 94.72% removal within 150 minutes. The data likely contains results from structural analyses and photocatalytic activity tests across multiple catalyst mass ratios.
Yashaswee Narayan's dataset contains 500 synthetically generated customer support tickets. The records simulate realistic interactions and include uniquely identifiable canary tokens. It was last updated on June 3, 2026.
Canada-wide data on gastroschisis prevalence among infants conceived between April 2006 and March 2020. The dataset likely contains results from a population-based cohort study examining associations with conception month and geographic latitude. The research was authored by Shiliang Liu and published on figshare.
Shiliang Liu's population-based cohort study analyzes the prevalence of gastroschisis among infants conceived in Canada between April 2006 and March 2020. The dataset likely contains tabular data on gastroschisis rates, conception months, geographic latitudes, and rural/urban residence. Findings indicate a protective effect of periconceptional sunlight exposure, with winter conception and northern latitudes associated with higher risk.
Australia's offshore mineral occurrences and deposits within its 200-nautical-mile exclusive economic zone and extended continental shelf. The map draws together data from published and unpublished marine research surveys and government records, showing minerals including manganese nodules, heavy mineral sand, phosphorites, diamonds, tin, copper, gold, and coal. It was produced by Geoscience Australia in collaboration with CSIRO and state and territory geological surveys.