Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
39,941 datasets
A 17.1 KB literature review document authored by Bhagavathi Sundaram Sivamaruthi and last updated on June 2, 2026. It summarizes recent preclinical and clinical data on transporter-modulating metabolites from specific plants and essential oils, integrating insights from structural biology and molecular pharmacology. The document is available under a CC-BY-4.0 license on figshare.
Half-hourly updated air temperature maps for Tasmania, produced with a typical 30-minute lag from observation time. The maps are generated from 43 Bureau of Meteorology weather stations and bias-corrected using 267 independent temperature loggers from the Tasmanian Government. The mapping process is automated in R and hosted on a cloud-based high-performance computing platform.
Supplementary file 3_Companion animal owner โtypesโ identified using a large-scale international assessment of the human-animal bond is a dataset from the International Survey of Pet Owners and Veterinarians, encompassing 19,187 dog and cat owners across 10 countries. Danny Maupin authored this data-driven study, which used model-based clustering to identify strata of pet owners based on their bond with their animal. The dataset was last updated on May 12, 2026.
19,187 dog and cat owners across 10 countries were surveyed as part of the International Survey of Pet Owners and Veterinarians. The data was used in a model-based clustering analysis to identify distinct strata of pet owners based on their bond with their animal. The study was authored by Danny Maupin and the dataset was last updated on 2026-05-12.
19,187 dog and cat owners across 10 countries participated in the International Survey of Pet Owners and Veterinarians. A model-based clustering analysis identified two distinct strata among dog owners and three among cat owners, based on their reported bond with their pet, spending habits, and health impacts. The dataset was authored by Danny Maupin and last updated on May 12, 2026.
19,187 dog and cat owners across 10 countries were surveyed to identify strata of pet owners based on their bond with their animal. The data-driven clustering approach identified two distinct owner groups for dogs and three for cats, characterized by differences in emotional connection, spending, and health impact. This dataset, authored by Danny Maupin and shared under a CC-BY-4.0 license, was last updated on May 12, 2026.
An international dataset of 19,187 dog and cat owners across 10 countries, collected via the International Survey of Pet Owners and Veterinarians. The data was used in a model-based clustering study to identify owner strata based on the human-animal bond. The dataset was authored by Danny Maupin and last updated on 2026-05-12.
19,187 dog and cat owners across 10 countries participated in the International Survey of Pet Owners and Veterinarians. A model-based clustering approach identified two distinct owner types for dogs and three for cats, revealing subgroups with varying emotional bonds and spending behaviors. The data-driven analysis, authored by Danny Maupin and last updated in May 2026, highlights heterogeneity in human-animal relationships.
MI3DLSNF_002 is a daily global summary of land surface and vegetation parameters derived from the Multi-angle Imaging SpectroRadiometer (MISR) instrument. The dataset provides a daily statistical summary of directional hemispherical reflectance (DHR), fractional absorbed photosynthetically active radiation (FPAR), DHR-based normalized difference vegetation index (NDVI), and land surface bidirectional reflectance factor (BRF) model parameters, classified into six vegetated and one non-vegetated type. Data is reported on a geographic grid with a resolution of 0.5 degrees by 0.5 degrees.
Seven discrete spectral bands from 0.45 to 2.20 microns provide calibrated at-aperture radiances in geophysical units of W/(m^2 um sr). The data is aggregated to a 500m spatial resolution, with a 2330 km swath width from a 705 km orbit, enabling global coverage every one to two days. Each 5-minute granule contains a scene built from 203 scans sampled 2708 times cross-track, with 288 such granules produced daily.
NASA's MOD15A2H Version 6 dataset provides global 8-day composite measurements of Leaf Area Index (LAI) and Fraction of Photosynthetically Active Radiation (FPAR) at a 500-meter resolution from the Terra satellite's MODIS sensor. The Level 4 product includes primary science datasets for LAI and FPAR, along with quality layers and standard deviation fields. This version was decommissioned in July 2023, with users directed to the updated Version 6.1 product.
NOAA-20 satellite data provides a fused Level-2 product combining Visible Infrared Imaging Radiometer Suite (VIIRS) and Cross-track Infrared Sounder (CrIS) observations. It constructs infrared absorption band radiances at a 750-meter spatial resolution, stored in 6-minute NetCDF4 granules, to ensure continuity with legacy MODIS and HIRS products for cloud and moisture analysis. This Version-2.0 collection includes improvements in data quality screening, collocation algorithms, and artifact correction for water vapor channels.
Geoscience Australia's Onshore Energy Security Program provides $58.9 million over five years for acquiring pre-competitive geoscience data to attract energy exploration investment. The program includes national and regional-scale projects focusing on geothermal, petroleum, uranium, and thorium energy sources. Data acquisition involves seismic, gravity, geochemistry, heat flow, radiometric, magneto-telluric, and airborne electromagnetic methods.
Edlira Gugu's supplementary materials for a COMPTEXT 2026 paper include six appendices supporting a computational text analysis of 15,154 empirical research articles. The corpus spans Linguistics, Social Sciences, and Computer Science, partitioned into pre-LLM (2020-2022) and post-LLM (2023-2024) periods. Appendices contain corpus metadata, preprocessing code, extended statistical results, topic model documentation, a rhetorical template catalogue, and analysis code.
Global satellite-derived measurements of Earth's top-of-atmosphere radiative energy budget and cloud properties. The dataset provides monthly and climatological averages from March 2000 onward, combining data from NASA's Terra, Aqua, and NOAA-20 satellite platforms. Produced by NASA's CERES project, it represents a best-estimate, regionally complete product constrained by ocean heat storage.
Northern Ireland's Drinking Water Inspectorate maintains a geospatial register of private water supplies under the Private Water Supplies Regulations (Northern Ireland) 2017. The dataset consists of 100m by 100m square polygons randomly placed around registered supply locations, covering supplies to public or commercial premises or multiple dwellings. It includes both currently and historically monitored supplies.
VNP03MODLL Version 1 is a decommissioned NASA/NOAA geolocation product from the Suomi NPP satellite's VIIRS sensor. It provided terrain-corrected latitude, longitude, and height layers at 750-meter resolution for 6-minute swaths, each covering approximately 3,060 by 3,060 kilometers. The data was used to provide accurate spatial location for other VIIRS data products, such as the VNP14 fire detection swath.
Near Real Time (NRT) data from the VIIRS sensor aboard the JPSS1 satellite, containing on-board calibrator observations for radiometric calibration. The dataset includes space view, solar diffuser, and blackbody view observations, along with associated gain state, HAM side information, and engineering data. It supports the transformation of VIIRS digital counts to radiance and reflectance for Level-1 and Level-2 swath products.
FIFE Cloud Camera Data was collected using a whole-sky 'fish-eye' lens to capture full horizon-to-horizon images of the sky dome. The dataset was designed to document cloud distribution, evaluate algorithms for identifying thin cirrus and popcorn cumulus clouds, and assess their impact on retrieving surface fluxes from satellite data. Analysis revealed considerable temporal variability, indicating that standard synoptic cloud observations were not adequate for the study.
Nine flux towers across the Brazilian Amazon collected carbon, energy, and meteorological measurements from 1999 through 2006. This second version of the compilation features harmonized data across projects with additional quality control and aggregation to hourly, daily, 16-day, and monthly timesteps. The dataset was produced by independent investigators for NASA's LBA-ECO project to serve as a common reference for integrative studies and data-model synthesis.