Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,305 datasets
Prevalence data for mono-<i>Plasmodium vivax</i>, mono-<i>Plasmodium falciparum</i>, and mixed malaria infections. The dataset is stratified by collection setting (hospital vs. community) and geographic location across three sites in Cameroon: Bamenda, Bertoua, and Buea. It was authored by Cheikh Cambel Dieng and last updated on 2026-06-04.
Bat echolocation recordings were collected in the City of Wanneroo, Western Australia, between January and April 2025. The dataset includes verified reference calls from hand-released bats and acoustic transect survey recordings from bushland reserves. This data forms part of a forthcoming publication on community-based urban bat monitoring and biodiversity management.
A national strategic roadmap for open data in Colombia, led by the Ministry of Information and Communications Technologies. The dataset catalogs prioritized datasets, responsible entities, and publication status to align with global standards and national agendas. It was last updated on 2026-05-18.
A 2024 retrospective study of 536 very preterm infants from the Shenzhen Neonatal Data Network. It evaluates the implementation and synergistic effects of four core evidence-based practices on adverse outcomes like severe complications or death. The dataset was created by Jing Feng and is shared under a CC-BY-4.0 license.
A research framework and evaluation results for determining the optimal direction of user interface items in bidirectional systems. The dataset includes results from evaluating 30 common UI items with BiDi users, using multiple indicators. It was authored by Yulia Goldenberg and last updated on 2026-05-29.
151 studies were synthesized for this narrative review on low energy availability in female athletes. The review, authored by Hadeel Ali Ghazzawi and last updated in May 2026, examines the interrelationships between energy availability, disordered eating, menstrual function, and bone mineral density. It identifies low energy availability as the primary driver of multi-systemic impairments like endocrine disruption and skeletal vulnerability.
John Kruper provides probability maps, endpoints, and region-of-interest (ROI) files for four white matter fasciculi, designed for use with the pyAFQ software. The dataset is 1.5 MB and was last updated on May 25, 2026. It is derived from a combination of streamlines in the O'Donnell Research Group (ORG) atlas and endpoint regions from the Automated Anatomical Labeling (AAL) atlas.
This dataset provides daily and 3-hourly estimates of Photosynthetically Active Radiation (PAR) globally from 2000 onward, derived from combined MODIS Terra and Aqua satellite observations. It is a gridded Level 3 product with a spatial resolution of 0.05 degrees (approximately 5.6 km at the equator). The data is generated using a prototyping algorithm that calculates PAR from surface reflectance via look-up tables accounting for aerosols, clouds, and viewing geometry.
1987 data from the FIFE study area examines the spectral reflectance of manipulated vegetative canopies. The dataset contains reflectance factors and standard errors for four MSS bands, supporting analysis of a curvilinear relationship between vegetation productivity and grazing intensity. Measurements were taken at two locations in the northwest quadrant during the growing season.
A bibliometric and text-mining dataset analyzing literature on trained immunity in inflammatory bone diseases. The core corpus includes 83 records, with an extended corpus of 301 records, sourced from Web of Science Core Collection, Scopus, and PubMed. Author Zihang Zhao published the dataset under a CC-BY-4.0 license in May 2026.
83 records form the core corpus for a bibliometric and text-mining analysis of trained immunity in inflammatory bone disease, with an extended corpus of 301 records. The dataset, created by Zihang Zhao and last updated in May 2026, analyzes publication trends from 2013 to 2025 using Web of Science Core Collection, Scopus, and PubMed. It includes topic modeling and semantic screening to map literature-level patterns connecting myeloid reprogramming and periodontitis.
A bibliometric and literature-level text-mining analysis of research on trained immunity in inflammatory bone disease. The analysis includes a core corpus of 83 records and an extended corpus of 301 records, sourced from Web of Science Core Collection, Scopus, and PubMed. The dataset was created by Zihang Zhao and last updated in May 2026.
A geological report from Geoscience Australia analyzing subsurface permeability barriers affecting groundwater flow in the Murray Basin. The paper describes the stratigraphy and geometry of mid-Tertiary units, including the Ettrick Formation, Winnambool Formation, and Geera Clay, based on borelog analysis and palaeogeographic reconstructions. It includes specific porosity and permeability data from a fully cored section in the Piangil West-2 borehole.
A 5.5 KB Excel file containing primer sequences used in Real-Time PCR experiments. The data was generated by author Jeyalakshmi Kandhavelu to study the effects of Valproic acid and Zebularine on KLF4 and CTNNB1 gene expression in SW480 and DLD-1 colon cancer cell lines. The dataset was last updated on June 1, 2026.
Sentinel-5P TROPOMI Level-1B radiance product for band 8 (SWIR detector) provides calibrated radiance, irradiance, and engineering data from a hyperspectral spectrometer. The instrument covers ultraviolet-visible, near infrared, and shortwave infrared wavelengths (270nm to 2385nm) with a nadir spatial resolution of approximately 5.5km implemented from August 6, 2019. Data is generated by the Koninklijk Nederlands Meteorologisch Instituut (KNMI) processor and stored in netCDF-4 format.
SWOT Level 2 River Single-Pass Vector Node Data Product provides water surface elevation, slope, width, and discharge for river nodes spaced approximately 200 meters apart. The data are derived from the Ka-band Radar Interferometer (KaRIn) on the SWOT satellite, which launched on December 16, 2022, and entered its science phase in August 2023. This Version C product is distributed as ESRI Shapefiles covering the full satellite swath for each continent-pass.
Colombia's departmental-level index of informal rural land, calculated by the UPRA using 2019 data. The index is based on criteria including properties not linked in the cadastre-registry system, those without a property title, and those identified as public land. The dataset includes departmental codes, areas, and an informality score.
Quebec's wildlife species face documented threats with severity scores based on anticipated population decline. The table includes 130 threats affecting 91 species, using a standardized classification system from the Quebec Ministry of Forests, Wildlife and Parks. It was last updated on April 17, 2026.
Fifteen gravity cores and grab samples were collected from Prydz Bay and the Mac. Robertson Shelf during the 1992/93 Antarctic shipping season. The data, managed by the Australian Ocean Data Network, aims to study Quaternary environmental change and sedimentation processes. Sampling targets the Lambert Glacier-Amery Ice Shelf system, the largest ice stream draining the East Antarctic Ice Sheet.
A longitudinal controlled cohort study tracked 36 Holstein-Friesian cows for over 10 years under uniform conditions. The dataset includes lifetime phenotypes such as milk yield, composition, reproductive performance, survival time, and detailed veterinary diagnoses, collected by Morteza Hosseini Ghaffari and published in 2026. It compares cows stratified by genomic functional longevity breeding values while being matched for milk production potential.