Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
40,087 datasets
Colombia's agricultural frontier boundary map identifies rural land separating agricultural areas from ecologically important zones and areas where farming is legally excluded. The dataset, version 4.1, was generated using ArcGIS software and a multi-stage process integrating land cover data, family farming areas, and legal exclusions like national parks and páramos. It provides polygon-level data for administrative units across the country.
A research document describes a study on combination immunotherapy for colon cancer using a mouse model. The study, authored by Qianyu Jing, was last updated on June 3, 2026. It details the effects of combining the TLR7/8 agonist 3M-052 with an inhibitory anti-TNFR2 antibody on tumor growth and immune cell populations.
A 5.4 MB collection of supplementary materials for a research article proposing improved regression models for continuous proportional data. The work, authored by Changwoo J. Lee, was last updated on May 18, 2026. It includes PDF, TXT, and DS_STORE files detailing the cobin and micobin models, which address robustness and computational issues in beta regression.
Deep P-Spline (DPS) is a methodology linking neuron selection in deep neural networks to knot placement in basis expansions. The dataset includes a PDF and TXT file authored by Noah Yi-Ting Hung, last updated on 2026-05-18. It describes a framework for efficient network structure tuning with theoretical guarantees.
Data from the Wisconsin Longitudinal Study (WLS) used to study the long-term effects of unwanted pregnancies on mothers' later life outcomes. The dataset, last updated in 2026, employs a two-team cross-screening approach for exploratory, confirmatory, and replication analysis within a single observational study. It focuses on outcomes related to mental health, physical health, economic well-being, and life satisfaction.
Survey data from 287 senior undergraduate English majors across ten universities in Chongqing investigates their acceptance of Generative AI tools for thesis writing. The dataset includes questionnaire responses and semi-structured interview data, collected using the Integrated Model of Technology Acceptance. It was authored by Wulin Ma and last updated on 2026-05-31.
Version 5.0 of the OMI/Aura Nitrogen Dioxide product incorporates high-resolution (~25 km) Global Modeling Initiative simulations and improved algorithms for de-striping and cloud retrieval. Each Level-2 file contains data from the daylit portion of an orbit (~53 minutes), with approximately 14 orbits generated per day. The dataset provides slant column density, total vertical column density, and separated stratospheric and tropospheric vertical column densities for nitrogen dioxide.
Nimbus-7 SAMS data provides temperature profiles across 62 pressure levels from 246 to 0.0012 mbar, offering a detailed vertical view of the upper atmosphere. The dataset covers a 4.5-year period from December 1978 to June 1983, with daily files gridded at 2.5° latitude by 10° longitude resolution from 50°S to 67.5°N. Its two record types include full-resolution retrievals and a subset of temperature and error values at 10 standard pressure levels.
The Samuel y Audrey Bilingual YouTube Transcript Corpus ES/EN provides 643 travel video records with Spanish and English transcript content. Created by Samuel Jeffery, the dataset includes SRT payloads, video metadata, and multiple export formats. It was last updated on 2026-05-29.
A US-focused transportability assessment of the FINEARTS-HF clinical trial evaluating finerenone for heart failure. The dataset includes five supplemental tables and figures detailing effect modifiers, regulatory approvals, and systematic review criteria. Alex J Turner published these materials on figshare in May 2026.
The Great Artesian Basin covers 1.7 million square kilometers, about one-fifth of Australia, extending across parts of Queensland, New South Wales, South Australia, and the Northern Territory. It consists of a multi-layered confined aquifer system with Triassic, Jurassic, and Cretaceous sandstones, and its development since around 1880 supports pastoral industries and town water supplies. The dataset is provided by the Australian Ocean Data Network and was last updated in June 2026.
The Australian Ocean Data Network provides an assessment of regional lithofacies variations based on 365 surface and near-surface seabed samples from the Tasmanian shelf and Bass Strait. The data describes sediment composition, including quartz-rich sands, muddy sediments, and relict bryozoan sands and gravels, with geochemical analyses for elements like tin and phosphorus. It was last updated on June 4, 2026.
The Australian Ocean Data Network provides a gravity field dataset for offshore Australia. It divides the free-air anomaly field into about fifty regional provinces characterized by trend, level, or disturbance, and discusses them in relation to structural and bathymetric features. The dataset uses Bouguer anomalies as a guide to crustal thickness variations and covers features from continental shelves to abyssal plains.
A 2026 figshare document by Georgina Hopkins details a study using a human air-liquid interface bronchial epithelial model (MucilAir™) to investigate protease exposure. The work assesses the impact of liquid dosing and protease concentrations from 0.00125 to 7812 μg/mL on epithelial barrier integrity, cytokine production, and extracellular vesicle dynamics. It provides a foundation for developing in vitro methodologies for inhalation risk assessment.
NESP Marine Biodiversity Hub project A6 focuses on prioritizing research and management needs for Australian elasmobranch species. The project involved a 2016 workshop to assess species by degree-of-concern and knowledge gaps, plus a desk study on management pathways. Planned outputs include a workshop report and a paper on adaptive bycatch monitoring.
Twelve genera and thirteen species of marine invertebrate macrofauna are described from the upper Permian strata of eastern Australia. The fauna includes one newly recognized genus, Pseudonucula, and one new species. This data supports the identification of three biostratigraphic zones and conclusions about regional geological correlations and a depositional hiatus.
A 13-fold enhancement in phosphorescence quantum yield to 40.8% is achieved by encapsulating a single silver ion in a gold nanocluster. This dataset contains XYZ files describing the structure of the high-nuclearity bimetallic nanocluster [Au11Ag1] and its hollow [Au11] precursor, published by Shan-Le Wang on figshare in 2026. The silver encapsulation provides structural reinforcement via 9-fold Au(I)–Ag(I) interactions, enabling efficient photocatalytic aerobic oxidation of sulfides.
9000 km² of the Bremer Basin underlies the upper continental slope of offshore southwest Australia. The dataset, provided by the Australian Ocean Data Network, describes the basin's structure, likely sedimentary composition, and inferred hydrocarbon potential based on seismic data and geological analogy. It was last updated on 2026-06-04.
Moreton Bay's vector tile layer provides a customized world basemap optimized to display special areas of interest (AOIs). These AOIs include landscaping features like grass, trees, and rock, and sports amenities such as tennis courts and field lines, created and edited by Community Maps contributors. The layer is built using the same data sources as Esri's World Topographic Map and was last updated on 2026-05-27.
Northwestern Australia's Bonaparte Basin provides a fossil record of Early Carboniferous ostracods. The dataset describes at least 29 species across 18 genera, including eight new species, and proposes an eight-assemblage biostratigraphic scheme. It was published by the Australian Ocean Data Network and last updated in June 2026.