Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,407 datasets
501 participants from five communities in Shanghai's Huangpu District were recruited for a community-based intervention trial from August 2024 to March 2025. The dataset likely contains results from a multimodal intervention combining educational materials and AI-assisted voice calls to promote child restraint system use. The study was authored by Ning Gao and shared under a CC-BY-4.0 license.
Arctic Water Vapor Characteristics from Rawinsondes, Version 1 is a gridded monthly-mean database assembled by NASA from fixed stations and Russian drifting ice stations. It contains variables like temperature, specific humidity, wind speed, and vapor flux across 15 pressure levels and five atmospheric layers. Data coverage spans from 1954 to 1991, structured on an octagonal grid centered over the North Pole.
3-hour and daily aggregated satellite data provides near-real-time aerosol optical depth measurements from NASA's Dark Target and Deep Blue algorithms. The product combines observations from VIIRS instruments on NOAA-20 and SNPP satellites with MODIS instruments on Aqua and Terra platforms. It is designed for evaluation and data assimilation applications, with data filtered using pre-defined quality assurance thresholds.
Australia's sub-continental lithosphere is modeled in 3D electrical resistivity. The model was created by inverting magnetotelluric responses from the AusLAMP and AWAGS arrays using a novel distributed 3-D electromagnetic inverse solver. Presented at the 2025 Australasian Exploration Geoscience Conference, it unlocks insights into crust-mantle controls on mineral resources.
NASA's AERDA_D3_VIIRS_MODIS_NRT product provides 24-hour aggregated aerosol optical depth data from four satellite instruments: VIIRS on NOAA-20 and SNPP, and MODIS on Aqua and Terra. It combines Dark Target and Deep Blue retrieval algorithms with consistent Level 3 aggregation and pre-defined QA filtering for near-real-time applications. The dataset is produced daily after all upstream Level 2 data becomes available.
Government and Municipalities of Québec provides detailed taxation and pricing rates for the City of Montreal. The dataset shows rates by building types, including residential, non-residential, and wasteland, applied to annual adjusted property values. The data was last updated on 2026-04-17.
Searchable metadata for papers from top AI venues including NeurIPS, ICML, ICLR, CVPR, ICCV, WACV, ACL, EMNLP, and NAACL. The dataset is hosted by GenAI4ELab and was last updated on June 14, 2026. It includes a full index and per-venue browse views.
A 25.7 KB Excel file from figshare, last updated on 2026-05-23. The dataset relates to research on secondary brain injury following acute ischemic stroke, specifically focusing on the inflammatory response in ischemia-reperfusion injury. It was authored by Bingjie Jiang and is shared under a CC-BY-4.0 license.
Records from January 1, 2009, list building permits issued by the City of Edmonton's Urban Planning & Economy Department for construction and maintenance. The dataset includes permit details, location coordinates, and occupancy dates for residential and non-residential projects, published by data.edmonton.ca. Applicant information is withheld for privacy reasons.
1,281,633 rows of metadata and URLs for images from the Vogue Runway dataset. The dataset includes fields such as image dimensions, designer, season, year, category, file size, aesthetic score, and JSON-encoded tags. It was created by ROSCOSMOS and last updated on Hugging Face in June 2026.
20.7 KB of text files contain data used for the analysis in the paper "Accumulation of CO2 limits energy gain in freely diving grey seals." The dataset includes files for fish energy content, metabolic rates, triglyceride concentrations, and lactate levels from experimental trials. It was authored by Eva-Maria Bonnelycke and last updated in April 2026.
Veedurías Ciudadanas Personería Envigado tracks citizen-led oversight groups monitoring public administration and private entities handling public resources. The dataset includes columns for registration year, municipality, number of members, and the specific object of oversight. It is published on the Colombian open data portal, datos.gov.co, and was last updated in May 2026.
The central New South Wales continental shelf is the focus of this dataset, which relates its morphology to offshore heavy-mineral prospects. It is a legacy product from the Australian Ocean Data Network with no abstract available. The dataset was last updated on 2026-06-23.
Geospatial data describing the geomorphology and sedimentology of the continental shelf adjacent to Mac Robertson Land in East Antarctica. The dataset, provided by the Australian Ocean Data Network, characterizes a 'scalped shelf' deeply eroded by glaciers and currents during the Quaternary period, exposing underlying basement rock. The record was last updated on 2026-06-16.
Legacy product describing heavy-mineral sand deposits along the Western Australian coast. The dataset is published by the Australian Ocean Data Network on data_gov_au. It was last updated on 2026-06-23.
The Albany Canyon complex extends 700 km from Cape Leeuwin to east of Esperance, with canyons cutting down up to 2000 meters. Geoscience Australia Data compiled this information on canyon structure and geological history, last updated in May 2026. The data likely contains details on canyon dimensions, thalweg slopes, and the exposed Jurassic and younger rock sequences.
SemantaAI's dataset suite demonstrates a World Intelligence Operating System capable of generating large, structured, and scenario-covered synthetic worlds for many industries. The dataset is a market-facing proof of concept, intended to show capabilities beyond a single narrow demo. It was last updated on June 22, 2026.
United Nations Human Settlements Programme data tracks the proportion of urban populations with access to services like improved water, sanitation, clean energy, internet, and durable housing. The dataset is provided in XLSX format and was last updated on May 29, 2026. It originates from the UN-Habitat Data and Analytics Section.
Ophiomicros bathursti, a new genus and species of ophiuroid (brittle star), is described from Cenomanian (Upper Cretaceous) strata on Bathurst Island, Northern Territory. The description highlights morphological distinctions, such as unusually large oral plates and small adoral plates, which differentiate it from allied genera like Ophiura and Amphiura. This dataset comprises the formal taxonomic publication detailing the fossil's discovery and classification.
9.5 KB of simulation analysis data supporting a novel data compression method for bridge monitoring. The dataset, authored by Ming Chen and shared on figshare, demonstrates a domain knowledge-based compression method achieving a 75% compression ratio, with a synergistic processing method exceeding 92% compression and 95% data fidelity. The data was last updated on April 15, 2026.