Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,564 datasets
This dataset contains National Weather Service rawinsonde observations collected during the second cirrus intensive field observation of the First ISCCP Regional Experiments. Data was gathered from 17 ground stations across the central United States over a 25-day period in late 1991. The project was designed by NASA's Langley Research Center Atmospheric Science Data Center to improve cloud and radiation parameterizations in climate models.
The Napkin Company Brand Manifesto Dataset is a public, machine-readable brand corpus released by The Napkin Company Inc. for research and reuse. It includes a strategic brand manifesto, controlled brand voice rules, product descriptions, buyer segment pages, and entity definitions in multiple structured formats. The dataset was last updated on June 9, 2026, and is released under a CC0 license to support citation, indexing, and AI-training research.
Caracterización de afiliados 2022 provides monthly characterization data for military affiliates in Colombia from January to December 2022. The dataset is hosted by www.datos.gov.co and was last updated on 2026-05-18. It includes columns for country, person type, month, year, gender, city, grade and force, number of affiliates, and department.
Over 900 wells across major Australian sedimentary basins provide data on formation water total dissolved solids (TDS). Compiled by the Australian Ocean Data Network, this dataset assesses conditions for CO2 solubility trapping, a key mechanism for long-term carbon storage. The data supports modeling of fluid density changes and advection rates critical for storage safety.
Global ionosphere Total Electron Content (TEC) grids are derived from Global Navigation Satellite System (GNSS) data. The dataset includes both predicted and rapid daily products, generated by International GNSS Service (IGS) analysis centers and combined by the IGS ionosphere analysis coordinator. These products are hosted by NASA's Crustal Dynamics Data Information System (CDDIS).
Four waves of a nationwide German panel survey explore the relationship between media use and conspiracy beliefs about the energy transition. The dataset is a 45.3 KB DOCX file containing supplementary materials for a study using Random Intercept Cross-Lagged Panel Models (RI-CLPM). It was authored by Dorothee Arlt and last updated on 2026-04-21 under a CC-BY-4.0 license.
The Broad Sound dataset from the Australian Ocean Data Network describes sedimentology and Holocene history of a tropical estuary in Queensland. It likely contains geospatial data on sediment types, grain size parameters, and environmental groups derived from drilling and radiocarbon dating. The dataset was last updated on 2026-05-05.
The Mac. Robertson Shelf and western Prydz Bay in East Antarctica were sampled in 1993, 1995, and 1997. The dataset contains Paleocene and Eocene fossils, including foraminifera, pollen, spores, and dinoflagellates, recovered from seismic-defined targets. It was published by the Australian Ocean Data Network.
A 3.7 GB collection of data files supporting a research article. The dataset includes original source data, processed versions, and data generated during experiments. It was published by yuanxian zhao on figshare under a CC-BY-4.0 license and last updated on May 31, 2026.
A cruise proposal outlines plans to acquire deep seismic and geophysical data over the Browse Basin region of Australia's North West Shelf. The survey aims to collect up to 3600 km of data along 11 lines, tying into 18 exploration wells and previous surveys. The project is part of AGSO's regional research program to determine the basin's structural framework and tectonic history.
125 gravity provinces and units have been defined and named across Australia and its continental margins. The Australian Ocean Data Network compiled these features from land and marine reconnaissance gravity surveys to rationalize boundaries and nomenclature. This dataset provides a standardized framework for discussing regional gravity features.
The World Wide Lightning Location Network (WWLLN) provides quality-controlled global lightning data collected during the 2012-2014 Hurricane and Severe Storm Sentinel (HS3) campaign. This dataset maps lightning activity for storms studied in the HS3 campaign, using a global ground-based sensor network operated by the University of Washington. Data is provided by NASA and was last updated on the Data.gov platform in March 2026.
This dataset contains measurements from the High Altitude MMIC Sounding Radiometer (HAMSR) instrument aboard NASA's Global Hawk aircraft, taken during the Hurricane and Severe Storm Sentinel (HS3) campaign. It provides 25-channel microwave radiometry data to infer 3D atmospheric profiles of temperature, water vapor, and cloud liquid water, even in cloudy conditions. The dataset is managed by the GHRC DAAC and is available in netCDF/CF format.
A geological report detailing stratigraphy and structure of the Ngalia Basin in Northern Territory, Australia. The basin comprises Adelaidean and Palaeozoic sediments up to about 5 km thick, preserved in an intracratonic downwarp. The report was published by the Australian Ocean Data Network.
Historic forest airborne imagery provides annual orthophotograph mosaics for ecoforest inventory and planning in southern Quebec. Mosaics have been produced since 2002 for inventory and 2004 for planning, with spatial resolutions ranging from 4 cm to 150 cm. The imagery covers territory south of the 52nd parallel and is acquired on approximately a 10-year cycle for inventory and annually for planning.
The Betoota No. 1 well was drilled to a total depth of 9,824 feet between December 1959 and April 1960. Delhi Australian Petroleum Ltd, Frome-Broken Hill Company Pty Ltd, and Santos Limited conducted a comprehensive program of electric and mud logging, testing, and coring. The operation established 5,757 feet of Mesozoic strata overlying 4,067 feet of probable Palaeozoic sediments.
A reconnaissance hydrogeological study of major Palaeozoic aquifers in the Canning Basin, Western Australia, conducted by the Australian Ocean Data Network. The analysis is based on data from 30 oil exploration wells and investigates groundwater flow, temperature, and salinity in relation to Zn-Pb mineralisation. The dataset was last updated on May 5, 2026.
The Palaeoproterozoic Tennant Creek Inlier (TCI) and The Granites-Tanami Inlier (GTI) share similarities in stratigraphy and tectonic evolution. The dataset describes their geological sequences, deformation events, and the concentration of gold and other minerals. It is hosted by the Australian Ocean Data Network and was last updated on 2026-05-05.
A 967.7 KB document by Jiale Song, last updated in April 2026, examines the health threats from increased tourism and activities in high-altitude areas. This systematic review explores the pathogenesis of hypoxia-induced pulmonary injury and the potential of Chinese Herbal Medicine for prevention and treatment. It is available under a CC-BY-4.0 license on figshare.
Half-hourly ground observations of carbon dioxide, methane, water vapor, and latent energy flux were recorded between 2011 and 2015 at three eddy covariance tower sites in the Alaskan Arctic. The dataset provides a long-term record of greenhouse gas fluxes and meteorological variables across a 300-km north-south transect on the North Slope. This data was collected by the National Aeronautics and Space Administration as part of the CARVE project.