Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,564 datasets
Three experiments using a modality-switch paradigm investigate whether readers engage in embodied mental simulation when comprehending verbs. The 677.7 KB document, authored by Zhiwei Cai and last updated in April 2026, presents results showing action channel switching costs for different verb-noun phrase types. The findings support the Perceptual Symbol Systems theory of language grounding.
The central Great Barrier Reef Province's Cainozoic evolution is deduced from seismic reflection profiling. The dataset likely contains interpretations of shallow, intermediate, and deep focus seismic data, describing depositional episodes from the Late Cretaceous to Pleistocene. It was contributed by the Australian Ocean Data Network and last updated in May 2026.
Experimental data from a study investigating how Duhuo Jisheng Decoction (DHJSD) ameliorates intervertebral disc degeneration (IVDD) by modulating macrophage polarization and immune-inflammatory responses. The dataset was authored by Yongliang Mei and shared on figshare under a CC-BY-4.0 license, with a last update timestamp of 2026-04-21. The file is 193.5 KB in size.
Experimental data from a study investigating how Duhuo Jisheng Decoction (DHJSD) ameliorates intervertebral disc degeneration (IVDD) by modulating macrophage polarization and immune-inflammatory responses. The dataset, shared by Yongliang Mei on figshare, includes results from in vivo and in vitro experiments, as well as bioinformatics analysis of single-cell sequencing data. It was last updated on 2026-04-21 and is shared under a CC-BY-4.0 license.
Experimental data from a study investigating the effects of Duhuo Jisheng Decoction (DHJSD) on intervertebral disc degeneration (IVDD). The dataset likely contains results from in vitro and in vivo experiments analyzing macrophage polarization and immune-inflammatory responses. The data was authored by Yongliang Mei and shared under a CC-BY-4.0 license on figshare in April 2026.
103 hair samples from intact cats were analyzed for cortisol, a biomarker for long-term stress, with an overall mean concentration of 9.99 pg/mg. The data, collected by Veronika Vojtkovská, includes owner-reported information on cat characteristics, behavior, and housing. Male cats exhibited significantly higher cortisol levels than females, while age, breed, and environmental factors showed no significant effect.
The Rouchel district in eastern New South Wales occupies an area of about 1200 km² northwest of Newcastle. Carboniferous sedimentary and volcanic rocks in the district are part of the New England Belt of the Palaeozoic Tasman Geosyncline. The dataset likely contains geological descriptions and maps, provided by the Australian Ocean Data Network.
MIL3MAE_4 is a Level 3 satellite data product from NASA's Multi-angle Imaging SpectroRadiometer (MISR) instrument. It provides a monthly, global statistical summary of column aerosol optical depth at 555 nanometers and aerosol compositional type frequencies, gridded at a 0.5-degree resolution. The dataset is designed to monitor long-term trends in atmospheric particles, clouds, and land surface cover.
The tectono-stratigraphic development of the southwestern corner of Australia is illustrated in a plate-reconstructed setting. The dataset includes palaeogeographic maps based on a structural elements map and a chronostratigraphic section for the Perth Basin, compiled by the Australian Ocean Data Network. The record was last updated on 2026-05-05.
Tropical Cyclone Dominic generated a record Endeavour River discharge of nearly 50,000 megalitres/day and delivered 135-228 tonnes of terrestrial clay to Boulder Reef on the Great Barrier Reef. Sediment and water flux were monitored, showing water velocities up to 60 cm/s and sediment loads two to five times greater during the high-energy event. This dataset, sourced from the Australian Ocean Data Network, captures the interplay of reef-derived carbonate and terrigenous material deposition over a monitoring period.
REGISTRO DE ACTIVOS DE LA CONTRALORÍA MUNICIPAL DE YUMBO is a public information asset registry created in compliance with Decree 103 of 2015, Article 37. The dataset, published on www.datos.gov.co, inventories information classified as public by the Municipal Comptroller's Office of Yumbo. It was last updated on 2026-05-18.
Summer measurements from 1994 and 1995 at various sites in Port Phillip Bay, Australia, define solute exchange rates between sediment and water. The data, from Geoscience Australia, indicate benthic recycling accounted for 63% and 72% of annualized nitrogen and phosphorus inputs to the bay, respectively, while denitrification removed 63% of potentially recyclable nitrogen.
A discussion document from 1971 presents corrected geological data for Papua New Guinea. It critiques a prior publication's inaccuracies regarding lineaments and fault systems, such as the omission of the Owen Stanley and Frieda faults. The Bureau of Mineral Resources and the Geological Survey of Papua New Guinea compiled this data through ground surveys, airphoto analysis, and side-looking airborne radar imagery over the preceding 5-25 years.
The dataset likely contains surficial geology maps and field observations from the Vestfold Hills in East Antarctica, presented at the 2024 Australian Antarctic Research Conference. It was contributed by the Australian Ocean Data Network and last updated on 2026-05-04. The data is intended to provide scientific evidence for managing Antarctic Specially Protected Areas (ASPAs), such as ASPA No. 143 Marine Plain.
Ten measurement sites collected surface meteorological and radiation data across a 1000 km by 1000 km area of northern Manitoba and Saskatchewan. The dataset includes Suite A measurements from all ten sites and Suite B diffuse solar and longwave measurements from five of those sites. Data collection occurred from December 1993 through December 1996 under the BOREAS project managed by the Saskatchewan Research Council and NASA.
Two natural olivine reference materials, AOL and POL, are reported for in situ oxygen isotopic analysis. The dataset includes recommended δ¹⁸O values and homogeneity data, with two standard deviations (2SD) of 0.33‰ (N = 105) for AOL and 0.37‰ (N = 105) for POL. Author Juan Li published this 44.7 KB Excel file on figshare in April 2026 under a CC-BY-4.0 license.
GOT_Uncensored is a curated dataset designed for fine-tuning language models on uncensored reasoning tasks. The dataset was created by author '11-47' and was last updated on May 31, 2026. Its description indicates it is a high-quality merged collection from multiple sources.
Recorded violent encounters between the Mexican army and alleged criminal groups from January 2007 to December 2022. The data originates from the Mexican Secretariat of National Defense (SEDENA) and was processed by Guillermo Escaño. It includes counts of soldiers and alleged aggressors killed, wounded, or detained.
A synthetic, paired-image benchmark for evaluating concept-based interpretability. Each item is an (original, synthetic) image pair where exactly one object class is removed, generated with FLUX.2 [dev] conditioned on COCO reference images. It accompanies the paper 'Evaluating the Interpretability of Sparse Autoencoders with Concept Annotations'.
OMUANC provides selected GEOS-5 Forward Processing model parameters co-located with the Ozone Monitoring Instrument (OMI) UV-2 satellite swath. The dataset includes snow cover, sea ice cover, land cover, terrain height, a row anomaly flag, and pixel area to support Level 2 algorithm processing and downstream research. Each netCDF4 file is approximately 45 MB and maps the original 0.625° x 0.5° model data to OMI's 13 km x 24 km nadir resolution.