Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,407 datasets
Legacy product describing heavy-mineral sand deposits along the Western Australian coast. The dataset is published by the Australian Ocean Data Network on data_gov_au. It was last updated on 2026-06-23.
The Albany Canyon complex extends 700 km from Cape Leeuwin to east of Esperance, with canyons cutting down up to 2000 meters. Geoscience Australia Data compiled this information on canyon structure and geological history, last updated in May 2026. The data likely contains details on canyon dimensions, thalweg slopes, and the exposed Jurassic and younger rock sequences.
SemantaAI's dataset suite demonstrates a World Intelligence Operating System capable of generating large, structured, and scenario-covered synthetic worlds for many industries. The dataset is a market-facing proof of concept, intended to show capabilities beyond a single narrow demo. It was last updated on June 22, 2026.
United Nations Human Settlements Programme data tracks the proportion of urban populations with access to services like improved water, sanitation, clean energy, internet, and durable housing. The dataset is provided in XLSX format and was last updated on May 29, 2026. It originates from the UN-Habitat Data and Analytics Section.
Ophiomicros bathursti, a new genus and species of ophiuroid (brittle star), is described from Cenomanian (Upper Cretaceous) strata on Bathurst Island, Northern Territory. The description highlights morphological distinctions, such as unusually large oral plates and small adoral plates, which differentiate it from allied genera like Ophiura and Amphiura. This dataset comprises the formal taxonomic publication detailing the fossil's discovery and classification.
9.5 KB of simulation analysis data supporting a novel data compression method for bridge monitoring. The dataset, authored by Ming Chen and shared on figshare, demonstrates a domain knowledge-based compression method achieving a 75% compression ratio, with a synergistic processing method exceeding 92% compression and 95% data fidelity. The data was last updated on April 15, 2026.
5.5 KB of simulation results evaluating a novel domain-specific data compression algorithm for bridge structural health monitoring. The dataset, authored by Ming Chen and last updated in April 2026, contains error statistics for sparse data after a supplementation process. The described method achieved a 75% compression ratio, exceeding 92% with synergistic processing, while retaining 95% data fidelity.
Bench-easy-6-2026 is an Effortless-to-Easy tier question-answering benchmark designed by Seton Labs to evaluate basic reasoning and generalization in small AI systems. The dataset was last updated on June 22, 2026, according to the platform metadata. Its creator notes the benchmark is intended for research, experiments, or fun, with an acknowledgment that accuracy issues are being addressed.
Seasonal variations in major ions, nutrients, and chlorophyll a were examined at two sites in the upper Swan River estuary. The data likely captures intra-annual variations influenced by riverine discharge, with temperature ranging from 13-29Β°C and salinity from 3-30. The dataset is provided by Geoscience Australia Data and was last updated in May 2026.
Legacy product from the Australian Ocean Data Network concerning heavy-mineral sands along the east coast of Australia. The dataset is published on data_gov_au and was last updated on 2026-06-23 04:11:12.884076. No abstract or detailed column information is available for this resource.
A 5.5 KB dataset from figshare contains experimental data on a novel SIRT5 inhibitor designed using X-ray cocrystal structures. Yingyi Jiang published the data in May 2026, detailing the inhibitor's IC50 of 0.29 ΞΌM and its effects on renal function and inflammation markers in mouse models of septic acute kidney injury.
Legacy product from the Australian Ocean Data Network, last updated on 2026-06-23. The dataset concerns the morphology of the east Australian continental shelf between Cape Moreton and Tweed Heads. It likely contains geospatial data related to offshore heavy-mineral prospects.
A systematic mapping study analyzing 54 publications from ACM, IEEE, and Scopus on Usability and User Experience evaluation of Generative AI tools in the post-ChatGPT period. The study examined 2,473 publications and identified substantial documentation gaps and terminological fragmentation. The dataset was created by Rafael Pereira and last updated in May 2026.
The Eval Cards Backend Dataset contains pre-computed evaluation data for 5,678 models across 798 benchmarks. Generated by the eval-cards backend pipeline, it powers the Eval Cards frontend and includes 1,321 metric-level evaluations. The dataset was last generated on May 5, 2026.
Simulation data from two-dimensional magnetohydrodynamic (MHD) and runaway electron fluid models for disruption events in the SPARC tokamak. The work provides a systematic comparison and benchmarking of different primary runaway electron sources, including activated tritium beta decay and Compton scattering. The dataset was authored by Datta, R., C. Clauser, N. Ferraro, C. Liu, R. Sweeney, R. A. Tinguely from the Plasma Science and Fusion Center Dataverse.
Port Curtis in Queensland, Australia, is the location for this water quality dataset collected by sensors deployed as part of the Port Curtis Integrated Monitoring Program (PCIMP) in Zone 05, the Inner Harbour. The data likely contains time-series measurements from 01 July 2006 to 26 March 2026 and is managed by the Australian Ocean Data Network.
Legacy product from the Australian Ocean Data Network with no abstract available. The dataset likely contains information on heavy-mineral deposits along the coasts of Victoria, Tasmania, and South Australia. It was last updated on 2026-06-23 01:47:06.115067.
A protocol for a cross-sectional observational study of 180 Mandarin-speaking children aged 4-6, developed by Cai Wang. The study aims to evaluate a culturally adapted framework for profiling conversational abilities in children with and without Developmental Language Disorder, using audio-video recordings and linguistic annotation. The protocol was last updated in May 2026.
A methodological protocol for an age-stratified cross-sectional observational study involving 180 children aged 4-6. The protocol, authored by Cai Wang and last updated in May 2026, describes a multimodal framework for assessing conversational abilities in Mandarin-speaking children with and without Developmental Language Disorder.
A 720.4 KB document describing CS-DTA, a language model-driven framework for predicting drug-target affinity under strict cold-start conditions. The framework was developed by Zhaokun Jiang and integrates large language models for compound and protein representation learning with a cross-modal interaction module. The associated data was last updated on 2026-04-28.