Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
42,823 datasets
Australia's Identified Mineral Resources 2012 is an annual national assessment providing a long-term view of mineral resources available for mining. The revised 2014 version includes evaluations of long-term trends, world rankings, and summaries of exploration results. It is published by the Australian Ocean Data Network.
figshare admin karger published survey data from 2065 adults with overweight or obesity in Germany on 2026-05-05. The data likely contains responses on awareness, use, interest, and barriers regarding primary care consultations, behavioural programmes, and pharmacotherapy for weight management. The dataset is a 4.1 MB PDF file licensed under CC-BY-4.0.
A 2026 theoretical study by Chenhui Wang provides high-accuracy interaction energies for linear alkane dimers (C_n H_{2n+2}, n=1 to 18). The dataset includes BSSE-corrected results from -2.2 kJ/mol (n=1) to -62.6 kJ/mol (n=18), with relative errors below 5% against benchmark calculations. It also contains thermodynamic analysis indicating spontaneous dimerization from n β₯ 8 at 100 K.
Twenty-five spectral channels from the High Altitude MMIC Sounding Radiometer (HAMSR) captured atmospheric data during the NASA EPOCH project in August 2017. This dataset provides measurements to infer three-dimensional profiles of temperature, water vapor, and cloud liquid water, even in cloudy conditions. It was collected from the NASA Global Hawk aircraft as part of a training and research mission focused on tropical cyclogenesis in the Eastern Pacific.
Navigation and housekeeping data from NASA's Global Hawk aircraft during the Hurricane and Severe Storm Sentinel campaign. The dataset contains real-time 1 Hz UDP packets broadcast in IWG1 format, capturing flight and atmospheric measurements to study tropical storm formation and the Saharan Air Layer. It is produced by the National Aeronautics and Space Administration, with metadata last updated in March 2026.
DNABERT embeddings calculated from plasmids and chromosomes. Maho Tokuda created this 2.1 MB dataset for a RandomForest model predicting plasmid destinations. The dataset was last updated in June 2026.
A longitudinal study dataset from 25 Mandarin-speaking children who received cochlear implants before 30 months of age. The data includes parent lexical diversity (NDW) and grammatical complexity (MLU) measures at 1 and 2 years post-implant, correlated with children's standardized language test scores at 3 years post-implant. The dataset was published by Luo et al. in 2026 and is hosted on figshare.
A Serena L. DiLiberti study reports the defluorination of trifluoromethyl arenes bearing an ortho N-heterocycle upon reaction with potassium tert-butoxide in THF at ambient temperature. The dataset likely contains experimental results from 14 demonstrated examples of this reaction, with yields up to 85%. The findings, shared on figshare in May 2026, are intended to inform synthetic route design for targets containing these functional groups.
DavidAU created an AI-tuned and condensed text dataset in June 2026. The original 'polaris' dataset was generated by GPT 5, and this version was processed and optimized by the Qwen 3.6 35B-A3B model via LMStudio. The dataset size was reduced by approximately 25%, from 5.3 MB to 3.5 MB.
Thirteen ground stations across Europe, Africa, and Brazil collected global lightning activity data from August 1 to October 1, 2006. This dataset was generated for the NASA African Monsoon Multidisciplinary Analyses campaign to study African Easterly Waves and Mesoscale Convective Systems. The network provides high temporal resolution of 1 millisecond and spatial accuracy ranging from 10-20 km within the network to over 50 km outside its periphery.
Mouse and human spatial transcriptomics data generated using SPTT and SPTEdu-seq techniques. The dataset includes multiple mouse embryo, kidney, and brain samples, as well as human ccRCC frozen sections, with digital expression data in .mtx format and metadata. The dataset is 1.6 GB in size, authored by Shuang Zhang, and was last updated on May 14, 2026.
Roberta Martino's dataset from figshare, last updated May 2026, provides morphometric and microwear data on European hippopotamus fossils. The 177.4 KB XLSX file includes data from a review of Pleistocene specimens from Central and Western Europe. It focuses on mandibular and cranial features to assess phenotypic diversity and dietary shifts in Hippopotamus antiquus populations.
An interim report by Wright Engineers Ltd. details the 1987 operation of Airgold and Beron Placer tailings pumping systems. The Beron pump failed after 150 hours of operation, while the Airgold system was used for 62.5 hours. The report concludes pumping sluice tailings is practical but premature wear affected economics.
Government of Yukon published a dataset on placer gold grains from the South Nahanni River drainage in Northwest Territories. The dataset likely contains morphological shape analysis and compositional data for Au-Ag-Cu-Hg values from electron microprobe analysis, comparing grains from Selena Creek and isolated showings. The dataset was last updated on April 17, 2026.
Kalzas, in central Yukon, is a porphyry-style wolframite deposit with an alteration zone exceeding 2 km in diameter and a mineralized oval area measuring 1500 m by 800 m. The dataset includes results from mapping, geochemistry, airborne surveys, trenching, and drilling conducted from 1981 to 1984, as well as a 2001 sample program showing tungsten oxide (WO3) grades from 0.3% to 0.5%. It is published by the Government of Yukon under an open license.
A geological dataset evaluates the origins of gold hosted by conglomerates of the Indian River formation, south of the Klondike goldfield in Yukon. It uses a combined sedimentological and mineralogical approach to distinguish between paleoplacer and epithermal gold sources. The dataset was published by the Government of Yukon and last updated on 2026-04-17.
Government of Yukon reports describe the placer mining industry in Yukon from 1978 to 1982. The volume contains two sections: industry-wide reports on regulations, deposit formation, and marketing, and descriptions of 288 individual mining operations. Information was compiled from field investigations, records, and the field notes of Dr. D.B. Craig.
A geological and geochemical dataset for the Teslin Crossing Pluton, a small (~75 kmΒ²) Early Jurassic alkalic plutonic complex in Yukon's Stikine Terrane. The data includes rock descriptions, mineralogy, and geochemical analyses (e.g., 3.1-3.4% K2O, 60-68% SiO2) relevant to gold-rich porphyry copper mineralization. It is published by the Government of Yukon on the open_canada platform.
344 claims on the Dromedary property northeast of Whitehorse contain rock units from the Proterozoic-Cambrian Hyland Group to Permian shelf sediments. Anaconda staked the area in 1980, and Blackstone Resources Inc. drilled in 1996, encountering massive sulphide mineralization in all five holes. Best samples from the Kal-Cave area contain 5.53% Pb and 5.83% Zn.
Liang Zhao published this dataset on figshare in May 2026. It contains single-nucleus multiomics data from a study on the transgenerational effects of prenatal exposure to bisphenol A (BPA) and bisphenol S (BPS) in mice. The data likely includes paired snRNA-seq and snATAC-seq results from neonatal spermatogonia across three generations (F1-F3) following exposure to environmentally relevant doses.