Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,673 datasets
The British Geological Survey maintains one of the world's largest databases on mineral production and trade, covering more than 70 economically important mineral commodities. Annual production statistics by mass are recorded for individual countries, grouped by continent, with import and export data available up to 2002. The data is compiled from primary official sources and is used by government, industry, and researchers for policy, economic analysis, and commercial strategy.
A 1.1 GB dataset from figshare supports research into brain implants that measure neural magnetic fields. Computational modeling illustrates how dense neuron networks are easier to distinguish via magnetic spike templates. The dataset, authored by Ziad Ali and last updated in April 2026, is shared under a CC-BY-4.0 license.
Benthic chamber measurements of oxygen, ammonium, nitrate, nitrite, phosphate, silicate, TCO2, and alkalinity define solute exchange rates between sediment and water in Port Phillip Bay. Data from the summers of 1994 and 1995 across various sites show benthic recycling accounted for 63% and 72% of the annualized N and P input to the entire bay, respectively. The dataset, sourced from the Australian Ocean Data Network, also includes radon-222 and CsCl spike injection measurements to study bio-irrigation.
A bathymetry survey covering an area east of the Approaches to Newcastle, NSW, acquired from 4 December 2020 to 15 January 2021. The survey was conducted for the Australian Hydrographic Office by Guardian Geomatics using a Kongsberg EM 2040-07 multibeam echosounder. Data was processed with Caris Hips & Sips software and exported as a 30-meter resolution, 32-bit floating point GeoTIFF grid.
A cleaned mathematical supervised fine-tuning dataset designed for instruction tuning and mathematical capability adaptation. The dataset introduces a simplified instruction–response format and removes intermediate reasoning contamination. It was created by author kaushik-harsh-99 and was last updated on 2026-06-07.
A 2021 bathymetry survey of Lacepede Channel, Western Australia, acquired between 19 May and 22 September. The data was collected for the Australian Hydrographic Office by Fugro using a Kongsberg EM2040 Mk II multibeam sonar and processed with Caris Hips & Sips software. The final product is a 30-meter resolution, 32-bit floating point GeoTIFF grid.
Australia's Identified Mineral Resources 2010 provides estimates of the country's mineral resources as of December 2009, based on data from Geoscience Australia. The report compares these long-term resource estimates with short-to-medium term industry ore reserves and includes analysis of mineral exploration expenditures for 2008-09. Data on mine production is sourced from the Australian Bureau of Agricultural and Resource Economics and Sciences, with world rankings calculated from United States Geological Survey publications.
Supplementary materials and structured data from a systematic literature review on Renewable Energy Certificates (RECs), International RECs (I-RECs), and Guarantees of Origin (GOs). The review followed PRISMA 2020 guidelines and addressed five core research questions on investment, storage, regulation, demand, and business models. The dataset was created by Flavio Geraldo Nogueira and last updated in June 2026.
A metadata catalogue containing key information for datasets available on the Government of Canada's Open Data portal. It includes multiple flattened resources such as datasets metadata, resources metadata, and resource views metadata. The data was last updated in March 2026 by the Treasury Board of Canada Secretariat.
A computational study investigating the structural impacts of two genetic variants (Trp240Arg and Arg226Cys) in the IL2RG protein. The dataset, authored by Aswini S and last updated in April 2026, contains results from homology modeling, molecular dynamics simulations, and protein-protein docking analyses. It includes binding free energy calculations comparing wild-type and variant complexes with IL-2 and IL-21 cytokines.
A bitext corpus assembled from upstream releases of mtdata and OPUS projects. The dataset includes source and target sentences with ISO 639-3 language codes and origin sub-corpus identifiers. It was created by author natgillin and last updated on June 2, 2026.
Yulong Su published a 2.3 GB dataset on figshare in 2026 containing seismic waveform and event data. The collection includes SAC waveform files and Excel tables detailing 94 events for PKPPcP analysis and 57 events for PKPPcP–PKKPab phase pairs analysis, along with corresponding event-station pair information. The data is used to study de-degeneracy effects of specific seismic phases and implications for 3-D Earth's mantle heterogeneity.
Estimates to the nearest thousand of employed people in London who have more than one job. The data is derived from the UK Office for National Statistics' Annual Population Survey, with records starting from 2004. It is published by the Greater London Authority.
Data from 2002 onward, collected by the Atmospheric Infrared Sounder (AIRS) aboard NASA's Aqua satellite, provides calibrated, geolocated infrared radiances for approximately 2378 spectral channels, simulated for cloud-free conditions. These radiances are a fundamental input for retrieving standard atmospheric products and are generated globally at all observation points, with 240 data granules produced per day. The dataset's high spectral resolution (R=1200) and synergy with microwave sensors (AMSU/HSB) support detailed atmospheric profiling.
The Yukon Mineral Exploration Program (YMEP) is a funding program for mineral exploration in Yukon. It provides financial support to prospectors, partnerships, and companies across four modules for hard rock and placer resource projects. The dataset is published by the Government of Yukon and was last updated on April 17, 2026.
Registro_activos_informacion documents compliance with the Transparency and Access to Public Information Index (ITA) mandated by Colombia's Procuraduría General de la Nación under Law 1712 of 2014 and Resolution 1519 of 2020. The dataset is hosted on the Colombian open data portal www.datos.gov.co and was last updated on May 18, 2026. It likely contains metadata records describing published information assets.
Quan Zuo's dataset from 2026 presents a programmable platform using eight bi-triazine cross-linkers to establish structure–conformation–biology relationships for peptide therapeutics. The data likely contains results from binding assays, cell studies, and in vivo PET/CT imaging for cyclic RGD and dimeric KTLLPTP peptide models targeting integrin αvβ3 and Plectin-1. The work identifies a lead candidate with high tumor contrast and provides a framework for precision peptide engineering.
Aqua/AIRS L2 Cloud-Cleared Infrared Radiances are calibrated, geolocated infrared radiance measurements from the Atmospheric Infrared Sounder aboard NASA's EOS Aqua satellite. The data product contains channel-by-channel radiances for approximately 2378 spectral channels, processed to simulate cloud-free observations within each Advanced Microwave Sounding Unit footprint. It is generated as a separate, high-volume output from the AIRS Standard Product due to its size, with a temporal resolution of 6-minute granules and a 16-day orbit repeat cycle.
A study by Caroline Varella Rodrigues investigated in situ biomethanation as a biogas upgrading strategy. The data likely contains results from fed-batch reactors treating pulp and paper industry wastewater, with hydrogen injected at two pressures to evaluate methane production and microbial dynamics. The dataset was last updated on 2026-04-24.
A cross-sectional survey of 3,795 middle school students in Hangzhou, China, collected by Ruiyi Chen. The dataset includes assessments of emotional distress using DASS-21, gaming motives using MOGQ, and Internet Gaming Disorder symptoms using IGDT-10. Network analysis was performed to explore the interrelationships among these constructs and identify core and bridge symptoms.