Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
43,899 datasets
A discussion paper, published by Geoscience Australia Data, that challenges the geological data presented by Wellman (1971) on fault systems in Papua New Guinea. The paper references ground and remote sensing surveys, including airphotos and radar imagery, conducted over the last 25 years by the Bureau of Mineral Resources and the Geological Survey of Papua New Guinea. It was last updated on the platform in May 2026.
Xueshi Li published experimental data for a monolithic meta-cavity platform on figshare in May 2026. The dataset, 1.5 GB in CSV format, likely contains measurements from a 200-nm-thin membrane device that demonstrates Purcell enhancement, spin-momentum-locked emission, orbital angular momentum generation, and holographic shaping. The data is licensed under CC-BY-4.0.
Geoscience Australia Data presents a geological hypothesis linking Australia's buoyant cratonic platform to its origins in the Gondwanaland supercontinent. The text-based resource, last updated in May 2026, contrasts non-marine facies in Australia with marine facies in Laurasia and postulates that Pan-African orogenic heat created a permanently buoyant lower crust via mafic underplating. It discusses evidence from the Australian Proterozoic shield and East Africa, and explains regional uplift patterns.
A 2018 research publication describes a method for generating continental-scale pixel composites of surface reflectance in coastal regions. The approach uses a multi-resolution tidal model and a Voronoi mesh to address tidal influences, enabling the creation of high and low tide mosaics for the Australian coastline. The composites are intended for coastal change detection and monitoring applications.
Twelve indicators on housing within the city of Utrecht, published by the municipality's Research & Advice department. The dataset includes figures on dwelling stock, property values, construction years, and ownership types. It is sourced from municipal tax cooperation, land registry, and housing corporation platforms.
An inventory of public information generated, obtained, acquired, or controlled by the Colombian National Planning Department (DNP) that has been classified as confidential or reserved. The dataset includes metadata such as classification rationale, responsible officials, and legal basis. It was published on datos.gov.co and last updated in May 2026.
The National Base Map service provides seamless topographic colour mapping for the whole of Australia, including outer islands and external territories. The service consists of data sourced from Geoscience Australia, Australian Antarctic Division, OpenStreetMap, and other programs like ACLUMP. The topographic information was checked in 2008 using satellite imagery and supplemented in 2009.
The Department of Management Agreements and Accountability and Institutional Relations of the MSSS manages permits for public and private institutions and facilities in Quebec. This dataset provides a snapshot of licensed capacities and services for these facilities. The data is also available on the M02-Directory of Establishments web portal, which may have daily updates, unlike this fixed-date publication.
Since the winter of 2001-2002, district foremen have recorded data on skating rink conditions. The database initially included districts corresponding to former delimitations, with other boroughs contributing from the winter of 2011-2012 onward. The dataset is provided by the Government and Municipalities of Québec.
The MYD17A2H Version 6.1 product provides cumulative 8-day composites of Gross Primary Productivity (GPP) and Net Photosynthesis (PSN) at a 500-meter pixel resolution. This data is produced by NASA's Aqua Moderate Resolution Imaging Spectroradiometer (MODIS) and is based on the radiation use efficiency concept. The dataset includes a quality control layer and was last updated in March 2026.
MODIS/Terra Gross Primary Productivity (GPP) Version 6.1 is a cumulative 8-day composite data product with 500-meter pixel resolution. It is produced by the National Aeronautics and Space Administration (NASA) and provides information on GPP and Net Photosynthesis (PSN) for modeling terrestrial energy, carbon, and water cycles. The dataset was last updated on March 13, 2026.
The VIIRS FILDA-2 Modified Combustion Efficiency (MCE) Version 2 swath product (VJ147IMG) is a satellite-derived dataset from the NOAA-20 satellite. It provides 83 layers of fire detection data, including retrievals of Fire Radiative Power (FRP), Visible Energy Fraction (VEF), and Modified Combustion Efficiency (MCE) at a 375-meter resolution in 6-minute orbit segments. The algorithm leverages visible band observations at night to assess combustion efficiency and detect smaller, cooler fires.
The VIIRS/JPSS1 FILDA-2 Fire Modified Combustion Efficiency Product (VJ147MOD) is a 750-meter resolution swath dataset produced in 6-minute orbit segments from the NOAA-20 satellite. It contains 85 layers for fire detection and retrievals of Fire Radiative Power (FRP), Visible Energy Fraction (VEF), and Modified Combustion Efficiency (MCE). The algorithm leverages visible band observations at night to assess combustion efficiency and detect smaller, cooler fires.
Newcastle City Council has tracked monthly usage of over a dozen digital library resources, including family history archives and citizenship test platforms, from January 2005 to the present. The dataset documents a significant service disruption, noting all library buildings closed from March 19, 2020, due to the coronavirus outbreak. It captures evolving metrics, such as Ancestry's shift from counting sessions to content pages viewed in June 2015.
Vehicle classification counts collected by the New York City Department of Transportation (DOT) for the New York Metropolitan Transportation Council (NYMTC) to validate the New York Best Practice Model (NYBPM). The data covers screenline locations in the New York area, with DOT collecting data on at least 10% of total NYBPM screenline locations annually. The dataset includes hourly vehicle counts by class from 2011 to 2025.
Northern Ireland's coast is assessed for physical asset vulnerability to erosion in this 2018 study. The layer represents a high-level preliminary Erosion Risk Appraisal, created by Amey Consulting with HR Wallingford for the Department for Infrastructure and DAERA. It compares areas of high, medium, and low erosion risk against asset values.
A 1.8 MB research publication by Yanhui Li, last updated in April 2026, proposing a new nonparametric framework for testing independence between two sets of variables. The method uses weighted and unweighted graph representations and is designed for data where only pairwise distances are observed. Simulation studies suggest the methods control type I error rates and exhibit higher power than competing approaches.
ESGenius is a benchmark for evaluating large language models on Environmental, Social, and Governance (ESG) and sustainability knowledge, presented as an oral paper at the EMNLP 2025 Main Conference. The paper was nominated for the EMNLP 2025 Resource and Theme Paper Awards (top 1%). It was authored by cy0307 and last updated on June 15, 2026.
The Australia’s Future Energy Resources project reinterpreted the geology of the Pedirka and western Eromanga basins using new seismic and biostratigraphic data. This dataset likely contains interpretations of sedimentary records from the early Paleozoic to Late Cretaceous, focusing on fluvial-lacustrine and shallow marine environments. It was produced by the Australian Ocean Data Network as part of the Exploring For The Future program.
A briefing package prepared for a Standing Committee on Public Accounts hearing on October 21, 2025. The document likely contains analysis and summaries of the Auditor General of Canada's 2025 Fall Reports. It was published by the Office of the Auditor General of Canada and last updated on the open_canada platform in May 2026.