General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
166,477 datasets
Legacy product from the Australian Ocean Data Network with no abstract available. The dataset likely contains aerial photographs documenting coastal changes in southeast Queensland. It was last updated on 2026-06-23.
Produced during the GRIP Field Experiment, this dataset contains satellite-derived overshooting top magnitudes for tropical storms and hurricanes. It was created by NASA for use with the Real Time Mission Monitor tool to study storm formation and intensification. The data is visualized as color-coded overlays in Google Earth.
Legacy product from the Australian Ocean Data Network with no abstract available. The document collection likely outlines strategic approaches and ideas for future marine research programs, focusing on new basins. It was last updated on 2026-06-23.
Scott Plateau - structure, isopach, and potential field maps is a legacy geophysical dataset published by the Australian Ocean Data Network. No abstract or detailed description is available beyond the map types named in the title. It was last updated on 2026-06-23.
Legacy product from the Australian Ocean Data Network focusing on the geological history of the Arafura Sea. The dataset is published on data_gov_au and was last updated on 2026-06-23. Metadata is minimal; the actual data content and structure require verification after download.
San José de Cúcuta municipality provides data on free public Wi-Fi zone usage from 2021 to 2022. The dataset likely contains session counts and user demographics, including gender, age, device type, and operating system. It originates from the Colombian open data portal www.datos.gov.co and was last updated in May 2026.
This data release covers northern California and Nevada. It contains raw time series and metadata for 827 minidisk infiltrometer measurements made across nine burned areas and nearby unburned areas. Scott McCoy authored the dataset, which covers measurements from 2018 to 2023.
Mooring data for offshore drilling vessels, likely containing positional or operational information. The dataset is published by the Australian Ocean Data Network on data_gov_au and was last updated on 2026-06-23. The raw description indicates it is a legacy product with no abstract available.
1995 to 2014 monthly gridded climatologies of total lightning flash rates derived from two satellite-based sensors, the Optical Transient Detector (OTD) and Lightning Imaging Sensor (LIS). The dataset provides a merged, long-term record, with robust tropical and subtropical coverage from LIS and high-latitude data from OTD. It is produced by the National Aeronautics and Space Administration and is available in formats including BIN, ISO, HTML, and PDF.
Legacy product from the Australian Ocean Data Network concerning the Russian oceanographic vessel 'Vitiaz'. The dataset likely contains information on the vessel's techniques and equipment. Metadata is minimal, with the record last updated on 2026-06-23.
ARTPARK-IISc's Vaani Benchmark V1.0 is a curated Hindi automatic speech recognition (ASR) evaluation set. It contains 5,343 audio segments from 1,103 speakers across 104 Indian districts, totaling approximately 11.7 hours. Each audio segment includes three independent human transcriptions.
NASA's MEaSUREs program provides a daily record of global landscape freeze/thaw status at 6 km resolution. The data is derived from microwave radiometer observations by JAXA's AMSR-E and AMSR2 instruments. This dataset is maintained by NASA and is available on multiple government platforms.
Geoscience Australia's Science Principles document outlines six foundational principles guiding its scientific work. The principles, which include Relevance to Government and Quality Science, are embedded into the agency's long-term strategic planning and daily operations. The document, published by the Australian Ocean Data Network, was last updated on May 5, 2026.
Legacy product from the Australian Ocean Data Network, last updated on 2026-06-23. The dataset likely contains geospatial and geological information assessing the potential for offshore phosphate deposits in the southwest Pacific region. Available file formats include HTML and PDF.
Legacy product from the Australian Ocean Data Network with no abstract available. It contains cation electrode measurements collected in the Capricorn area of the southern Great Barrier Reef province. The dataset was last updated on 2026-06-22.
AGSO Formats for Marine Navigation Digital Data is a legacy dataset from the Australian Ocean Data Network. Its columns suggest it contains geophysical and marine navigation information, likely for scientific and operational use. Metadata is minimal; actual content requires verification after download.
Embodied-R1.5-SFT-Dataset is a subset of the Stage 1 Supervised Fine-Tuning data used to train the Embodied-R1.5 model. The dataset is hosted by author IffYuan on Hugging Face, with a partial release noted as of June 9, 2026. The full dataset is described as a work in progress, with JSON files still being uploaded.
1,507 episodes of robot demonstrations for the Tower of Hanoi puzzle, comprising 3,264,454 frames at 60 frames per second. The dataset was created using LeRobot and is hosted on Hugging Face by the author jellyho. It was last updated on 2026-06-13.
Global lightning signatures were detected from visible channel imagery by the Defense Meteorological Satellite Program (DMSP) Operational Linescan System (OLS) flown on satellite F12. The dataset contains extracted time and location data for each lightning streak, stored in monthly HDF files. It was produced by the National Aeronautics and Space Administration and covers a seven-month period from May through November 1995.
1.6 MB of mass spectrometry data from a study identifying photoproducts of the antibiotic sulfamethoxazole (SMX) under environmentally relevant UV irradiation (300–350 nm). The dataset was authored by Pavla Fojtíková and last updated in June 2026. Files are provided in XML and MGF formats.