Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
14,865 datasets
Hourly maximum air temperature records from automated weather stations across Colombia. Data has undergone basic quality control per World Meteorological Organization recommendations, with some values corrected or removed. The dataset is provided by the Colombian government's open data portal, www.datos.gov.co, with a last recorded update in April 2026.
NASA's Advanced Composition Explorer (ACE) satellite provides propagated solar wind data, processed to a 60-second resolution. The data consists of tri-axial fluxgate magnetometer measurements in Geocentric Solar Ecliptic (GSE) coordinates, propagated to a point just outside Earth's bow shock using a minimum variance technique. This dataset was constructed by Dr. J.M. Weygand for Prof. R.L. McPherron under National Science Foundation grants and was used in superposed epoch studies.
Propagated solar wind and magnetic field data from the ACE satellite, linearly interpolated to a consistent 60-second resolution in GSM coordinates. The data was constructed by Dr. J.M. Weygand for Prof. R.L. McPherron under NSF grants ATM 02-1798 and ATM 02-08501, primarily for superposed epoch studies. The propagation method, based on the minimum variance technique, is detailed in peer-reviewed publications by Dan Weimer et al. from 2003 and 2004.
Propagated solar wind data from the ACE satellite, processed to a 60-second cadence and projected to a point just outside Earth's bow shock. The dataset was constructed by Dr. J.M. Weygand for Prof. R.L. McPherron under NSF grants ATM 02-1798 and ATM 02-08501, primarily for superposed epoch studies. The propagation method uses a minimum variance technique on the magnetic field, as detailed in publications by Dan Weimer et al. (2003, 2004).
Australian Ocean Data Network hosts a dataset of Upper Jurassic (Kimmeridgian and Tithonian) and Lower Cretaceous (Neocomian and Aptian) marine fossils from the Dampier Peninsula in Western Australia. The fossils were documented in publications from the 1940s to 1958, with some specimens lost in a 1953 fire. The dataset includes photographic illustrations of the fossils.
I1 Captions is a collection of image-text pairs aggregated from multiple sources, hosted by zlab-princeton on Hugging Face. The dataset comprises several large subsets, including fluxreason (5.9M rows), imagenet22k (13.7M rows), megalith10m (9.4M rows), and others, with the most recent update recorded on May 13, 2026. It provides multiple descriptive captions per image across different subsets.
Propagated solar wind data from the Wind satellite, processed to a 60-second resolution in GSM coordinates. The data was constructed by Dr. J.M. Weygand for Prof. R.L. McPherron under NSF grants and uses a minimum variance propagation technique. A version 2 exists with a corrected offset in the Bz component after November 2004.
Automated multi-label annotations for the ImageNet-1K training split include spatial masks for selected object-level labels. The dataset is designed for easy inspection and reuse but does not include the original images. The annotations were created by author k3999 and last updated on June 1, 2026.
Wind direction data recorded at 10-minute intervals from automated weather stations across Colombia. The dataset includes quality-controlled observations from stations managed by IDEAM and partner entities under inter-administrative agreements. Data is provided as raw, near-real-time sensor readings for transparency and risk management support.
The Australian Ocean Data Network hosts data from a 1976 cruise to sample the Cape Leeuwin manganese nodule deposit. HMAS Diamantina occupied 9 stations, recovering about 2000 nodules from the sea bed for chemical and mineralogical study. The deposit covers an area of approximately 900,000 km^2 on the Indian Ocean floor southwest of Cape Leeuwin.
A version of the Robust systems and multifunctional areas policy adopted by the Provincial Staten van Drenthe on 2 July 2014. The dataset likely contains geospatial information defining areas where a single main function (living, working, water, nature, or agriculture) is paramount, alongside multifunctional areas with mixed ambitions. It is provided by the Ministerie van Binnenlandse Zaken en Koninkrijksrelaties under a CC-PDM-1.0 license.
1976 geological mapping of the Mount Anderson Sheet area examined an exposure of the Jurassic Jurgurra Sandstone. The Australian Ocean Data Network presents an environmental analysis of a section containing evidence of aeolian deposition, with vertical thickness averaging about 5 meters. The note correlates the unit with subsurface Wallal Sandstone, which thickens to 369 meters in the Munro No.1 well.
Epicure corpus resources provide the canonical vocabulary and validation results for three ingredient-embedding models. The dataset includes cross-modal validation against external USDA and FlavorDB labels, WEAT and Procrustes robustness checks, and a full SLERP direction-arithmetic result table. It was created by Kaikaku and last updated on May 27, 2026.
Mid Ulster Council provides geospatial data on the location of its main offices. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license and is available in multiple formats including KML, GeoJSON, and CSV. The council was established on 1 April 2015, replacing three legacy district councils.
Barnet Homes is an Arm’s Length Management Organisation (ALMO) established in 2004, wholly owned by the London Borough of Barnet. The dataset describes a 10-year Management Agreement for providing housing management services to council-owned tenanted and leasehold properties. Services covered include income collection, empty property management, repairs, estate cleaning, and grounds maintenance.
A first-person underwater video test set captured with a BlueROV2 Heavy ROV. The footage was recorded across 15 distinct motion sequences and is annotated in YOLO format. The dataset was created by author rifqijuli and last updated on 2026-05 25.
Geoscience Australia houses one of the world's largest collections of petroleum data. The collection includes well completion reports, logs, analysis reports, seismic profiles, and core photography submitted by industry and research projects. Data is accessible via the National Offshore Petroleum Information Management System (NOPIMS).
A 674.9 KB dataset on figshare, last updated April 15, 2026, by Jing Qiu. It investigates dissolved organic matter released from Japanese knotweed-derived biochar produced at 500 °C. The data compares seven extraction methods, measuring dissolved organic carbon, aromaticity, molecular weight, and fluorescence components.
Hourly minimum air temperature observations from automated weather stations across Colombia. Data undergoes basic quality control per World Meteorological Organization guidelines and is published by www.datos.gov.co. The dataset was last updated in April 2026.
A research paper details the impact of the probiotic Enterococcus faecium HDRsEf1 on host health. The study uses DSS-induced colitis in mice and nursery pig models to evaluate changes in gut microbiota composition and fecal metabolite profiles. The 189.8 KB PDF was authored by Shuaifei Feng and last updated in April 2026.