Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,472 datasets
2026-03-24 updated geospatial dataset from the City of Canning's GIS Services. It maps footpath centerlines as polylines, including paths owned by the City and other organizations like Main Roads WA. The data includes attributes for street name, suburb location, and path type classification.
41,664 images of Burmese text at 512 × 512 resolution comprise this dataset for OCR and multimodal AI research. It contains 1,139 unique Burmese text entries, each rendered in 32 stylistic variations, and was created by author kalixlouiis. The dataset was last updated on 2026-04-23.
City of Canning data contains footpath polygons digitized from aerial photography. The dataset includes attributes for street name, suburb location, path type, and construction material.
A high-resolution bathymetric surface covers the seafloor between Gantheaume Point and Talboys Rock near Broome, Western Australia. The Australian Hydrographic Office commissioned this survey from September 25 to 26, 2020, for calibrating multibeam echosounders. The processed data is provided as a 0.5-meter resolution GeoTIFF grid in multiple vertical datums.
A high-resolution bathymetric surface covers the seafloor between Gantheaume Point and Talboys Rock near Broome, Western Australia. The Australian Hydrographic Office commissioned this survey from September 25 to 26, 2020, for calibrating multibeam echosounders. The processed data is provided as a 0.5-meter resolution GeoTIFF grid in multiple vertical datums.
110 episodes of robot teleoperation data for 4 tasks, totaling 55,517 video frames at 30 fps, created using the LeRobot framework. The dataset was published by user juyoungggg on Hugging Face and last updated on May 14, 2026. It contains both data files and video files, structured into chunks for training.
The BIG register is a public database of healthcare professionals in the Netherlands, mandated by the BIG Act. It contains over 350,000 registered providers and is maintained by the CIBG, an implementing organization of the Ministry of Health, Welfare and Sport. The register aims to provide clarity about the competence of healthcare providers.
Research into the strategies of intermediary organizations proficient in the temporary use of vacant buildings as workplaces for social and creative entrepreneurs. The study analyzes and compares the strategies of three organizations—PreCare, Spare Space, and Urban Resort—across four aspects in the cities of Groningen, Brussels, and Amsterdam. The dataset is a PDF document published by the Dutch Ministry of the Interior and Kingdom Relations under a CC-BY-4.0 license.
De Stichting van de Straat, a collaboration of five organizations active in homeless reception in Groningen, commissioned a study among applicants for a letter address. The dataset likely contains survey results from this study, conducted by O&S Groningen on behalf of the foundation. The data is provided by the Dutch Ministry of the Interior and Kingdom Relations under a CC-BY-4.0 license.
EgoMonth is a large-scale dataset of egocentric videos collected from wearable cameras. The dataset includes structured annotations for visual question answering, temporal reasoning, spatial reasoning, and long-term reasoning. It was created by anonymous-egomonth and last updated on Hugging Face in May 2026.
juyoungggg's dataset contains 7 episodes of robot teleoperation data for a single task, totaling 4,009 frames. It was created using the LeRobot framework for a bi-manual robot system. The dataset was last updated on May 14, 2026.
Silica concentrations in water from Loch Leven, Scotland, were collected between 2007 and 2023. The UK Centre for Ecology & Hydrology (UKCEH) conducted sampling and analyses as part of a long-term monitoring programme that began in 1968. This data collection was supported by the Natural Environment Research Council (NERC) award NE/R016429/1 under the UK-SCAPE programme.
Australia is covered by a de-trended global isostatic residual gravity map derived from the 2019 B Series national gravity grids. The map combines approximately 1.4 million ground observations, 345,000 line km of airborne gravity, and 106,000 line km of gravity gradiometry data, sourced from government, industry, and research entities from the 1940s onward. It is presented as an HSI image with northwest shading and a linear color scale from -500 to +500 µm.s⁻².
Digital bathymetry, gravity, and magnetic grids for Australia's marine margin produced by the Australian Geological Survey Organisation and partners. The grids have resolutions of 250-1000 meters and represent an upgrade of marine ship-track data. Levelling techniques were used to integrate data from ship tracks, satellites, and high-resolution onshore sources.
This dataset contains 120 PDF pages from the U.S. Department of War's May 8, 2024 declassified UFO/UAP report. Each page is rendered as a 200 DPI JPEG image, paired with metadata extracted from the official release manifest.
Municipal boundaries and annexation records for the state of Maryland, maintained by the Maryland Department of Planning. The dataset includes fields for municipality name, annexation date, resolution number, and classification within Priority Funding Areas. It is provided in multiple formats including CSV, JSON, XML, and RDF.
Maryland coastal and estuarine shorelines are analyzed for rates of erosion and accretion using transect data. The dataset contains calculated change rates derived from shoreline vectors dating from 1841 to 1995, produced by the Maryland Geological Survey and U.S. Geological Survey using the Digital Shoreline Analysis System (DSAS). Updates began in 2015 to incorporate newer shoreline data for specific counties.
Boundaries for Maryland's Priority Funding Areas, a state policy tool for directing growth and infrastructure investment. The dataset is created and maintained by the Maryland Department of Planning and includes certification dates, eligibility status, and jurisdictional codes for 24 counties and Baltimore City. Its last known update was July 15, 2024.
KiTS23 is a dataset for kidney tumor segmentation. It contains CT scans with dense segmentation annotations for the kidney, tumor, and cyst. The dataset was uploaded by NanGongMing0514 to Hugging Face in May 2026.
50 synthetic Australian medical PDFs modelled on NSW Health practice, released under CC-BY-NC 4.0. This free sample is part of a larger 5,000-document library and is pre-labelled with structured ground truth and pixel-precise bounding boxes. The dataset was created by RootCauseAnalytics and last updated in May 2026.