Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
1,908 datasets
NOAA NGS ortho-rectified mosaic tiles created from imagery acquired between August 10 and October 21, 2009. The National Oceanic and Atmospheric Administration produced this data through its Integrated Ocean and Coastal Mapping initiative using an Applanix Digital Sensor System. The source imagery was acquired at a higher resolution than the final mosaic product.
Ortho-rectified mosaic tiles were created from aerial imagery acquired on June 7, 2011, using an Applanix Digital Sensor System (DSS). This data product is part of the NOAA Integrated Ocean and Coastal Mapping initiative, covering the Maine coastline from Cutts Island to Prouts Neck. The source images were acquired at a higher resolution than the final ortho-rectified mosaic.
Ortho-rectified mosaic tiles created by NOAA's Integrated Ocean and Coastal Mapping initiative. The source aerial imagery was captured from June 5 to June 7, 2011, using an Applanix Digital Sensor System. The final product is derived from higher-resolution original images.
New Bedford, Massachusetts, is covered by ortho-rectified mosaic tiles produced by the NOAA Integrated Ocean and Coastal Mapping initiative. The source imagery was acquired on October 5, 2011, using an Applanix Digital Sensor System. The final mosaic is derived from higher-resolution original images.
A unified Danish speech recognition dataset combines approximately 3.5 million audio samples from seven distinct sources, totaling roughly 16,000 hours of speech. The collection includes European and Danish Parliament recordings, read-aloud and conversational speech, broadcast media, and crowd-sourced samples. It was created by syvai and last updated on the Hugging Face platform in April 2026.
DMSP OLS satellite data provides visible and infrared imagery for monitoring global cloud distribution and cloud top temperatures twice daily. The archive includes low-resolution global and high-resolution regional imagery from a 3,000 km scan, alongside satellite ephemeris and solar/lunar information. Data is sourced from the DMSP Operational Linescan System instruments and archived by NOAA NCEI.
A bilateral social security agreement and administrative arrangement between Canada and the Federation of Saint Kitts and Nevis. The agreement coordinates the two countries' social security systems for individuals who have lived or worked in both jurisdictions. It was published by Global Affairs Canada and is marked as archived as of February 2026, meaning it may be out of date and is retained for research purposes only.
TaigiSpeech is a spoken language understanding dataset containing over 3,000 Taiwanese speech utterances from 21 speakers. Each utterance is labeled with one of 8 intent classes, designed for elder-care and smart-home voice command scenarios to support research in a low-resource language.
A 5.5 KB tabular dataset documents the surface preservation condition of Tritia cf. gibbosula mollusk shells from archaeological unit US 8 at El Mnasra cave. The dataset, authored by Emilie Campmas and shared under CC BY 4.0, provides a taphonomic record for paleontological and archaeological analysis.
Slakh2100 is a large-scale dataset containing 2,100 automatically mixed music tracks with isolated instrument stems and aligned MIDI data. Created by Manilow et al. in 2019 at Northwestern University, it is designed for music information retrieval and source separation research. The dataset is hosted by schism-audio on Hugging Face.
A collection of 104,478 fully synthetic duplex conversations provides 2,133 hours of 16 kHz audio for training real-time spoken dialogue models. The dataset was created by mailong225 for the RelayS2S hybrid architecture by converting text dialogues to speech. It was last updated on March 25, 2026.
Commonwealth Of Massachusetts Payrollv3 is a production dataset from the Statewide Payroll site. It contains payroll information for state employees, as indicated by columns such as NAME_FIRST, NAME_LAST, POSITION_TITLE, DEPARTMENT_DIVISION, and various pay-related fields. The data is hosted by cthru.data.socrata.com and was last updated on 2026-03-17.
Mooré Speech Bible is a curated collection of aligned audio and text data in the Mooré language (ISO 639-3: mos), gathered from publicly available religious sources. The dataset is intended for research in low-resource speech and language processing. It was created by goaicorp and was last updated in April 2026.
Supplementary materials for the article "Perceiving musical interaction between digital and acoustic instruments: A case study with the Karlax". The repository contains raw data results from a free categorization experiment and the video stimuli used in that experiment. Author Linglan Zhu and collaborators prepared this data, which was last updated on April 25, —.
Vaja-Thai is a unified Thai speech dataset containing 289,916 audio samples totaling 554.6 hours for Text-to-Speech research. The dataset was created by dubbing-ai and last updated in April 2026. All audio is resampled to 24 kHz WAV format and combines multiple quality-filtered sources.
Twenty-five distinct statistical reporting areas cover a large portion of the Gulf of Maine and waters to the south, including Massachusetts territorial waters. These data represent commercial shellfish landings in bushels, recorded by species, harvest location, and year from 1990 to 2001. Records originate from the Massachusetts Division of Marine Fisheries' Commercial Shellfish Landings Database and are used for mapping and annual publications.
1DamnAudio is a large-scale Urdu text-to-speech dataset created by mahwizzzz and last updated on 2026-03-18. It contains 221,929 audio-text pairs totaling 208.5 hours of speech. The audio files are in 16 kHz WAV format, with an average duration of 3.38 seconds.
A spoken collection of traditional Mooré riddles created for academic and research use in low-resource speech and language technologies. The dataset was created by goaicorp and was last updated in April 2026. It is designed to support work in text-to-speech, automatic speech recognition, and oral tradition modeling for the Mooré language.
Goai Moore Speech Proverbs is a bilingual audio-text corpus of traditional proverbs in Mooré and French. Created by goaicorp, it is designed for research in low-resource speech and language processing. The dataset was last updated in April 2026.
Goai Moore Speech Proverbs is a bilingual audio-text corpus containing traditional proverbs in the Mooré and French languages. It was created by goaicorp for research in low-resource speech and language processing, with the dataset page last updated in April 2026. The dataset is specifically designed for academic purposes in text-to-speech and automatic speech recognition for Mooré.