Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
1,907 datasets
SlovakSpeechMale is a speech synthesis dataset containing approximately one hour of Slovak language audio recorded by a male speaker. The dataset is hosted on Hugging Face by the author 'neurlang' and was last updated in May 2026. It is specifically designed for text-to-speech (TTS) applications and includes Slovak text transcripts.
Raw Emocean is an English speech dataset designed for training autoregressive text-to-speech models. It contains 8,649 audio segments totaling 15.39 hours, sourced from 22 videos, with an average segment duration of 6.4 seconds. The dataset was created by author somu9 and last updated on Hugging Face in April 2026.
22 institutionalized older adults participated in a quasi-experimental study examining arts-based interventions. Daniela Lourenço collected data on life satisfaction and depressive symptoms at baseline and post-intervention using the Satisfaction With Life Scale and Geriatric Depression Scale. The dataset was last updated on 2026-04-22 via Harvard Dataverse.
CYGNSS Level 1 Science Data Record Version 2.1 provides calibrated Delay Doppler Maps from a constellation of eight satellites. The dataset includes bistatic radar cross section measurements, quality flags, and geolocation parameters, with up to eight files generated daily. NASA produced this second science-quality release, which includes improvements like additional data during orbital maneuvers and reduced measurement biases.
A booklet describing the musical landscape and live performances in the city of Groningen during 2010. The document was published by the Dutch Ministry of the Interior and Kingdom Relations and is available under a CC-BY-4.0 license. The exact data format and volume within the PDF are unspecified.
9,941 high-quality examples of advanced music production workflows, created by author 11-47. The dataset is intended for training large language models to act as elite music producers across genres like Rap, Crunk, and Dubstep. It was last updated on April 23, 2026.
30 speakers contributed over 300,000 audio clips for text-to-speech synthesis, with individual speaker durations ranging from 5 to 118 hours. The dataset, titled 'Nsfw Tts Dataset 30Speakers', was created by author DMC-ykfx33 and is hosted on Hugging Face. It was last updated on April 17, 2026.
South African English audio recordings and transcriptions for developing Lwazi speech recognition systems. The corpus contains telephone-quality audio files at 8 kHz, 16-bit, mono channel, with corresponding orthographic transcriptions in Unicode text format. Researcher Jaco Badenhorst created this dataset, which was last updated in April 2026.
El Mnasra cave archaeological data provides descriptive measurements and condition assessments for shell bead specimens from US 8. The dataset includes features such as morphological type, perforation details, use-wear intensity, and evidence of heating or pigment. It was created by Emilie Campmas and is available as an XLSX file under a CC BY 4.0 license.
Emilie Campmas published a dataset in 2026 detailing taphonomic and use-wear features for mollusk shells from US 8 of El Mnasra cave. The dataset records surface preservation, abrasion types, smoothing intensity, and presence of ochre or heating for species including Tritia cf. gibbosula and Columbella rustica. It is a small, specialized archaeological dataset shared as an 11.6 KB XLSX file under a CC BY 4.0 license.
Sampleflip MIDI Chord Progressions is a collection of 3,764 MIDI files containing chord progressions. The dataset was created by author ronantakizawa and was last updated on 2026-04-21. It is used by SampleFlip for melody derivation and harmonic reference.
A high-quality audio dataset designed for training and fine-tuning NSFW text-to-speech models. It includes over 1,000 hours of audio from 30 characters, with annotations for emotion and sound effects. The dataset was created by DMC-ykfx33 and was last updated on Hugging Face in April 2026.
500,000 user profiles containing top artists, tracks, albums, and playcounts. The dataset includes rankings, user countries, and MusicBrainz IDs where available, created by GabeKahen and last updated on April 28, 2026. It is designed for modeling music taste and analyzing listening behavior.
Paolo Bonora's dataset provides examples of classifying musicological terms of interest using Large Language Models (LLMs). The dataset is 59.6 KB in size and was last updated on April 15, 2026. It is available under a CC-BY-4.0 license on figshare.
9,941 high-quality examples of music production workflows and reasoning, created by author gss1147. The dataset is intended for training large language models to become elite music producers across genres like Rap, Crunk, East Coast Boom Bap, West Coast G-Funk, and Dubstep. It was last updated on April 23, 2026.
54 hours 54 minutes of studio-quality Hindi speech from a single professional female voice artist, recorded at 48 kHz and 24-bit. The dataset contains 22,058 utterances, split into 21,662 for training and 396 for evaluation, and was created by the Indian Institute of Science (IISc) SYSPIN project. It was uploaded to Hugging Face by user 'somu9' and last updated on April 15, 2026.
100,000+ hours of speech data collected from a community of over 2 million contributors across 180+ countries and 100+ languages. This sample dataset is provided by SilencioNetwork, with the full corpus last updated in April 2026. The data is described as being collected 'in the wild' from an opt-in community.
Moroccan Darija dialect speech recordings for automatic speech recognition. The dataset contains audio files in WAV format at 16 kHz mono, sourced from segmented YouTube excerpts. It was created by author mohamedmou and last updated on 2026-04-27.
A 12.1 KB PDF file containing a musical score for Capriccio 22 in B flat minor, sourced from the Berkeley Ms 791 manuscript on page 87. The dataset was authored by Matthew James Zenas Dicken and last updated on April 13, 2026. The description notes the piece features varied texture.
A PDF document containing the musical score for Capriccio 21 in A major, sourced from the Berkeley Ms 791 manuscript on page 86. The 11.7 KB file was authored by Matthew James Zenas Dicken and last updated on April 13, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.