Loading...
Loading...
Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
1,962 datasets
viVoice provides between 100,000 and 1,000,000 Vietnamese audio-text pairs for multi-speaker speech synthesis, released by capleaf in 2024. The dataset is specifically formatted for text-to-speech tasks and is distributed via Parquet files.
LibriSpeech contains approximately 1000 hours of 16kHz read English speech. The corpus was prepared by Vassil Panayotov with assistance from Daniel Povey, derived from audiobooks in the LibriVox project. The dataset was uploaded to Hugging Face by nguyenvulebinh in December 2022.
Water depth and temperature data collected from February 7, 1989 to December 14, 1989 as part of the Global Temperature-Salinity Pilot Project (GTSPP). The data was gathered by the Australian Oceanographic Data Center using bathythermograph (XBT) instruments aboard HMAS Cook and submitted to NOAA NCEI.
Bathythermograph (XBT) data on water depth and temperature collected by the Australian Oceanographic Data Center. The data was gathered from the HMAS Cook as part of the Global Temperature-Salinity Pilot Project (GTSPP). It covers a specific time range from January 20, 1987, to November 9, 1987.
7,537 bathythermograph observations of water depth and temperature collected from over 45 different ships. The data was submitted by Ted Dalzell of the Hydro Department in Birkenhead, UK, and spans from August 1947 to October 1983. It is now available online through NOAA NCEI in C116 and C128 file formats.
From November 28, 1988 to December 10, 1988, water depth and temperature data were collected as part of the Global Temperature-Salinity Pilot Project (GTSPP). The data originates from bathythermograph (XBT) casts taken from the vessel HMAS Cook and was submitted by the Australian Oceanographic Data Center. It is archived by NOAA's National Centers for Environmental Information under accession 9500115.
Seventeen years of ocean current data were collected by the U.S. Geological Survey using current meters deployed in New England coastal waters. The collection includes U and V velocities in cm/s, rotor speeds, current directions in degrees, and water temperatures. Data collection spanned from May 1975 to February 1992.
Geospatial data details the proposed locations for individual wind turbines in the Cape Wind energy project on Horseshoe Shoal in Nantucket Sound, Massachusetts. The layout reflects the revised project plan from July 2005. The dataset was created by the organization SCIOPS, with coordinates based on GPS surveys.
Hydrophysical and hydrochemical data were collected from CTD casts in Massachusetts Bay and adjacent waters from April 1990 to June 1991. The dataset includes measurements of water depth, temperature, salinity, chlorophyll a concentration, percent light transmission, and beam attenuation. Data were gathered from the R/V Asterias and other platforms as part of the Massachusetts Bays Program.
Phytoplankton cell counts were collected from May to December 2006 using a moored submersible flow cytometer at the Martha's Vineyard Coastal Observatory in the Northwest Atlantic Ocean. The data were submitted by Dr. Robert J. Olson of the Woods Hole Oceanographic Institution and are recorded as an ASCII (.txt) file.
Phytoplankton cell counts were collected from a moored submersible flow cytometer at the Martha's Vineyard Coastal Observatory in Massachusetts. The data, provided by NOAA NCEI, cover the period from May 2003 to September 2004 and report concentrations for Synechococcus and Eukaryotes in cells per milliliter. Data are stored in yearly, space-separated ASCII text files for 2003 and 2004.
Phytoplankton cell counts were collected from the Martha's Vineyard Coastal Observatory in the Northwest Atlantic Ocean. The data were submitted by Dr. Robert J. Olson of the Woods Hole Oceanographic Institution and cover the period from 20 May 2004 to 30 September 2004. Data were recorded as an ASCII (.txt) file.
Lux Japanese Speech Corpus is a collection of Japanese text-to-speech audio recordings by the original character 'Lux'. The dataset contains WAV files at 96kHz/16bit, provided in both raw and noise-cleaned versions, with corresponding transcriptions in a metadata file. The dataset was authored by Lami and last updated on Hugging Face on February 24,我们发现了一个问题。
This 3-hour audio collection by MrDragonFox, updated in March 2025, provides speech recordings paired with classified emotional tags. It functions as a demo test set for voice fine-tuning, containing over 600 specific instances of non-verbal sounds such as laughs, sighs, and giggles.
Agreements for KZSMO "Musical School No3" in the KMR have been concluded from 2019 to the present. The dataset is sourced from the States site of Ukraine and was last updated on June 12, 2024. The specific contents and scale of the agreements are not detailed.
Additional agreements to contracts from 2019 to the present time for KZSMO 'Musical School No3' KCC. The data originates from the States site of Ukraine and was last updated on 2024-06-12. The specific number of contracts, rows, and file size are not provided in the metadata.
A collection of Mexican Spanish voice audio files designed for Text-to-Speech (TTS) applications. The dataset is intended for fine-tuning advanced models like Sesame and Orpheus. It is under continuous construction by author jmpanozzo.
The People's Speech Dataset contains over 30,000 hours of transcribed English speech, licensed for academic and commercial use under CC-BY-SA and CC-BY 4.0. It was created by MLCommons to train speech-to-text systems and features a diverse set of speakers.
Laying out neural firing rate evidence from a rodent model (Meriones unguiculatus) investigating midbrain adaptation's role in musical beat perception. It compares on-beat and off-beat sound responses, with findings suggesting basic firing rate adaptation shapes perceptual emergence. Specific row and column counts are unavailable.
A 2020 dataset from Piotr Łukasik investigates the effects of facultative endosymbiotic bacteria on aphid hosts. It measures aphid fecundity and susceptibility to an entomopathogenic fungus across multiple host genotypes and species, with symbionts transferred either together or separately. The study explores how protective phenotypes conferred by Rickettsia and Spiroplasma vary among host backgrounds and are influenced by co-infection with Hamiltonella.