DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Speech & Audio Datasets | DataSalon

Speech & Audio

Speech recognition, text-to-speech, speaker identification, music classification, audio event detection

2,588 datasets

Massachusetts Nearshore Ocean Management Boundary Line

A boundary line defines the landward limit of the Ocean Management Planning Area for Massachusetts, established 0.3 nautical miles from the mean high water shoreline. The data set, provided by SCIOPS, includes official coordinate values and GIS shapefiles for this legal boundary. It was created pursuant to 'An Act Relative to Oceans' to regulate coastal development.

Geospatial, Marine Policy, Coastal Boundary, Ocean Management, Geospatial Planning, +1

Massachusetts Shellfish Fishery Regulations And Management

Records from the Massachusetts Shellfish Sanitation Program managed by the Division of Marine Fisheries (MarineFisheries). It details regulatory activities for commercial shellfish harvesting, aquaculture, and local technical partnerships. The dataset originates from the SCIOPS organization via NASA Earthdata.

Tabular, Aquaculture, Coastal Regulation, Marine Fisheries, Shellfish Management, +1

Noisy Librispeech For MetricGAN+ and ASR

Kaggle hosts this audio dataset derived from the LibriSpeech corpus. The title suggests it contains speech recordings with added noise, intended for training or evaluating MetricGAN+ and Automatic Speech Recognition systems. The dataset's author, organization, and specific details like size and license are unknown.

Audio, Noise Augmentation, Speech Processing, Automatic Speech Recognition, Audio Synthesis, +1

SLP301_MusicNet: Music Audio Dataset

A dataset titled 'SLP301_MusicNet' published on Kaggle. The title suggests it contains music audio data, likely for machine learning tasks. The dataset's specific size, creator, and temporal coverage are unknown.

Audio, Machine Learning, Audio Analysis, +1

Massachusetts Bay Hydrodynamic Model for Water Quality and Sediment Studies

A three-dimensional numerical model simulates circulation in Massachusetts and Cape Cod Bays, driven by tides, wind, river runoff, and thermal forcing. The U.S. Geological Survey developed this model to study the transport of nutrients, contaminants, and red tide populations. The dataset was last updated in 1992.

Time Series, Geospatial, Sediment Transport, Gulf Of Maine, Water Quality Modeling, Marine Research, Coastal Hydrodynamics, +1

2025 Taiwan Table Tennis Time Series (TTTTS): Swing Dataset

Taiwan is the geographic focus of this dataset. It contains time-series data related to table tennis swings, as indicated by its title and raw description. The dataset is hosted on Kaggle, but specific details about its size, structure, and creation are currently unknown.

Time Series, Taiwan, Table Tennis, Sports Analytics, +1

Vaani Atypical Speech Corpus: Multilingual Speech Data for Accessibility

Project Euphonia, a public initiative led by Google, aims to improve Automatic Speech Recognition for individuals with atypical speech. The Vaani corpus expands this work beyond English to include languages such as French, Spanish, Japanese, and Hindi. This dataset is hosted by ARTPARK-IISc and was last updated on March 18, 2026.

Audio, Multilingual, Ai For Good, Accessibility, Natural Language Processing, Atypical Speech, Speech Recognition, +1

F5-TTS Clean Voice Dataset

F5-TTS Clean Voice Dataset is a collection of audio data published on Kaggle. The dataset likely contains voice recordings intended for text-to-speech model training. Its specific size, source, and creation date are not detailed in the available metadata.

Audio, Text To Speech, Audio Dataset, Voice Synthesis, Clean Audio, +1

Road Centerlines for the City of Bloomington and Monroe County, Indiana

Road Centerlines is a geospatial dataset representing the centerline of roadways for the City of Bloomington, Indiana, extended to a countywide network. The data includes public roads, named private roads, major multi-use trails, and proposed roadways, with attributes updated from multiple local government sources. The dataset was last updated on March 8, 2026.

Geospatial, CSV, XML, JSON, Roads, Transportation, Streets, Centerlines, Maps, +1

Hindi Podcast Audio Collection for Speech Model Training

Hindi Podcast ASR Dataset is a large-scale collection of raw Hindi podcast audio designed for speech and language model development. It captures real-world interactions across diverse topics and formats. The dataset was created by InfoBayAI and was last updated in March 2026.

Audio, Language Modeling, Hindi Speech, Large Scale, Podcast Audio, Speech Recognition, +1

English Female Monologue Audio for ASR and VAD Alignment

A collection of unscripted human monologues in English, spoken by a female voice. The dataset provides 3-minute preview clips intended for use in automatic speech recognition and voice activity detection tasks. The source, author, and specific collection details are not provided.

Audio, Monologue, Audio Alignment, Vad, Speech Recognition, +1

New Music Collection

A dataset titled 'New_music' published on the Kaggle platform. The dataset's specific content, size, and origin are not detailed in the provided metadata. Further details about the data's creator, collection method, and temporal scope require verification after accessing the dataset files.

Audio, New Music, +1

Digital Geologic-GIS Map of the Moccasin Quadrangle, Colorado (1:24,000 Scale)

A digital geologic-GIS dataset for the Moccasin Quadrangle in Colorado, adapted from a 1999 National Park Service geologic map. The dataset is composed of GIS data layers and tables, available in multiple formats including a file geodatabase and an OGC geopackage. It was completed as part of the NPS Geologic Resources Inventory program and includes ancillary documents with geologic unit descriptions.

Geospatial, Geodatabase, National Park Service, Faults Momeflt, Digital Geologic Gis Map, Arcgis, Arcmap Document, Geologic Contacts Momeglga, File Geodatabase, Digital Geologic Map, Geologic Unit Descriptions, Geologic Resources Inventory, Geologic Attitude Observation Localities Momeatd, Geologic Resource Evaluation, Geologic Gis Map, Digital Data, Arcgis Pro Map File, Arcgis Pro, Geologic Unit Information Table, Geologic Resources Division, Colorado, Geologic Map, +1

Geologic Map of the Mancos Quadrangle, Colorado at 1:24,000 Scale

A National Park Service digital geologic-GIS map for the Mancos Quadrangle in Colorado, adapted from a 1999 source map. The dataset includes GIS data layers, tables, and ancillary documents like unit descriptions and metadata. Data locational accuracy is specified to be within 12.2 meters horizontally, based on the source map scale of 1:24,000.

Geospatial, Geodatabase, Geologic Contacts Mancglga, Geologic Units, Geology, Digital Geologic Gis Map, Arcgis, Arcmap Document, File Geodatabase, Digital Geologic Map, Geologic Unit Descriptions, Geologic Resources Inventory, Geologic Resource Evaluation, Geologic Gis Map, National Park, Digital Data, Arcgis Pro Map File, Arcgis Pro, Geologic Unit Information Table, Geologic Attitude Observation Localities Mancatd, Geologic Resources Division, Colorado, Geologic Map, +1

Digital Geologic Map of Trail Canyon Quadrangle, Colorado at 1:24,000 Scale

A National Park Service Geologic Resources Inventory digital map of the Trail Canyon Quadrangle in Colorado, adapted from a 1999 geologic map by Griffitts. The dataset includes GIS data layers and tables in multiple formats, such as a file geodatabase and geopackage, along with supporting documentation. It was produced by the NPS Geologic Resources Division as part of the Inventory and Monitoring program.

Geospatial, Geodatabase, Geologic Attitude Observation Localities Trcaatd, Geologic Units, Digital Geologic Gis Map, Arcgis, Arcmap Document, File Geodatabase, Digital Geologic Map, Geologic Unit Descriptions, Geologic Resources Inventory, Geologic Resource Evaluation, Geologic Gis Map, National Park, Geologic Contacts Trcaglga, Digital Data, Arcgis Pro Map File, Arcgis Pro, Geologic Unit Information Table, Geologic Resources Division, Colorado, Geologic Map, +1

Digital Geologic-GIS Map of Wetherill Mesa Quadrangle, Colorado

A digital geologic-GIS map of the Wetherill Mesa Quadrangle in Colorado, adapted from a 1999 National Park Service geologic map. The dataset includes GIS data layers and tables available in file geodatabase and geopackage formats, along with ancillary PDF documents containing unit descriptions and metadata. It was produced by the National Park Service's Geologic Resources Inventory program.

Geospatial, Geodatabase, Geologic Contacts Wemeglga, Digital Geologic Gis Map, Arcgis, Arcmap Document, File Geodatabase, Digital Geologic Map, Geologic Unit Descriptions, Geologic Resources Inventory, Geologic Resource Evaluation, Geologic Gis Map, National Park, Digital Data, Faults Wemeflt, Arcgis Pro Map File, Arcgis Pro, Geologic Unit Information Table, Geologic Resources Division, Geologic Attitude Observation Localities Wemeatd, Colorado, Geologic Map, +1

Point Lookout Quadrangle Geologic Map in Colorado at 1:24,000 Scale

A digital geologic-GIS map of the Point Lookout Quadrangle in Colorado, composed of GIS data layers and tables. The dataset was produced by the National Park Service's Geologic Resources Inventory program, adapted from a 1999 source map by Griffitts. It is available in multiple GIS formats including a file geodatabase and an OGC geopackage.

Geospatial, Geodatabase, Digital Geologic Gis Map, Faults Poloflt, Arcgis, Arcmap Document, File Geodatabase, Digital Geologic Map, Geologic Unit Descriptions, Geologic Resources Inventory, Geologic Resource Evaluation, Geologic Gis Map, Geologic Contacts Pologlga, National Park, Digital Data, Arcgis Pro Map File, Arcgis Pro, Geologic Unit Information Table, Geologic Resources Division, Geologic Attitude Observation Localities Poloatd, Colorado, Geologic Map, +1

Digital Geologic Map of the Cortez Quadrangle, Colorado at 1:24,000 Scale

A National Park Service Geologic Resources Inventory digital map of the Cortez Quadrangle, Colorado, derived from a 1999 source map. The dataset includes GIS data layers, tables, and ancillary documents such as unit descriptions. Based on the source map scale of 1:24,000, features have a stated horizontal locational accuracy within 12.2 meters (40 feet).

Geospatial, Geodatabase, National Park Service, Geologic Attitude Observation Localities Cortatd, Faults Cortflt, Digital Geologic Gis Map, Arcgis, Arcmap Document, File Geodatabase, Digital Geologic Map, Geologic Unit Descriptions, Cortez Quadrangle, Geologic Resources Inventory, Geologic Resource Evaluation, Geologic Gis Map, Digital Data, Arcgis Pro Map File, Arcgis Pro, Geologic Contacts Cortglga, Geologic Resources Division, Colorado, Geologic Map, +1

Digital Geologic-GIS Map of Mesa Verde National Park and Vicinity, Colorado

A digital geologic-GIS dataset for Mesa Verde National Park and vicinity, Colorado, composed of GIS data layers and tables. The data were completed as a component of the National Park Service's Geologic Resources Inventory program, adapted from source maps by Griffitts (1999). It is available in multiple GIS formats including a file geodatabase, OGC geopackage, and KMZ/KML for Google Earth.

Geospatial, Geodatabase, Geology, Digital Geologic Gis Map, Arcgis, Arcmap Document, File Geodatabase, Digital Geologic Map, Geologic Cross Section Lines Mevesec, Cortez Quadrangle, Geologic Resources Inventory, Geologic Resource Evaluation, Geologic Contacts Meveglga, Geologic Gis Map, National Park, Digital Data, Arcgis Pro Map File, Arcgis Pro, Faults Meveflt, Geologic Resources Division, Colorado, Geologic Map, Geologic Attitude Observation Localities Meveatd, +1

VOXCeleb: Celebrity Speech and Audio-Visual Data

VOXCeleb is a dataset of speech and video clips featuring celebrities. It is hosted on the Kaggle platform. The specific size, collection method, and time range are not detailed in the provided metadata.

Audio, Video, Speaker Identification, Audio Processing, Celebrities, Speech Recognition, +1
