Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Treble10-Speech is an automatic speech recognition (ASR) dataset featuring 16 kHz audio files generated by convolving LibriSpeech data with high-fidelity room-acoustic simulations. Created by Treble Technologies and updated in November 2025, the collection includes between 1,000 and 10,000 records across 10 distinct furnished room environments. The dataset provides speech samples with reverberation times ranging from 0.17 to 0.84 seconds.
The dataset is distributed under a CC BY 4.0 license and is provided in optimized Parquet format for use with libraries like Polars or Dask.