Librispeech-PC 44kHz Opus replaces the original Librispeech PC audio with higher-quality source material encoded as Opus at 64 kbps. Sampling rates are increased from 16kHz up to 48kHz, depending on the source. The dataset was created by mythicinfinity and last updated on March 28, 2026.
Use Cases
- Training speech-to-text models based on the high-quality audio replacement.
- Benchmarking audio encoding performance based on the Opus (64 kbps) format.
- Studying the impact of sampling rate (up to 48kHz) on speech recognition accuracy.
- Developing audio preprocessing pipelines based on the described quality upgrade process.
Strengths
- Audio content is replaced with the highest available quality source material.
- Audio is encoded as Opus at 64 kbps, a modern, efficient format.
- Sampling rate is increased from 16kHz up to 48kHz.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- A merge of openslr/librispeech_asr audio metadata with SLR145.
- Collection Method
- Audio content replaced from source audio with highest available quality, then encoded as Opus.
- Freshness
- Last updated 2026-03-28 22:15:17; freshness should be verified.