AxonData's English Contact Center Audio Dataset provides over 1,000 hours of inbound and outbound telephone call audio paired with English transcripts. The data consists of real-world, non-synthetic conversations featuring diverse English accents. The dataset was last updated on February 13, 2026.
Use Cases
- Train speech recognition models on real-world telephone audio and transcripts.
- Perform sentiment analysis on authentic customer-agent conversations.
- Develop customer support AI models from inbound and outbound call interactions.
- Analyze conversational patterns and accents in diverse English speech data.
Strengths
- Contains over 1,000 hours of audio data.
- Includes full English transcripts for all audio.
- Features real-world, non-synthetic telephone conversations.
- Covers diverse English accents.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count, file formats, and license information are not documented.
- Data may reflect geographic or accent bias inherent to the source collection.
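Because file formats and field semantics are undocumented, a first step after download is usually a simple inventory of what the archive contains. The following is a minimal sketch, not a description of AxonData's actual layout: the function name `inventory` and the directory path are hypothetical, and the assumption that some audio is stored as PCM WAV is unverified. Files in other formats are still counted by extension but contribute nothing to the duration total.

```python
from collections import Counter
from pathlib import Path
import wave


def inventory(root):
    """Count files by extension and total the hours of readable PCM WAV audio.

    Hypothetical helper: the dataset's real formats are undocumented, so any
    file that the standard-library wave module cannot parse is skipped.
    """
    counts = Counter()
    wav_seconds = 0.0
    for path in Path(root).rglob("*"):
        if not path.is_file():
            continue
        ext = path.suffix.lower() or "(none)"
        counts[ext] += 1
        if ext == ".wav":
            try:
                with wave.open(str(path), "rb") as w:
                    rate = w.getframerate()
                    if rate:
                        wav_seconds += w.getnframes() / rate
            except (wave.Error, EOFError):
                # Not PCM WAV, or a truncated file; leave it out of the total.
                pass
    return counts, wav_seconds / 3600.0
```

Calling `inventory("path/to/download")` (placeholder path) returns the per-extension counts and the total WAV hours, which can be compared against the advertised figure of over 1,000 hours and used to spot transcript or metadata files that need closer inspection.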
Provenance
- Source: AxonData
- Collection Method: Likely collected from real-world inbound and outbound contact center calls.
- Freshness: Last updated 2026-02-13 12:28:16; freshness should be verified.