MGB-1: Multi-Genre Broadcast Challenge (BBC Audio)

Name: MGB-1: Multi-Genre Broadcast Challenge (BBC Audio)
Creator: cdminix
Published: 2022-03-02T23:29:22
Keywords: Regionus

by cdminix, updated 5y ago


Description

Multi-genre TV recordings from the British Broadcasting Corporation (BBC), released for the 2015 Multi-Genre Broadcast (MGB-1) Challenge and covering a broad range of English-language broadcast output. The data includes audio and metadata for the challenge's speech recognition, speaker diarization, and lightly supervised alignment tasks.

Use Cases

Train speech recognition models using the multi-genre TV audio and corresponding transcripts
Develop speaker diarization algorithms to segment audio by individual speakers in broadcast settings
Evaluate lightly supervised alignment techniques to synchronize text with the audio stream
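
For the speech recognition use case, system output is commonly scored against reference transcripts with word error rate (WER). The following is a minimal sketch of that metric, not part of the dataset or the challenge tooling: the function name word_error_rate and the example strings are hypothetical, and the lowercasing and whitespace tokenization are simplifying assumptions rather than the challenge's official scoring rules.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Assumption: text is normalized only by lowercasing and splitting on
    whitespace; official scoring pipelines usually apply more normalization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical transcripts, not taken from the dataset.
    print(word_error_rate("good evening and welcome", "good morning and welcome to"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.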

Strengths

Multi-genre TV recordings sourced from the British Broadcasting Corporation (BBC)
Includes data for speech recognition, speaker diarization, and lightly supervised alignment
Covers the full range of BBC TV output included in the 2015 MGB-1 Challenge

Quality Score

D24

Description: 21
Source: 36
Reputation: 8
Access: 22

Community

14 downloads

0 views

Dataset Info

Author: cdminix
Created: Mar 2, 2022
Updated: Feb 5, 2021
Last synced: Apr 29, 2026

