Spoken Marathi Dialect Identification Dataset

Available on 1 platform

Sign in to view source links and access this dataset

Description

Spoken Marathi Dialect Identification Dataset is a collection of audio recordings for dialect recognition. It is hosted on Kaggle and described as a deep learning approach for dialect recognition. The dataset's specific size, collection method, and origin are not detailed in the provided metadata.

Use Cases

Training a model to classify audio clips by Marathi dialect (inferred from domain, verify after download)
Benchmarking speech recognition systems on dialectal variations (inferred from domain, verify after download)
Studying acoustic features that distinguish regional speech patterns (inferred from domain, verify after download)

Strengths

Published on Kaggle, a major platform for data science resources.
Focuses on a specific language and task, which may fill a niche in speech data.

Limitations

Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count and file size are unknown, which may limit suitability assessment.

Provenance

Geography: Likely India, given the focus on Marathi dialects.

Tabular Audio Marathi Dialect Identification Audio Classification Speech Recognition Spoken Language

Related Datasets

Quality Score

D16

Description

8

Source

17

Reputation

18

Access

31

Community

0 views

Dataset Info

Last synced: Jun 11, 2026

Access

31

Community

0 views

Dataset Info

Last synced: Jun 11, 2026

Spoken Marathi Dialect Identification Dataset

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info