DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Urdu Text-to-Speech Corpus Subset Processed | DataSalon

Home Speech & AudioUrdu Text-to-Speech Corpus Subset Processed

Speech & Audio

Urdu Text-to-Speech Corpus Subset Processed

Available on 1 platform

Description

A processed subset of an Urdu text-to-speech corpus, published on Kaggle. The dataset likely contains aligned audio recordings and corresponding text transcripts for speech synthesis tasks. Specific details on size, creation date, and original source are not provided in the available metadata.

Use Cases

Train a text-to-speech model for Urdu (inferred from domain, verify after download)
Benchmark speech synthesis quality for low-resource languages (inferred from domain, verify after download)
Create phonetic alignments for Urdu speech data (inferred from domain, verify after download)

Strengths

Published on Kaggle, a platform for sharing datasets.
The title indicates the data has been processed, which may imply some level of cleaning or formatting.

Limitations

Metadata is minimal; actual content requires verification after download.
Row count, file size, and column definitions are unknown.
License, author, and last update information are unavailable.

Provenance

Source: Kaggle
Collection Method: Likely derived from a larger Urdu speech corpus, but the specific gathering method is unknown.
Time Range: null
Freshness: Last updated date is unknown; freshness unverified.
Geography: null

null

Text Audio Text To Speech Urdu Speech Corpus Natural Language Processing Audio Processing

Related Datasets

Quality Score

D16

Description

Source

Reputation

Quality Score

D16

Description

Source

Reputation

Access

Community

0 views

Dataset Info

Last synced: Apr 24, 2026

Access

Community

0 views

Dataset Info

Last synced: Apr 24, 2026

Urdu Text-to-Speech Corpus Subset Processed

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info