DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Munch 1 Latent New Parquet: Precomputed Urdu TTS Latent Representations | DataSalon

Home Speech & AudioMunch 1 Latent New Parquet: Precomputed Urdu TTS Latent Representations

Speech & Audio

Munch 1 Latent New Parquet: Precomputed Urdu TTS Latent Representations

Name: Munch 1 Latent New Parquet: Precomputed Urdu TTS Latent Representations
Creator: zuhri025
Published: 2026-04-18T06:50:58
Keywords: Text To Speech, Urdu Speech, Latent Representations, Tabular, Audio, Audio Processing

by zuhri025·Updated 3mo ago

Available on 1 platform

Description

51,021 pre-computed latent representations for Urdu utterances, designed to bypass audio decoding during TTS model training. The latents are derived from the Humair332/Urdu-munch-1 audio source using the Aratako/Semantic-DACVAE-Japanese-32dim codec at a 25 Hz frame rate. Author zuhri025 uploaded this dataset to Hugging Face in April 2026.

Use Cases

Train Urdu text-to-speech models based on pre-computed latent representations.
Benchmark TTS model performance using a standardized latent feature set.
Experiment with latent space manipulation for speech synthesis based on the described codec features.

Strengths

Contains 51,021 pre-processed Urdu utterance latents, providing a substantial starting point for model training.
Specifies technical parameters like a codec sample rate of 48,000 Hz and a latent frame rate of 25.0 Hz.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Data may reflect source bias inherent to the original Humair332/Urdu-munch-1 audio collection.

Provenance

Source: Humair332/Urdu-munch-1
Collection Method: Pre-computed DACVAE latent representations.
Freshness: Last updated 2026-04-18 16:28:59; freshness should be verified.

License restrictions are unknown and should be verified before use.

Tabular Audio Text To Speech Urdu Speech Latent Representations Audio Processing

Related Datasets

Quality Score

C42

Description

Source

Reputation

Quality Score

C42

Description

Source

Reputation

Access

Community

619 downloads

1 likes

0 views

Dataset Info

Author: zuhri025
Created: Apr 18, 2026
Updated: Apr 18, 2026
Last synced: May 23, 2026

Access

Community

619 downloads

1 likes

0 views

Dataset Info

Author: zuhri025
Created: Apr 18, 2026
Updated: Apr 18, 2026
Last synced: May 23, 2026

Munch 1 Latent New Parquet: Precomputed Urdu TTS Latent Representations

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info