DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

IndicTTS Malayalam: Speech Recordings for Text-to-Speech Research | DataSalon

Home Speech & AudioIndicTTS Malayalam: Speech Recordings for Text-to-Speech Research

Speech & Audio

IndicTTS Malayalam: Speech Recordings for Text-to-Speech Research

Name: IndicTTS Malayalam: Speech Recordings for Text-to-Speech Research
Creator: SPRINGLab
Published: 2025-01-24T12:12:38
Keywords: Text To Speech, Malayalam, Speech Synthesis, Audio

by SPRINGLab·Updated 1y ago

Available on 1 platform

Description

SPRINGLab's IndicTTS Malayalam dataset contains high-quality speech recordings with transcriptions for text-to-speech research. The dataset includes approximately 17.89 hours of audio from male and female speakers, sourced from the Indic TTS Database project. It was last updated on January 25, 2025.

Use Cases

Train Malayalam text-to-speech models based on high-quality audio recordings.
Benchmark speech synthesis systems based on male and female speaker data.
Develop multilingual TTS pipelines based on Indic language resources.
Study prosody and pronunciation in Malayalam based on transcribed speech.

Strengths

Contains approximately 17.89 hours of audio data.
Includes recordings from both male (9.7 hours) and female (8.19 hours) speakers.
Audio files are in WAV format, suggesting high-quality recordings.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count and total file size are unknown, which may limit suitability assessment.
The description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: Indic TTS Database project.
Collection Method: Derived from Malayalam monolingual recordings.
Freshness: Last updated 2025-01-25 05:52:03.

Audio Text To Speech Malayalam Speech Synthesis

Related Datasets

Quality Score

D39

Description

Source

Reputation

Quality Score

D39

Description

Source

Reputation

Access

Community

153 downloads

2 likes

0 views

Dataset Info

Author: SPRINGLab
Created: Jan 24, 2025
Updated: Jan 25, 2025
Last synced: Jun 8, 2026

Access

Community

153 downloads

2 likes

0 views

Dataset Info

Author: SPRINGLab
Created: Jan 24, 2025
Updated: Jan 25, 2025
Last synced: Jun 8, 2026

IndicTTS Malayalam: Speech Recordings for Text-to-Speech Research

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info