Omnidistilthinking: Conversational AI Speech and Transcripts

Name: Omnidistilthinking: Conversational AI Speech and Transcripts
Creator: ShiniChien
Published: 2026-05-14T16:54:14
Keywords: Audio Transcript, Speech Synthesis, Conversational Ai, Tabular, Audio, Multimodal Dialogue

by ShiniChienUpdated 2mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

A collection of conversational turns with audio recordings and transcripts. The dataset includes columns for conversation identifiers, speaker agents, prompts sent to a Gemini Live model, spoken transcripts, and audio durations. It was created by ShiniChien and last updated on May 18, 2026.

Use Cases

Train or evaluate text-to-speech models based on the provided audio and transcript pairs.
Analyze conversational patterns and agent behavior based on turn-by-turn dialogue data.
Develop multimodal AI systems that integrate speech and text based on the synchronized audio and transcript fields.
Benchmark speech generation quality using the specified TTS voice and duration metadata.

Strengths

Includes synchronized audio files (WAV format) and text transcripts for each conversational turn.
Contains structured metadata such as conversation IDs, turn indices, agent names, and prompt instructions.

Limitations

Dataset size, row count, and file formats beyond audio are unknown, limiting suitability assessment.
Column-level documentation beyond the provided list is absent; field semantics may require further inference.
Freshness should be verified as the last update timestamp is in the future (2026-05-18).

Provenance

Source: huggingface
Collection Method: Likely generated from interactions with a Gemini Live model.
Freshness: Last updated 2026-05-18 11:35:27

License information is unknown.

Tabular Audio Audio Transcript Speech Synthesis Conversational Ai Multimodal Dialogue

Related Datasets

Quality Score

D31

Description

24

Source

36

Reputation

42

Access

22

Community

101 downloads

1 likes

0 views

Dataset Info

Author: ShiniChien
Created: May 14, 2026
Updated: May 18, 2026
Last synced: May 22, 2026

Access

22

Community

101 downloads

1 likes

0 views

Dataset Info

Author: ShiniChien
Created: May 14, 2026
Updated: May 18, 2026
Last synced: May 22, 2026

Omnidistilthinking: Conversational AI Speech and Transcripts

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info