DataSalon

HDTF: High-Definition Talking Face Videos and Audio for Avatar Synthesis | DataSalon

Multimodal & LLM

Name: HDTF: High-Definition Talking Face Videos and Audio for Avatar Synthesis
Creator: global-optima-research
Published: 2025-06-01T07:16:26
Keywords: Multimodal Synthesis, Talking Face Generation, Video Clips, Audio, Audio Embeddings, Time Series, Video, Multimodal

Available on 1 platform

Description

The dataset contains 400 full-length, high-definition talking-face videos, split into 81-frame clips and paired with audio embeddings. It was curated by global-optima-research and last updated on June 4, 2025. It is intended for talking-head generation and multimodal avatar synthesis.
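
The listing does not say how the 81-frame clips were cut. As a rough illustration only, the sketch below splits a frame sequence into consecutive, non-overlapping clips and drops any leftover frames. Both choices are assumptions, as is the helper name `split_into_clips`, and integers stand in for decoded image frames.

```python
from typing import List, Sequence, TypeVar

T = TypeVar("T")

CLIP_LEN = 81  # clip length stated in the dataset description


def split_into_clips(frames: Sequence[T], clip_len: int = CLIP_LEN) -> List[Sequence[T]]:
    """Cut a frame sequence into consecutive, non-overlapping clips.

    Assumption: trailing frames that do not fill a whole clip are dropped.
    The dataset's actual preprocessing may differ (for example, overlap).
    """
    if clip_len <= 0:
        raise ValueError("clip_len must be positive")
    n_clips = len(frames) // clip_len
    return [frames[i * clip_len:(i + 1) * clip_len] for i in range(n_clips)]


# Stand-in for decoded frames: integers instead of image arrays.
frames = list(range(250))
clips = split_into_clips(frames)
print(len(clips), len(clips[0]), clips[-1][0])
```

With 250 stand-in frames this yields three full clips, and the last 7 frames are discarded. A real pipeline would also need to slice the audio embeddings over the same time spans so each clip stays aligned with its audio.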

Use Cases

Train models for talking-head generation based on the provided high-definition video clips.
Develop video captioning systems using the temporal video units.
Perform multimodal avatar synthesis by leveraging the paired video and audio embeddings.

Strengths

Contains 400 original high-definition videos.
Videos are preprocessed into 81-frame clips for temporal modeling.
Includes extracted audio embeddings for multimodal tasks.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which makes it harder to assess suitability before download.
Description metadata is limited; actual data quality requires manual inspection after download.
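
Because the columns are undocumented, a first pass after download might simply tabulate which fields appear and what kinds of values they hold. The sketch below assumes the records can be loaded as dictionaries. The field names in `sample` (`clip_id`, `audio_emb`, `fps`) are hypothetical placeholders, not the dataset's real schema.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, List, Mapping


def describe_value(value: Any) -> str:
    """Return a short type label, including length for sized containers."""
    name = type(value).__name__
    if isinstance(value, (list, tuple, str, bytes)):
        return f"{name}[len={len(value)}]"
    return name


def summarize_fields(records: Iterable[Mapping[str, Any]]) -> Dict[str, List[str]]:
    """Collect the distinct type labels seen for each field across records."""
    seen = defaultdict(set)
    for record in records:
        for key, value in record.items():
            seen[key].add(describe_value(value))
    return {key: sorted(labels) for key, labels in seen.items()}


# Hypothetical records; the real field names are not documented.
sample = [
    {"clip_id": "a_000", "audio_emb": [0.1, 0.2, 0.3]},
    {"clip_id": "a_001", "audio_emb": [0.4, 0.5, 0.6], "fps": 25},
]
summary = summarize_fields(sample)
for field, labels in summary.items():
    print(field, labels)
```

Fields that show several labels, or that appear in only some records, are the ones most worth inspecting by hand before training.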

Provenance

Source: global-optima-research
Collection Method: Curated and preprocessed version of the HDTF dataset.
Time Range: Not specified.
Freshness: Last updated 2025-06-04 06:31:41.
Geography: Not specified.

Related Topics: Audio, Time Series, Video, Multimodal, Multimodal Synthesis, Talking Face Generation, Video Clips, Audio Embeddings

Quality Score

D39

Community

515 downloads

8 likes

0 views

Dataset Info

Author: global-optima-research
Created: Jun 1, 2025
Updated: Jun 4, 2025
Last synced: Jun 19, 2026
