Audio Embedding WavLM Large: Precomputed Audio Features

Available on 1 platform

Sign in to view source links and access this dataset

Description

Audio embeddings generated by the WavLM-Large model, a transformer-based architecture for audio representation learning. The dataset likely contains precomputed feature vectors for audio samples, facilitating downstream machine learning tasks. It is hosted on Kaggle, a platform for data science competitions and datasets.

Use Cases

Train a classifier for audio event detection using precomputed embeddings (inferred from domain, verify after download)
Benchmark audio representation models against WavLM-Large features (inferred from domain, verify after download)
Perform similarity search or clustering on audio files (inferred from domain, verify after download)

Strengths

Published on Kaggle, a major platform for data science resources.
Features are derived from WavLM-Large, a state-of-the-art audio model.

Limitations

Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and sample data are unknown, which may limit suitability assessment.

Provenance

Source: Kaggle
Collection Method: Likely generated by processing audio data through the WavLM-Large model.
Time Range: null
Freshness: Last update date is unknown; freshness unverified.
Geography: null

null

Audio Machine Learning Feature Extraction Audio Embeddings

Related Datasets

Quality Score

D16

Description

8

Source

17

Reputation

18

Access

31

Community

0 views

Dataset Info

Last synced: Apr 16, 2026

Access

31

Community

0 views

Dataset Info

Last synced: Apr 16, 2026

Audio Embedding WavLM Large: Precomputed Audio Features

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info