Loading...
Loading...
Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
2,018 datasets
Encompassing text data for generating lyrics in the style of Jim Morrison. It is part of the HuggingArtists project and was last updated in October 2022.
Featuring text data for generating lyrics in the style of Selena Gomez. It is part of the HuggingArtists project and was last updated in October 2022. Specific details on row count, column structure, and data size are not provided.
A collection of text data for generating lyrics using the HuggingArtists framework. It is categorized as a text modality dataset with a size category of 1K and is tagged for English language content. The dataset was last updated on October 25, 2022.
Encompassing text data for generating lyrics related to the artist Machine Gun Kelly. It is sourced from the HuggingArtists project and was last updated in October 2022. The specific number of rows, columns, and data size is unknown.
A text dataset for generating lyrics, created by huggingartists. The dataset contains lyrics data, is categorized as having a size under 1,000 entries, and was last updated in October 2022.
Encompassing text data for generating lyrics in the style of the artist Lyapis Trubetskoy. It is part of the HuggingArtists project, which focuses on creating models for lyric generation. The dataset was last updated in October 2022.
A collection of lyrics from the artist Jah Khalib for use with the HuggingArtists framework. It is a text corpus intended for training language models to generate lyrics. The dataset size and specific features are not detailed in the input.
1 collection of song lyrics from the artist Chief Keef, formatted for fine-tuning generative language models. The content consists of raw text strings representing verses and choruses extracted for use in the HuggingArtists framework.
A collection of text-based song lyrics attributed to Chester Bennington, curated for the HuggingArtists library. The data consists of raw text sequences representing the artist's discography for use in training generative language models.
A collection of text data for generating lyrics in the style of the artist Slava Marlow. It is sourced from the HuggingArtists project and was last updated in October 2022. The dataset size is categorized as 'n1 K', indicating it contains over 1,000 text entries.
A collection of song lyrics from the band Duran Duran categorized for text generation and linguistic modeling. The data is formatted specifically for the HuggingArtists library to facilitate the fine-tuning of causal language models on artist-specific styles.
Comprising text data for generating lyrics, created by the huggingartists project. It is categorized as a text modality dataset with a size category of 'n1 K' and is focused on English language content from the US region. Specific details on row count, columns, and data structure are unavailable.
Version 3.0 Level 1 science data provides calibrated Delay Doppler Maps (DDMs) from the eight-satellite CYGNSS constellation. The dataset includes geo-located measurements of Power Received, Bistatic Radar Cross Section (BRCS), Normalized BRCS, and Leading Edge Slope, produced by POCLOUD. Data from up to 8 spacecraft is typically available daily with a latency of approximately 6 days from the last measurement.
A collection of song lyrics and text data from the artist Andre 3000. It is designed for fine-tuning language models via the HuggingArtists library to replicate specific hip-hop stylistic elements.
A collection of song lyrics from the artist Bladee categorized for generative text tasks. The data includes text sequences representing the artist's discography, specifically formatted for fine-tuning language models via the HuggingArtists framework.
A text dataset for generating lyrics using the HuggingArtists framework. It contains lyrics data, as indicated by the 'Lyrics' tag, and was last updated in October 2022.
Encompassing lyrics from the band Sum 41, intended for use with the HuggingArtists framework for text generation. The dataset is categorized as containing text data in English and is hosted on the Hugging Face platform. The specific number of rows, columns, and total size is not provided.
A collection of 50 short music clips, each 3 to 5 seconds in length. It was created by author 'yongjian' and last updated in October 2022.
For fact-checking and natural language inference tasks in the Danish language. It contains between 1,000 and 10,000 rows, as indicated by its size category, and features expert-generated annotations. The data was created by strombergnlp and last updated in October 2022.
Petersham, Massachusetts is the location for these lidar-derived digital surface model (DSM) data, representing surface elevations for 'leaf-on' conditions in August 2022. The data were collected by the NSIDC_CPRD organization as part of the SMAPVEX19-22 campaign to validate satellite-derived soil moisture estimates in forested areas. The DSM captures the highest elevation of features, which may include bare-earth, vegetation, and human-made objects.