Loading...
Loading...
Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
1,907 datasets
Berkeley, California is the location associated with this musical score. The dataset is a 21.4 KB PDF file containing the second movement of Sonata VI in G major from Berkeley Manuscript 792, likely an Adagio. It was authored by Matthew James Zenas Dicken and last updated on April 13, 2026.
A 14.5 KB PDF file contains the musical score for Capriccio 23 in F Major. The piece, authored by Matthew James Zenas Dicken, features 2 and 3 part texture and was last updated on figshare in April 2026.
One missing sonata from a set of 12, with 10 present in the related Ms 791 collection. The dataset is a 24.3 KB PDF of the first movement (Allegro assai) of Sonata VI in G major, authored by Matthew James Zenas Dicken and last updated in April 2026. It is shared under a CC-BY-4.0 license on figshare.
A 26.8 KB PDF file containing the second movement (Allegro moderato) of Sonata 1 in G major for keyboard, violin, and cello from the Berkeley Ms 793 manuscript. The dataset was uploaded by Matthew James Zenas Dicken on figshare and last updated on 2026-04-13. It is licensed under CC-BY-4.0.
18.8 KB PDF of the third movement from a missing sonata in a historical music manuscript collection. The file contains the Presto movement from Sonata VI in G major, part of a set of 12 sonatas, authored by Matthew James Zenas Dicken and last updated in April 2026. This single movement is sourced from the Berkeley Ms 792 manuscript, f. 3v-4r.
A PDF musical score for the first movement (Allegro assai) of Sonata 1 in G major, part of a set of three sonatas for keyboard with cello and violin. The 22.3 KB file, authored by Matthew James Zenas Dicken, was last updated on April 13, 2026, and is shared under a CC-BY-4.0 license on figshare.
Berkeley Ms 791, page 88, contains the musical score for Capriccio 24 in A major, a piece of chamber music. The score is a 15.6 KB PDF file uploaded by Matthew James Zenas Dicken to figshare in April 2026. The description notes the piece features a varied texture, mostly in two parts.
A 26.0 KB PDF file containing the second movement (Molto adagio) of Sonata 3 in G major from a set of three sonatas for keyboard, cello, and violin. The score is from Berkeley Ms 793, pages 12r-13r, and was uploaded by Matthew James Zenas Dicken to figshare in April 2026.
Berkeley Ms 793, f. 8v contains the second movement (Adagio) of Sonata 2 in C major from a set of three sonatas for keyboard, cello, and violin. The dataset is a 21.5 KB PDF file uploaded by Matthew James Zenas Dicken to figshare under a CC-BY-4.0 license. It was last updated on April 13, -2026.
A 23.0 KB PDF file containing the musical score for the first movement (Allegro assai) of Sonata 2 in C major from a set of three sonatas for keyboard, cello, and violin. The score is from the Berkeley Ms 793 manuscript, authored by Matthew James Zenas Dicken and last updated on 2026-04-13. The dataset is licensed under CC-BY-4.0.
Berkeley Ms 793, ff. 12v-14r, contains the second movement (Allegrino) of Sonata 3 in G major from a set of three sonatas for keyboard, cello, and violin. The dataset is a 24.9 KB PDF file authored by Matthew James Zenas Dicken and last updated on 2026-04-13. It is shared under a CC-BY-4.0 license on the figshare platform.
Berkeley Ms 793, ff. 8v-10v, contains the third movement (Presto assai) of Sonata 2 in C major, part of a set of three sonatas for keyboard, cello, and violin. The dataset is a 20.9 KB PDF file authored by Matthew James Zenas Dicken and shared under a CC-BY-4.0 license on figshare. It was last updated on April 13, 2026.
A 23.8 KB PDF file containing the musical score for the first movement (Allegro) of Sonata 3 in G major from a set of three sonatas for keyboard, cello, and violin. The score is from the Berkeley Ms 793 manuscript, folios 10r-12r, and was authored by Matthew James Zenas Dicken. It was last updated on April 13, 2026.
Berkeley Ms 795 is a manuscript containing 25 variations on a theme, structured for two parts (treble and bass). The 18.0 MB PDF file, authored by Matthew James Zenas Dicken, was last updated on figshare in April 2026. This sketch can be performed by two separate instruments or on a single keyboard.
15,000 examples intended to train large language models to emulate the creative decision-making of prominent hip-hop producers. The dataset, created by user gss1147, was last updated on Hugging Face in April 2026. It aims to teach AI the combined mindset of producers like Lil Jon, Dr. Dre, and Pharrell.
A domain-specific, multilingual agricultural speech dataset with a primary focus on Hindi, Telugu, and Odia. It features human-annotated transcriptions and is intended for benchmarking ASR model performance in real-world agricultural scenarios, created by DigiGreen. The dataset page was last updated on 2026-04-15.
An exploratory pilot study protocol investigating the behavioral and neurophysiological response to two types of Rhythmic Auditory Stimulation (RAS) in individuals with Parkinson's Disease. The study, authored by Kyurim Kang and last updated in March 2026, likely contains data on gait parameters and local field potentials recorded from deep brain stimulation devices. The dataset is small, with a file size of 5.5 KB.
UNDP Human Development Reports Office (HDRO) data on human development and multidimensional poverty for Saint Kitts and Nevis. The dataset includes the Human Development Index (HDI), which measures average achievement in health, knowledge, and living standards, and the 2019 Global Multidimensional Poverty Index (MPI). The data was last updated on 2026-03-04 03:36:00.965148.
SPGISpeech 2.0 is a dataset for speaker-tagged transcription in the financial domain, created by Kensho. It contains audio snippets and their corresponding fully formatted text transcriptions, suitable for end-to-end automatic speech recognition (ASR). The dataset improves the diversity of applicable modeling tasks while maintaining the core characteristics of the original SPGISpeech dataset.
74,858 high-quality Vietnamese audio samples with phonemized transcripts, designed for fine-tuning modern Text-to-Speech models. The dataset was created by LanguaMan, who collected audio from YouTube, cleaned background noise, and used the Whisper-large-v3 model for transcription, followed by agent-assisted spelling correction and human feedback. The dataset page was last updated on April 21, 2026.