Loading...
Loading...
Image-text pairs, instruction tuning, visual QA, cross-modal data, foundation model training data
1,534 datasets
A multimodal dataset from a three-stage study examining color preference stability in spatial contexts. The data includes baseline preferences for ten Munsell hues, Preference and Comfort ratings, eye-tracking, and pupillometric data from a simulated makerspace environment, authored by Hourong Yu and last updated in May 2026. The dataset is shared under a CC-BY-4.0 license on figshare.
3,771 labeled satellite images from Landsat-8 and GOES-16 sources, split into training, validation, and test subsets. The dataset was created by Aydin Ayanzadeh for early wildfire detection and smoke analysis, with images resized to 416 × 416 pixels. It was last updated on April 20, 2026.
86.7 KB of data supporting a neural-symbolic dynamic graph framework for real-time anomaly detection. The dataset, authored by Senlin Jiang, was last updated on May 30, 2026, and is shared under a CC-BY-4.0 license on figshare. It is associated with a model designed to address cross-modal attack evidence and dynamic topology changes in industrial systems.
1,287 elderly clinical records from Beijing Jishuitan Hospital, including 643 patients with hip fractures and 644 controls, were analyzed to develop a progressive fusion model for risk prediction. The model, created by Songyuan Chen, achieved an accuracy of 90.94% and an AUC of 0.9423 on an independent test set. The dataset was last updated on 2026-04 28.
YTClickbait21K is a human-annotated dataset of 21,238 YouTube videos for clickbait detection research. It includes video metadata like titles, descriptions, and thumbnails, along with binary clickbait labels from three annotators per video. The dataset was uploaded by Md. Minhazul Islam to figshare on April 9, 2026.
47 video appearances by Italian Prime Minister Giorgia Meloni are analyzed through a 74-variable multimodal coding scheme. The dataset, created by Canan Cetin and last updated in 2026, supports a study on communicative domestication, tracking shifts in themes, gestures, and presentation from opposition to office. Analysis reveals significant changes, such as motherhood references collapsing from 43% to 2% of appearances.
1.2 GB of power quality measurements recorded during laboratory experiments on high-impedance faults in medium-voltage covered conductors. The data includes RMS voltage and current, harmonics, phase angles, and power metrics, collected using a Hioki PQ3198 analyzer following IEC standards. Author Diogo Biasuz Dahlke published this dataset on figshare in May 2026.
A methodological framework and computational pipeline for evaluating symbolic and emotional responses to virtual architectural spaces. The framework integrates symbolic modeling of concepts like justice and identity with psychophysiological measures and AI-ready computational inference. It was authored by Jesus Rafael Hechavarria-Hernandez and published on figshare in May 2026.
Waveform data from a multimodal dataset of high-impedance faults in medium-voltage covered conductors contains high-resolution electrical waveform recordings from controlled laboratory experiments. Diogo Biasuz Dahlke collected the data using a Hioki MR8741 waveform recorder, sampling synchronized voltage and current at 20 kS/s. The dataset was last updated on 2026-05-05.
41,914 unattributed artworks form a curated subset of the OpenBrush-75K dataset, designed to provide broad-style training data without artist-specific bias. The dataset was created by jaddai and was last updated on 2026-05-27. It is derived from a parent collection of 75,313 images, with the same CC0 license and caption schema.
110 participants with bulimia nervosa, binge eating disorder, and matched controls were studied using a multimodal machine learning framework. The dataset integrates task-based fMRI, intrinsic connectivity, voxel-based morphometry, neuropsychological assessments, and peripheral blood biomarkers. It was authored by Lena Rommerskirchen and last updated on 2026-04 13.
HakushoBench is a Japanese visual question answering benchmark built from 33 governmental white papers. It contains 2,053 images spanning over 10 chart and table types, with manually annotated QA pairs. The dataset was created by llm-jp and last updated on Hugging Face in June 2026.
India is the source for this dataset of two-wheeler rider behavior in dense, unstructured traffic. The full dataset comprises 1,629 annotated sequences (~25 hours) from 16 riders, collected across diverse traffic scenarios. It was created by Voxel51 and is a multi-view, multimodal dataset.
OpenBrush Landscapes is a curated subset of the OpenBrush-75K dataset containing every landscape painting from the parent collection. It includes 12,612 images across all artists, movements, and centuries, curated so users do not need to download the full 75,313-image dataset. The subset was created by jaddai and was last updated on May 27, 2026.
6,119 religious paintings curated from the OpenBrush-75K collection. The dataset focuses on saints, biblical scenes, and devotional works, with a heavy emphasis on Renaissance and Baroque eras. It was created by jaddai and last updated on May 27, 2026.
4.3 GB of thermal recordings from controlled laboratory experiments on high-impedance faults in medium-voltage covered conductors. Diogo Biasuz Dahlke created the dataset, which includes thermal video files (MP4) and native radiometric files (HRV) capturing temperature evolution during fault initiation. The dataset was last updated on 2026-05-05.
13,059 portrait paintings curated from the OpenBrush-75K dataset, spanning artistic movements from Renaissance to Realist. The subset was created by jaddai using the Qwen3-VL-30B-A3B vision-language model and last updated on May 27, 2026. It provides a focused collection of portraits under a CC0 license.
A research dataset containing performance metrics for a multimodal conversational agent named EAC-Agent. The dataset likely contains results from validation on benchmark datasets IEMOCAP and MELD. It was uploaded by Shahid Jamil to figshare on 2026-04-17.
4,240 Baroque-era artworks curated from the larger OpenBrush-75K collection. The subset focuses on the canonical Baroque visual language from approximately 1600 to 1750, characterized by chiaroscuro and dramatic lighting. It was created by jaddai and last updated on May 27, 2026.
UniCure is a multi-modal framework integrating omics and chemical foundation models to predict transcriptomic drug responses. This repository contains the pre-processed datasets, configuration files, and pre-trained model weights required to reproduce the results. The archive is 12.4 GB and was last updated on 2026-04-23 by Zexi Chen.