Hateful Memes: 10,000 Images for Multimodal Hate Speech Detection

Name: Hateful Memes: 10,000 Images for Multimodal Hate Speech Detection
Creator: cs5242-hateful-memes
Published: 2026-03-07T12:33:00
Keywords: Social Media, Memes, Computer Vision, Hateful Content, Multimodal

by cs5242-hateful-memesUpdated 2mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

Facebook's Hateful Memes Challenge dataset (Kiela et al., 2020) contains 10,000 PNG meme images. The dataset is structured into training, development, and test splits, totaling 8,500, 500, 540, 1,000, and 2,000 entries respectively. This mirror was created by cs5242-hateful-memes for reproducibility of a CS5242 (NUS) submission.

Use Cases

Train multimodal hate speech classifiers based on meme images and associated text.
Benchmark model performance on seen and unseen splits for generalization testing.
Study the intersection of visual and textual hateful content in social media memes.
Develop preprocessing pipelines for image-text data pairs mentioned in the description.

Strengths

Contains 10,000 PNG images, providing a substantial corpus for training.
Includes structured splits (train, dev_seen, dev_unseen, test_seen, test_unseen) for robust evaluation.
Based on a well-known benchmark dataset from Meta (Facebook), offering a recognized standard.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: Mirror of the Facebook Hateful Memes Challenge dataset (Kiela et al., 2020).
Collection Method: Merges two existing mirrors of the original Meta release.
Freshness: Last updated 2026-04-24 12:28:47; freshness should be verified.

Multimodal Social Media Memes Computer Vision Hateful Content

Related Datasets

Quality Score

D38

Description

42

Source

36

Reputation

44

Access

26

Community

113 downloads

2 likes

0 views

Dataset Info

Author: cs5242-hateful-memes
Created: Mar 7, 2026
Updated: Apr 24, 2026
Last synced: May 1, 2026

Access

26

Community

113 downloads

2 likes

0 views

Dataset Info

Author: cs5242-hateful-memes
Created: Mar 7, 2026
Updated: Apr 24, 2026
Last synced: May 1, 2026

Hateful Memes: 10,000 Images for Multimodal Hate Speech Detection

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info