Reddit NSFW Dataset

Available on 1 platform

Sign in to view source links and access this dataset

Description

Kaggle hosts a dataset of content from the social media platform Reddit, specifically flagged as Not Safe For Work (NSFW). The dataset's size, specific collection method, and time range are not detailed in the available metadata. Its author and organization are also unknown.

Use Cases

Training a binary classifier to detect NSFW text or image posts (inferred from domain, verify after download)
Analyzing linguistic patterns or topics within flagged social media content (inferred from domain, verify after download)
Benchmarking content filtering algorithms against real-world user-generated data (inferred from domain, verify after download)

Strengths

Published on Kaggle, a major platform for sharing datasets.

Limitations

Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Data may reflect temporal or source bias inherent to its collection from Reddit.

Provenance

Source: Reddit

Text Social Media Nsfw Content Text Data

Related Datasets

Quality Score

D16

Description

8

Source

17

Reputation

18

Access

31

Community

0 views

Dataset Info

Last synced: Jun 7, 2026

Access

31

Community

0 views

Dataset Info

Last synced: Jun 7, 2026

Reddit NSFW Dataset

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info