Sign in to view source links and access this dataset
Description
A dataset of image captions related to illegal activities, created by Lenkashell and last updated on July 16, 2025. The dataset is hosted on HuggingFace, but its specific content, size, and structure are not detailed. Its intended purpose appears to be for training or evaluating content safety models.
Use Cases
Training content moderation classifiers based on textual descriptions of images.
Evaluating the safety alignment of vision-language models based on their handling of sensitive topics.
Benchmarking the performance of safety filters against descriptions of illegal activities.
Studying the representation of harmful concepts in multimodal training data.
Strengths
Dataset is hosted on HuggingFace, a major platform for AI datasets.
Last updated on July 16, 2025, indicating recent maintenance.
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Row count, column definitions, and sample data are unknown, which limits suitability assessment.
License information is unavailable, complicating usage rights evaluation.
Provenance
Source
Lenkashell via HuggingFace.
Freshness
Last updated 2025-07-16 17:27:11.
License is unknown; users must verify terms before use.