Sign in to view source links and access this dataset
Description
An adversarial dataset containing 1,000 distinct communication strings, job descriptions, and direct messaging scripts modeling deceptive recruitment loops. It targets remote software engineering, AI/ML, and data science talent. The dataset was created by author sohaibdevv and was last updated on 2026-05-31.
Use Cases
Train NLP classifiers to detect scam job postings based on deceptive language patterns.
Analyze predatory recruitment tactics targeting remote AI/ML and data science professionals.
Benchmark adversarial text generation models for security research.
Study the structure and content of deceptive direct messaging scripts used in hiring scams.
Strengths
Contains 1,000 distinct communication strings and scripts.
Focuses on a specific, high-impact domain of tech job scams.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
Source
huggingface
Collection Method
Likely collected or synthesized to model deceptive recruitment communications.
Freshness
Last updated 2026-05-31 16:49:24; freshness should be verified.
License is unknown; terms of use must be verified before application.