A dataset of job postings, likely containing both real and fraudulent listings. It was published on Kaggle, but the specific collection date, author, and data volume are unknown. The dataset's primary purpose appears to be for training models to identify deceptive employment advertisements.
Use Cases
- Train a binary classifier to detect fraudulent job advertisements (inferred from domain, verify after download)
- Analyze linguistic patterns and keywords common in fake job postings (inferred from domain, verify after download)
- Benchmark anomaly detection algorithms on imbalanced text data (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a platform with an established community for data sharing.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Row count, column definitions, and license information are unknown.
- Data may reflect temporal or source bias inherent to its original collection on Kaggle.