Gorkhapatra newspaper pages likely intended for optical character recognition tasks. The dataset is hosted on Kaggle, but its specific contents, scale, and creation details are not provided in the available metadata. The title suggests it contains scanned images of the historic Nepali newspaper, Gorkhapatra.
Use Cases
- Train an OCR model on historical Nepali newspaper text (inferred from domain, verify after download)
- Benchmark document layout analysis algorithms (inferred from domain, verify after download)
- Digitize and archive historical Nepali publications (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a major platform for sharing datasets.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Row count, file formats, and data quality are unknown.
- Column-level documentation is absent; field semantics must be inferred after download.
Provenance
- Geography
- Nepal (inferred from newspaper name)