Penn State ScholarSphere provides 9,363 open access books with page images and bibliographic metadata extracted from MARC21 records. The dataset was curated by biglam for training and evaluating vision language models on automatic metadata extraction from scholarly monographs. It was last updated on October 16, 2025.
The license is unknown; verify the terms of use before using the dataset.