2 document images from the DocVQA dataset serve as fixtures for the HuggingFace Transformers library. These samples facilitate the testing of LayoutLMv2FeatureExtractor and LayoutLMv2Processor across specific unit test files.
Use Cases
- Validate the feature extraction capabilities of LayoutLMv2FeatureExtractor using the document images
- Perform regression testing for LayoutLMv2Processor within the HuggingFace Transformers framework
- Benchmark document image preprocessing steps against the expected outputs in tests/test_processor_layoutlmv2.py
Strengths
- Includes 2 document images from the DocVQA collection
- Used as the primary data source for tests/test_feature_extraction_layoutlmv2.py
- Supports validation of the LayoutLMv2Processor class