Available on 1 platform
The ScrapeGraphAI 100k finetuning dataset contains 25,244 training and 2,808 test examples for structured data extraction. It was preprocessed from a raw collection by filtering out examples that exceed character limits and by chunking long content. It was created by scrapegraphai and last updated on February 6, 2026.
The license is unknown, which may restrict usage.
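The preprocessing mentioned in the description (dropping examples over a character limit, then chunking long content) could look roughly like the sketch below. This is an illustration only, not the dataset authors' pipeline: the `content` field name, the `MAX_CHARS` and `CHUNK_CHARS` values, and the helper names `chunk_text` and `preprocess` are all assumptions, and the actual limits and chunking strategy are not stated in the source.

```python
from typing import Dict, Iterable, List

# Hypothetical limits; the real values used for this dataset are not documented here.
MAX_CHARS = 1000   # examples longer than this are dropped
CHUNK_CHARS = 400  # remaining content is split into pieces of at most this size


def chunk_text(text: str, size: int) -> List[str]:
    """Split text into consecutive pieces of at most `size` characters."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def preprocess(
    examples: Iterable[Dict[str, str]],
    max_chars: int = MAX_CHARS,
    chunk_chars: int = CHUNK_CHARS,
) -> List[Dict[str, str]]:
    """Drop over-long examples, then emit one record per chunk of content."""
    processed: List[Dict[str, str]] = []
    for example in examples:
        content = example["content"]  # assumed field name
        if len(content) > max_chars:
            continue  # filter step: skip examples exceeding the character limit
        for piece in chunk_text(content, chunk_chars):
            # chunk step: copy the other fields and replace the content
            processed.append({**example, "content": piece})
    return processed


raw = [
    {"id": "a", "content": "x" * 50},
    {"id": "b", "content": "y" * 900},
    {"id": "c", "content": "z" * 1500},
]
result = preprocess(raw)
```

With these assumed limits, example `a` passes through as a single record, `b` is split into three chunks, and `c` is dropped for exceeding the limit.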