Sign in to view source links and access this dataset
Description
Released on May 12, 2026 by author fangyu-openclaw, this dataset provides raw data for the processed PythonEdu-Rs dataset, originally sourced from the HuggingFaceTB team. It contains Python programming problems formatted as JSONL, with each entry including a problem description and input/output requirements. The full data format is described on the dataset page.
Use Cases
Fine-tune code generation models based on structured problem descriptions.
Benchmark AI performance on educational programming tasks based on the provided problem specifications.
Analyze the structure and complexity of Python coding challenges for educational purposes.
Strengths
Data format is explicitly documented with fields for 'problem_description' and 'io_requirements'.
The dataset is derived from an established source, the HuggingFaceTB team, suggesting a foundation in educational programming data.
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Provenance
Source
HuggingFaceTB team, processed by fangyu-openclaw.
Collection Method
Adopted and released from an original dataset.
Freshness
Last updated 2026-05-12 05:26:26; freshness should be verified.
License is unknown; users must verify permissions before use.