Companion training data for the LiteResearcher paper, which describes a low-cost, scalable agentic RL training framework for deep-research agents. The dataset contains the two-stage curriculum of question–answer prompts used to train the LiteResearcher-4B model with on-policy GRPO+TIS against a local search/browse environment. The two curriculum stages share a single validation set.
The full description is hosted externally; users must visit the provided Hugging Face link for complete details. The license is unknown and must be verified before use.