Sign in to view source links and access this dataset
Description
WaltonFuture compiled this dataset for Agentic Supervised Fine-Tuning from multiple open-source sources. It contains 711,852 samples covering capabilities like tool calling, code editing, terminal interaction, multi-hop reasoning, and web browsing. The dataset was last updated on April 5, 2026.
Use Cases
Fine-tuning language models for tool calling based on the described agent capabilities
Training models for code editing and terminal interaction tasks mentioned in the description
Developing agents for multi-hop reasoning as indicated by the dataset's scope
Building web browsing agents using the described web browsing capability examples
Strengths
Large scale with 711,852 total samples
Curated and merged from multiple high-quality open-source datasets
Covers a range of agent capabilities including tool calling and multi-hop reasoning
Limitations
Column-level documentation is absent; field semantics must be inferred after download
Description metadata is limited; actual data quality requires manual inspection after download
Row count is unknown, which may limit suitability assessment
Provenance
Source
Compiled from multiple open-source datasets, including MiroVerse-v0.1 (147,985 samples)
Collection Method
Curated and merged from source datasets
Freshness
Last updated 2026-04-05 14:30:23; freshness should be verified
License is unknown; users should verify licensing terms before use.