Available on 1 platform
Microsoft's Orca Agentinstruct 1M V1 is a fully synthetic dataset containing approximately 1 million instruction-response pairs. The data was generated with the AgentInstruct framework, which uses raw text from the public web as seed material. The dataset was published in November 2024.
License terms are not listed here; users must verify permissions before use. The dataset page recommends viewing the full description on Hugging Face for complete details.