This dataset contains 4,000 synthetic U.S. federal tax law question-and-answer pairs, categorized into six domains, including international, estate and gift, and business entity taxation. Each answer cites the Internal Revenue Code (IRC) for legal grounding. The data is split into 3,500 training examples and 500 test examples.
Use Cases
- Fine-tune large language models for legal reasoning, using the category and subcategory fields to select or balance training examples by domain
- Develop automated tax advisory systems that cite specific legal authorities using the IRC grounding provided in the answers
- Evaluate model performance on complex tax scenarios using the 500-example test split
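As a rough illustration of the domain-specific use cases above, the following sketch groups records by their category field so that each domain can be fine-tuned or scored separately. It assumes each record is a plain dictionary; the `category` and `subcategory` field names come from the description above, while the `question` and `answer` field names and the sample records are hypothetical placeholders, not actual dataset content.

```python
from collections import defaultdict


def group_by_category(records):
    """Group records into lists keyed by their `category` field."""
    groups = defaultdict(list)
    for record in records:
        groups[record["category"]].append(record)
    return dict(groups)


# Hypothetical placeholder records; the real field names and values may differ.
sample = [
    {"category": "international", "subcategory": "placeholder_a",
     "question": "Q1", "answer": "A1"},
    {"category": "estate_gift", "subcategory": "placeholder_b",
     "question": "Q2", "answer": "A2"},
    {"category": "international", "subcategory": "placeholder_c",
     "question": "Q3", "answer": "A3"},
]

groups = group_by_category(sample)
counts = {name: len(items) for name, items in groups.items()}
print(counts)
```

The same grouping can be applied to the test split to report results per domain rather than as a single aggregate score.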
Strengths
- 4,000 total examples split into 3,500 training and 500 testing records
- Categorized into 6 distinct tax law domains: international, estate_gift, business_entity, individual, procedure, and specialized
- Includes Internal Revenue Code (IRC) citation grounding for every question-answer pair
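The properties listed above can be spot-checked with a small validation pass. The sketch below is only one possible approach: the six category names are taken from the list above, but the `answer` field name and the citation formats matched by the regular expression (such as "IRC § 61(a)" or "IRC Section 1031") are assumptions, since the exact citation style used in the dataset is not specified here.

```python
import re

# Category names as listed above.
ALLOWED_CATEGORIES = {
    "international", "estate_gift", "business_entity",
    "individual", "procedure", "specialized",
}

# Assumed citation styles: "IRC § 61(a)", "I.R.C. § 2503(b)(1)", "IRC Section 1031".
IRC_PATTERN = re.compile(
    r"(?:IRC|I\.R\.C\.)\s*(?:§+|Sec(?:tion|\.)?)\s*"
    r"(\d+[A-Z]?(?:\([A-Za-z0-9]+\))*)"
)


def check_record(record):
    """Return a list of problems found in one record (empty if none)."""
    problems = []
    if record.get("category") not in ALLOWED_CATEGORIES:
        problems.append("unknown category")
    # `answer` is a hypothetical field name for the answer text.
    if not IRC_PATTERN.findall(record.get("answer", "")):
        problems.append("no IRC citation found")
    return problems


# Hypothetical example record, written for illustration only.
example = {"category": "individual",
           "answer": "Under IRC § 61(a), gross income is broadly defined."}
print(check_record(example))
```

A record that fails either check is not necessarily wrong; it may simply use a citation format the pattern does not cover, so flagged records are best reviewed by hand.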