Sign in to view source links and access this dataset
Description
Indian legal documents from writs, PIL, civil procedure, constitutional law, and the Indian Penal Code. The dataset contains 26,326 examples preprocessed into ChatML format for supervised fine-tuning, sourced from the viber1/indian-law-dataset. It was created by pkheria7 and last updated on Hugging Face in April 2026.
Use Cases
Train an opposing counsel AI model based on the described legal document examples.
Fine-tune language models for Indian legal text generation based on the provided ChatML format.
Conduct research on legal argumentation patterns based on the described content areas like writs and constitutional law.
Develop educational tools for legal reasoning based on the structured examples of Indian law.
Strengths
Contains 26,326 total examples, providing a substantial corpus for model training.
Includes a predefined train-test split of 25,009 and 1,317 rows respectively.
Data is preprocessed into a ready-to-use ChatML format for SFT training.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
Source
viber1/indian-law-dataset on Hugging Face.
Collection Method
Combined and preprocessed from source dataset.
Freshness
Last updated 2026-04-26 07:35:06; freshness should be verified.
Geography
India
License is unknown; users should verify licensing terms before use.