1 raw text file containing the complete articles, provisions, and sections of the Constitution of India. The dataset comprises approximately 5,000 words sourced directly from official Government of India documents for legal NLP applications.
Use Cases
- Develop a legal question-answering system by training on the articles and provisions text.
- Perform keyword extraction and frequency analysis on the 5,000-word corpus to identify core legal themes.
- Fine-tune a language model for legal domain adaptation using the raw text of the Indian Constitution.
Strengths
- Contains the full text of all articles and provisions of the Indian Constitution.
- Sourced from official Government of India documentation.
- Provided in a raw TXT format for easy text analysis and preprocessing.