Description
InRhetoricalRoles is a corpus for the automatic structuring of legal documents, presented at the Language Resources and Evaluation Conference (LREC) in 2022. Its authors include Prathamesh Kalamkar, Aman Tiwari, Astha Agarwal, Saurabh Karn, Smita Gupta, Vivek Raghavan, and Ashutosh Modi. It was last updated on the Hugging Face platform on 2024-05-08.
Use Cases
Train models for rhetorical role classification based on the corpus's annotation scheme.
Develop systems for automatic segmentation and structuring of legal documents.
Benchmark NLP tools for legal text processing and summarization.
Analyze linguistic patterns and argumentation structures in legal texts.
Strengths
The dataset is associated with a peer-reviewed conference paper (LREC 2022).
The corpus is specifically designed for the structured analysis of legal documents.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which makes it hard to judge in advance whether the corpus is large enough for a given task.
Description metadata is limited; actual data quality requires manual inspection after download.
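Because the fields are undocumented, a first pass after download is usually a quick schema summary. The following is a minimal sketch, assuming the records can be read as an iterable of dictionaries (for example, one split loaded with the Hugging Face `datasets` library). The helper name `summarize_fields` and the sample field names `id`, `text`, and `label` are hypothetical placeholders, not the corpus's actual schema.

```python
from __future__ import annotations

from collections import Counter, defaultdict
from typing import Any, Iterable, Mapping


def summarize_fields(rows: Iterable[Mapping[str, Any]]) -> dict[str, dict[str, Any]]:
    """Per field: observed value types, count of missing values, one example."""
    type_counts: dict[str, Counter] = defaultdict(Counter)
    examples: dict[str, Any] = {}
    total = 0
    for row in rows:
        total += 1
        for name, value in row.items():
            type_counts[name][type(value).__name__] += 1
            # Keep the first non-null value as a representative example.
            if value is not None and name not in examples:
                examples[name] = value
    return {
        name: {
            "types": dict(counts),
            # Missing = rows where the key is absent + rows where it is None.
            "missing": total - sum(counts.values()) + counts.get("NoneType", 0),
            "example": examples.get(name),
        }
        for name, counts in type_counts.items()
    }


# Hypothetical records for illustration only; the real field names and
# label values must be read from the downloaded data.
sample_rows = [
    {"id": 1, "text": "first sentence", "label": "LABEL_A"},
    {"id": 2, "text": "second sentence", "label": None},
    {"id": 3, "text": "third sentence"},
]

summary = summarize_fields(sample_rows)
for field, info in summary.items():
    print(field, info)
```

Running the same helper over the real records gives the observed types and missing-value counts per field, which is a reasonable starting point for inferring field semantics and checking data quality by hand.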
Provenance
Source
opennyaiorg
Freshness
Last updated 2024-05-08 06:28:21; freshness should be verified.
License
Unknown; terms of use must be verified before the dataset is used.