Vietnamese Legal Documents Corpus for Information Retrieval Benchmark | DataSalon

Government & Legal

Name: Vietnamese Legal Documents Corpus for Information Retrieval Benchmark
Creator: YuITC
Published: 2025-04-24T06:41:31
Keywords: Legal Documents, Benchmark, Text, Natural Language Processing, Vietnamese Language, Information Retrieval

Available on 1 platform

Description

YuITC's Vietnamese Legal Documents Dataset provides a benchmark corpus for legal information retrieval. The dataset includes a collection of legal documents and train/test splits with natural language queries paired with relevant documents. It was last updated on March 18, 2026.

Use Cases

Benchmarking legal information retrieval models based on the provided corpus and query-document pairs.
Training natural language processing models for Vietnamese legal text understanding.
Evaluating the performance of search algorithms on domain-specific legal queries.
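As a rough illustration of the benchmarking use case, the sketch below scores a retrieval run against query-document relevance pairs using Recall@k and mean reciprocal rank (MRR). The query and document IDs, the `run` and `qrels` structures, and the function names are all hypothetical; the dataset's real field names are not documented on this page and would need to be mapped into this shape after download.

```python
from typing import Dict, List, Set


def recall_at_k(ranked: List[str], relevant: Set[str], k: int) -> float:
    """Fraction of the relevant documents that appear in the top-k results."""
    if not relevant:
        return 0.0
    hits = sum(1 for doc_id in ranked[:k] if doc_id in relevant)
    return hits / len(relevant)


def reciprocal_rank(ranked: List[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant document, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def evaluate(run: Dict[str, List[str]],
             qrels: Dict[str, Set[str]],
             k: int = 10) -> Dict[str, float]:
    """Average Recall@k and MRR over all queries that have relevant documents."""
    queries = [q for q, rel in qrels.items() if rel]
    if not queries:
        return {f"recall@{k}": 0.0, "mrr": 0.0}
    recall = sum(recall_at_k(run.get(q, []), qrels[q], k) for q in queries)
    mrr = sum(reciprocal_rank(run.get(q, []), qrels[q]) for q in queries)
    return {f"recall@{k}": recall / len(queries), "mrr": mrr / len(queries)}


# Hypothetical toy data: IDs are placeholders, not taken from the dataset.
qrels = {"q1": {"d1", "d3"}, "q2": {"d7"}}
run = {"q1": ["d3", "d2", "d1"], "q2": ["d5", "d7", "d9"]}

metrics = evaluate(run, qrels, k=2)
print(metrics)
```

The sketch assumes each ranked list contains unique document IDs. Queries missing from `run` are scored as zero rather than skipped, so an incomplete run is penalized.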

Strengths

Dataset is explicitly designed as a benchmark for legal information retrieval.
Includes structured train/test splits with natural language queries and corresponding relevant documents.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is not reported, so the dataset's size and suitability are hard to assess before download.
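Since the columns are undocumented and the row count is not listed, a first pass after download could profile the records directly. The sketch below is one minimal way to do that, assuming the rows have already been loaded as a list of dictionaries. The column names `query` and `relevant_ids` are invented placeholders, not the dataset's actual fields.

```python
from collections import Counter
from typing import Any, Dict, Iterable


def summarize_columns(rows: Iterable[Dict[str, Any]],
                      sample_size: int = 2) -> Dict[str, Dict[str, Any]]:
    """Collect value types, non-null counts, and a few sample values per column."""
    summary: Dict[str, Dict[str, Any]] = {}
    total = 0
    for row in rows:
        total += 1
        for name, value in row.items():
            col = summary.setdefault(
                name, {"types": Counter(), "non_null": 0, "samples": []}
            )
            col["types"][type(value).__name__] += 1
            if value is not None:
                col["non_null"] += 1
                if len(col["samples"]) < sample_size:
                    col["samples"].append(value)
    # Record the total row count alongside each column for quick comparison.
    for col in summary.values():
        col["rows_seen"] = total
    return summary


# Hypothetical records standing in for a downloaded split.
rows = [
    {"query": "example query A", "relevant_ids": [12, 40]},
    {"query": "example query B", "relevant_ids": None},
    {"query": "example query C", "relevant_ids": [7]},
]

summary = summarize_columns(rows)
for name, info in summary.items():
    print(name, dict(info["types"]), info["non_null"], info["rows_seen"])
```

Comparing `non_null` with `rows_seen` shows which fields are sparsely populated, and the sample values give a starting point for inferring what each field means.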

Provenance

Source: YuITC, based on raw data from tmnam20/BKAI-Legal-Retrieval.
Freshness: Last updated 2026-03-18 12:29:31; freshness should be verified.

Quality Score

D39

Scored on: Description, Source, Reputation

Community

264 downloads

4 likes

0 views

Dataset Info

Author: YuITC
Created: Apr 24, 2025
Updated: Mar 18, 2026
Last synced: Jun 14, 2026

