A gold-standard benchmark dataset for sentence alignment across Sinhala, English, and Tamil. The data was crawled from news websites including Army, Hiru, ITN, and Newsfirst, and the aligned sentences were derived from a prior document alignment dataset.
The full description is available on the Hugging Face dataset page; key details like size, format, and license are not provided in this summary.