Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Malayalam Instruct Dataset-L is a large-scale instruction-tuning dataset for the Malayalam language. It was programmatically compiled from over 20 multilingual text corpora, translation engines, and RSS feeds, heavily featuring the CulturaX database. The dataset was created by author siyah1 and was last updated on June 17, 2026.
License information is unknown.