ArSyra: Arabic Instruction Tuning Dataset for LLMs

Available on 1 platform

Sign in to view source links and access this dataset

Description

Instruction tuning data for fine-tuning large language models on Arabic language tasks. The dataset is hosted on Kaggle, but its specific size, creation date, and authorship are not provided in the available metadata. Columns and sample data are unknown, limiting immediate assessment of its content and structure.

Use Cases

Fine-tune a base LLM to follow Arabic instructions (inferred from domain, verify after download)
Benchmark the performance of multilingual models on Arabic tasks (inferred from domain, verify after download)
Create a specialized Arabic chatbot or assistant (inferred from domain, verify after download)

Strengths

Published on Kaggle, a platform with established data sharing and versioning.

Limitations

Metadata is minimal; actual content requires verification after download.
Row count, column definitions, and sample data are unknown, which limits suitability assessment.
Data may reflect bias inherent to its unspecified source and collection method.

Provenance

Source: kaggle

Text Fine Tuning Arabic Nlp Large Language Models

Related Datasets

Quality Score

D15

Description

8

Source

17

Reputation

18

Access

22

Community

0 views

Access

22

Community

0 views

ArSyra: Arabic Instruction Tuning Dataset for LLMs

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Community