Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A Portuguese translation of the 'mlabonne/orpo-dpo-mix-40k' dataset, created by user BornSaint and last updated on 2025-05 06. The dataset was translated using a quantized machine translation model over more than a week on a single GPU thread. The original dataset is likely used for preference tuning and reinforcement learning from human feedback.
License is unknown; users must verify terms before use. The dataset page notes a link to a full description.