DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Uncensor V1 Dpo: Uncensored Direct Preference Optimization Dataset | DataSalon

Home Multimodal & LLMUncensor V1 Dpo: Uncensored Direct Preference Optimization Dataset

Multimodal & LLM

Uncensor V1 Dpo: Uncensored Direct Preference Optimization Dataset

Name: Uncensor V1 Dpo: Uncensored Direct Preference Optimization Dataset
Creator: rx-dev
Published: 2025-05-24T23:35:24
Keywords: Size Categories1 Kn10 K, Librarypolars, Modalitytext, Librarymlcroissant, Librarydatasets, Librarypandas, Regionus, JSON

by rx-dev·Updated 1y ago

Available on 1 platform

Description

This DPO dataset contains pairs of harmful prompts and model responses derived from the LLM-LAT/harmful-dataset. It reconfigures the preference structure by labeling standard model refusals as 'rejected' and the original harmful or incorrect answers as 'chosen'.

Use Cases

Fine-tune models using Direct Preference Optimization to minimize refusal behaviors using the 'chosen' and 'rejected' fields
Conduct safety alignment research by analyzing the delta between the 'rejected' refusal text and the 'chosen' harmful text
Develop adversarial testing suites for LLMs based on the harmful prompt-response pairs provided in the dataset

Strengths

Derived directly from the LLM-LAT/harmful-dataset source
Utilizes a DPO format with explicit 'chosen' and 'rejected' response pairs
Inverts the standard alignment objective by designating safety refusals as the 'rejected' class

JSON Size Categories1 Kn10 K Librarypolars Modalitytext Librarymlcroissant Librarydatasets Librarypandas Regionus

Related Datasets

Quality Score

D31

Description

Source

Reputation

Quality Score

D31

Description

Source

Reputation

Access

Community

101 downloads

16 likes

0 views

Dataset Info

Author: rx-dev
Created: May 24, 2025
Updated: May 24, 2025

Access

Community

101 downloads

16 likes

0 views

Dataset Info

Author: rx-dev
Created: May 24, 2025
Updated: May 24, 2025

Uncensor V1 Dpo: Uncensored Direct Preference Optimization Dataset

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info