PaTaRM: Training Data for Pairwise and Pointwise Alignment | DataSalon

Education

Name: PaTaRM: Training Data for Pairwise and Pointwise Alignment
Creator: AIJian
Published: 2026-03-20T03:36:14
Keywords: Machine Learning, AI Safety, Preference Data, Text, LLM Training

Available on 1 platform

Description

PaTaRM-data is the training data for the PaTaRM series, a collection for aligning language models. The dataset contains two subsets: one with 35.6k samples for supervised fine-tuning (SFT) and another with 41.7k samples for reinforcement learning (RL). It was created by AIJian and last updated on April 1, 2026.

Use Cases

Supervised fine-tuning of language models using the 35.6k SFT samples
Reinforcement learning training, for example in RLHF-style pipelines, using the 41.7k RL samples
Bridging pairwise and pointwise alignment signals as referenced in the associated research paper
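
As a generic illustration of how pairwise and pointwise signals relate, the sketch below expands a pairwise preference record into two pointwise records. It is not the PaTaRM method and does not reflect the dataset's actual schema: the field names `prompt`, `chosen`, `rejected`, `response`, and `label`, and the function `pairwise_to_pointwise`, are hypothetical and chosen only for the example.

```python
from typing import Dict, List


def pairwise_to_pointwise(pairs: List[Dict[str, str]]) -> List[Dict[str, object]]:
    """Expand each pairwise preference record into two pointwise records.

    All field names here are hypothetical; the real dataset columns are
    undocumented on this page.
    """
    pointwise: List[Dict[str, object]] = []
    for pair in pairs:
        # The preferred response gets label 1, the rejected response label 0.
        pointwise.append(
            {"prompt": pair["prompt"], "response": pair["chosen"], "label": 1}
        )
        pointwise.append(
            {"prompt": pair["prompt"], "response": pair["rejected"], "label": 0}
        )
    return pointwise


# Made-up record for illustration, not taken from the dataset.
example_pairs = [{"prompt": "What is 2 + 2?", "chosen": "4", "rejected": "5"}]
example_pointwise = pairwise_to_pointwise(example_pairs)
```

A pointwise record scores a single response in isolation, while a pairwise record only says which of two responses is preferred, so the expansion above keeps the preference as a binary label at the cost of discarding the explicit comparison.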

Strengths

Contains 35.6k samples for supervised fine-tuning
Contains 41.7k samples for reinforcement learning training
Associated with a cited research paper (Jian, 2026)

Limitations

Column-level documentation is absent; field semantics must be inferred after download
A total row count for the full dataset is not reported beyond the two subset sizes (35.6k and 41.7k), which may limit suitability assessment
Data format and sample data are unavailable for preview
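
Because field semantics have to be inferred after download, a first step could be to list which fields appear and which value types they hold. The sketch below assumes the downloaded file is in JSON Lines format, which this page does not confirm; the helper names `read_jsonl` and `summarize_fields` and the sample records are hypothetical.

```python
import json
from collections import defaultdict
from typing import Dict, Iterable, Iterator, Set


def read_jsonl(path: str, limit: int = 100) -> Iterator[dict]:
    """Yield parsed records from the first `limit` lines of a JSON Lines file."""
    with open(path, encoding="utf-8") as handle:
        for index, line in enumerate(handle):
            if index >= limit:
                break
            if line.strip():  # skip blank lines
                yield json.loads(line)


def summarize_fields(records: Iterable[dict]) -> Dict[str, Set[str]]:
    """Map each field name to the set of Python type names seen for it."""
    seen: Dict[str, Set[str]] = defaultdict(set)
    for record in records:
        for key, value in record.items():
            seen[key].add(type(value).__name__)
    return dict(seen)


# Made-up lines for illustration; the real columns are undocumented.
sample_lines = ['{"prompt": "hi", "score": 1}', '{"prompt": "yo", "score": 0.5}']
sample_summary = summarize_fields(json.loads(line) for line in sample_lines)
```

With a real file, `summarize_fields(read_jsonl(path))` would give a quick view of the columns before any training code depends on them. If the data ships in another format, such as Parquet or CSV, only the reader would need to change.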

Provenance

Source: AIJian
Collection Method: Likely created for training the PaTaRM series models, but the specific collection method is not documented.
Freshness: Last updated 2026-04-01 06:19:25

For full details including data format and usage examples, users must refer to the main collection README linked in the description.

Quality Score

D37

Community

30 downloads

1 like

0 views

Dataset Info

Author: AIJian
Created: Mar 20, 2026
Updated: Apr 1, 2026
Last synced: May 22, 2026
