DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Honesty Reason 120X: 120 Synthetic Examples for LLM Truthfulness | DataSalon

Home Genomics & BioinformaticsHonesty Reason 120X: 120 Synthetic Examples for LLM Truthfulness

Genomics & Bioinformatics

Honesty Reason 120X: 120 Synthetic Examples for LLM Truthfulness

Name: Honesty Reason 120X: 120 Synthetic Examples for LLM Truthfulness
Creator: Aadeshisdoingsomething
Published: 2026-06-03T22:56:13
Keywords: Truthfulness, Text, Reasoning, Llm Training, Synthetic Data, Synthetic

by Aadeshisdoingsomething·Updated 26d ago

Available on 1 platform

Description

120 high-quality synthetic examples are designed to train smaller language models to become more truthful and recognize their own knowledge boundaries. The dataset, created by Aadeshisdoingsomething, forces models to output a structured query-and-verification routine before answering. It was last updated on June 4, 2026.

Use Cases

Training models to output a verification scratchpad based on the described structured routine.
Improving model truthfulness based on the dataset's focus on explicit verification.
Teaching models to recognize knowledge boundaries using the provided synthetic examples.

Strengths

Contains 120 examples explicitly described as high-quality.
Designed for a specific model size range of 1B to 8B parameters.
Focuses on a clear, structured training mechanic for truthfulness.

Limitations

Row count is unknown, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: huggingface user Aadeshisdoingsomething
Collection Method: Synthetically generated, as described.
Freshness: Last updated 2026-06-04 19:14:59; freshness should be verified.

Text Truthfulness Reasoning Llm Training Synthetic Data Synthetic

Related Datasets

Quality Score

D37

Description

Source

Reputation

Quality Score

D37

Description

Source

Reputation

Access

Community

14 downloads

1 likes

0 views

Dataset Info

Author: Aadeshisdoingsomething
Created: Jun 3, 2026
Updated: Jun 4, 2026
Last synced: Jun 10, 2026

Access

Community

14 downloads

1 likes

0 views

Dataset Info

Author: Aadeshisdoingsomething
Created: Jun 3, 2026
Updated: Jun 4, 2026
Last synced: Jun 10, 2026

Honesty Reason 120X: 120 Synthetic Examples for LLM Truthfulness

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info