TPBench: A Benchmark for Dialogue Compression at Turning Points

Name: TPBench: A Benchmark for Dialogue Compression at Turning Points
Creator: 4papersubmission
Published: 2026-04-29T12:02:36
Keywords: NLP Evaluation, Benchmark, Text, Dialogue Compression, Turning Points

Available on 1 platform

Description

TPBench is a dataset for evaluating long-dialogue compression around lifecycle turning points. It was submitted as an artifact for the NeurIPS 2026 Evaluations and Datasets Track by the author '4papersubmission'. The dataset includes probe JSONL files, result aggregates, scorer/reader code, license disclosures, and Croissant metadata with Responsible AI fields.

Use Cases

Benchmarking dialogue compression models, with a focus on lifecycle turning points in long dialogues.
Evaluating model performance on long-dialogue tasks using the provided probe files and scorer code.
Studying Responsible AI practices in dataset construction using the included Croissant metadata.
Analyzing compression quality around the lifecycle turning points the dataset targets.

Strengths

Includes Responsible AI metadata fields as part of the Croissant metadata.
Provides evaluation code (scorer/reader) alongside the data artifacts.
Designed for a specific, defined NLP task: evaluating long-dialogue compression around turning points.
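
To make the Responsible AI point concrete, here is a minimal sketch of listing the RAI fields in a Croissant metadata document. It assumes the fields appear as top-level keys with the `rai:` prefix used by the Croissant RAI vocabulary; the example document and its values are made up for illustration and are not taken from TPBench.

```python
import json


def list_rai_fields(croissant_doc):
    """Return the sorted top-level keys that carry the 'rai:' prefix."""
    return sorted(key for key in croissant_doc if key.startswith("rai:"))


# Made-up stand-in for a Croissant JSON-LD file (assumption: not TPBench's real metadata).
example = json.loads(
    '{"@type": "sc:Dataset", "name": "demo", '
    '"rai:dataCollection": "placeholder", "rai:dataBiases": "placeholder"}'
)
print(list_rai_fields(example))
```

To check the real artifact, the same function could be applied to the parsed contents of the dataset's Croissant file once it has been downloaded.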

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
The dataset's provenance and organization are listed as unknown.
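
Since field semantics and row counts are undocumented, a first step after download might be to scan a probe file and tally what it contains. The sketch below assumes only that the probe files are standard JSONL (one JSON value per line); it makes no assumption about field names, and any path passed to it is the reader's own.

```python
import json
from collections import Counter
from pathlib import Path


def summarize_jsonl(path):
    """Return (row_count, Counter mapping field name -> rows containing it)."""
    field_counts = Counter()
    row_count = 0
    with Path(path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line:
                continue  # ignore blank lines between records
            record = json.loads(line)
            row_count += 1
            if isinstance(record, dict):
                # Count each top-level field once per row it appears in.
                field_counts.update(record.keys())
    return row_count, field_counts
```

Calling `summarize_jsonl` on a downloaded probe file returns the number of rows and how often each top-level field occurs, which gives a starting point for inferring the schema.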

Provenance

Source: Author '4papersubmission' on Hugging Face.
Collection Method: Created as an artifact for NeurIPS 2026 Evaluations and Datasets Track.
Freshness: Last updated 2026-05-07 06:22:21; verify against the source before relying on this date.

Quality Score

Overall: D37
Description: 39
Source: 36
Reputation: 41
Access: 26

Community

Downloads: 49
Likes: 1
Views: 0

Dataset Info

Author: 4papersubmission
Created: Apr 29, 2026
Updated: May 7, 2026
Last synced: May 14, 2026
