DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

AdvancedIF: A Benchmark for Complex and Multi-Turn LLM Instruction Following | DataSalon

Home Multimodal & LLMAdvancedIF: A Benchmark for Complex and Multi-Turn LLM Instruction Following

Multimodal & LLM

AdvancedIF: A Benchmark for Complex and Multi-Turn LLM Instruction Following

Name: AdvancedIF: A Benchmark for Complex and Multi-Turn LLM Instruction Following
Creator: facebook
Published: 2025-11-20T19:49:43
Keywords: Llm Benchmark, Ai Safety, Evaluation, Benchmark, Text, Instruction Following

by facebook·Updated 8mo ago

Available on 1 platform

Description

Facebook introduces AdvancedIF, a benchmark featuring over 1,600 prompts designed to assess large language models. The dataset includes expert-curated rubrics to evaluate proficiency in complex instruction following, multi-turn interactions, and system prompt steerability. It was last updated on November 26, 2025.

Use Cases

Benchmarking LLM performance on complex instructions based on prompts with 6+ combined constraints
Evaluating multi-turn conversational consistency based on instruction-carrying tasks
Testing model steerability based on adherence to system prompts
Analyzing failure modes in instruction following based on the expert-curated rubric

Strengths

Over 1,600 prompts provide a substantial test set
Each prompt contains 6+ instructions combining multiple constraint types
Includes expert-curated evaluation rubrics

Limitations

Column-level documentation is absent; field semantics must be inferred after download
Row count is unknown, which may limit suitability assessment
Last updated 2025-11-26 04:12:14; freshness should be verified

Provenance

Source: facebook
Collection Method: Expert-curated benchmark creation
Freshness: Last updated 2025-11-26 04:12:14

License is unknown and should be verified before use.

Text Llm Benchmark Ai Safety Evaluation Benchmark Instruction Following

Related Datasets

Quality Score

D40

Description

Source

Reputation

Quality Score

D40

Description

Source

Reputation

Access

Community

596 downloads

15 likes

0 views

Dataset Info

Author: facebook
Created: Nov 20, 2025
Updated: Nov 26, 2025
Last synced: Jun 22, 2026

Access

Community

596 downloads

15 likes

0 views

Dataset Info

Author: facebook
Created: Nov 20, 2025
Updated: Nov 26, 2025
Last synced: Jun 22, 2026

AdvancedIF: A Benchmark for Complex and Multi-Turn LLM Instruction Following

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info