9 million Twitter/X posts are linked to 11,952 Polymarket prediction markets from 2021 to 2026. The dataset, created by mlsys-io, provides a multimodal benchmark with seven tasks spanning resolution and forecast tiers. It was last updated on April 8, -2026.
Use Cases
- Train models for post-to-market linking based on the human-annotated resolution tier.
- Benchmark evidence grading models based on the annotated evidence grading task.
- Predict market volume based on the deterministic forecast tier labels.
- Predict market movement based on the market movement prediction task.
- Analyze the persistence of social media impact on prediction markets.
Strengths
- Links a large-scale corpus of 9 million social media posts.
- Connects posts to 11,952 distinct prediction market events.
- Provides a structured benchmark with seven specific tasks.
- Includes human-annotated labels for resolution tasks.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Last updated 2026-04-08 09:13:02; freshness should be verified.
Provenance
- Source
- huggingface
- Collection Method
- Likely aggregated from Twitter/X and Polymarket APIs.
- Time Range
- 2021 to 2026
- Freshness
- Last updated 2026-04-08 09:13:02.
- Geography
- null