This dataset comprises 37,954 articles from eight leading newspapers in Italy, Spain, the United Kingdom, and the United States, published between 2018 and 2024. David García-García created it for a study that used a large language model to identify cited organizations and classify coverage tone. The study reports patterns in how economic actors, NGOs, academics, and trade unions are represented across different media systems.
Use Cases
- Compare media system influence on interest group representation based on the cross-national newspaper analysis.
- Analyze the dominance of economic actors in AI policy debates based on the reported 85% appearance rate.
- Study editorial differentiation across outlets by comparing how their sourcing profiles diverge.
- Investigate the tone of coverage for specific groups such as trade unions, whose coverage is described as falling into negative territory in Spain and Italy.
Strengths
- 37,954 articles provide a substantial corpus for analysis.
- Data spans the period from 2018 to 2024.
- Cross-national coverage includes newspapers from four distinct countries.
- Analysis includes identified organizations and classified tone.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- The row count of the downloadable files is not stated and may differ from the 37,954 articles cited in the description, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
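Because column names and row counts are undocumented, a first inspection pass after download can be sketched as below. This is a minimal example, not part of the dataset's own tooling: it assumes the data is distributed as a UTF-8 delimited text file, and the function name `summarize_table` and the file name `articles.csv` used afterwards are hypothetical.

```python
import csv
from collections import Counter


def summarize_table(path, delimiter=","):
    """Return the row count, column names, and empty-cell counts per column.

    Assumes a UTF-8 delimited text file with a header row (an assumption,
    since the actual file format is not documented).
    """
    empty = Counter()
    rows = 0
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle, delimiter=delimiter)
        columns = list(reader.fieldnames or [])
        for record in reader:
            rows += 1
            for name in columns:
                value = record.get(name)
                # Short rows yield None; blank cells yield "" or whitespace.
                if value is None or value.strip() == "":
                    empty[name] += 1
    return {
        "rows": rows,
        "columns": columns,
        "empty": {name: empty[name] for name in columns},
    }
```

Calling `summarize_table("articles.csv")` on a downloaded file would return the observed row count, which can be compared with the 37,954 articles cited in the description, along with the column names whose semantics still need to be inferred by hand.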
Provenance
- Source: Journal of Public Policy Dataverse
- Collection Method: Articles collected from eight leading newspapers and analyzed with a large language model.
- Time Range: 2018-2024
- Freshness: Last updated 2026-04-20 03:39:57; freshness should be verified.
- Geography: Italy, Spain, United Kingdom, United States