Harmonized U.S. Workplace Injury Data from Four Public Sources, 2015–2024
by Bauder, Richard / Workplace Injury Harmonization Dataverse · Updated 2mo ago
Available on 1 platform
Description
A harmonized table integrates four public U.S. incident- and claim-level workplace injury sources: OSHA Severe Injury Reports, OSHA Injury Tracking Application, Oregon Workers’ Compensation, and New York State Workers’ Compensation. The dataset package includes the harmonized data, schema, mapping files, and processing documentation. The harmonized fields support constrained cross-dataset analysis of broad injury characteristics, including injury date, industry context, and event category.
Use Cases
Exploratory analysis of injury trends based on harmonized fields like injury date and industry context.
Building machine learning models for injury event categorization based on the harmonized event and nature-of-injury categories.
Comparative studies of injury reporting patterns across different administrative sources using the source dataset provenance field.
Demonstrating data harmonization techniques for public health or labor economics research.
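As a starting point for the exploratory and source-comparison use cases above, the following minimal sketch counts records per source and injury year using only the Python standard library. The column names (source_dataset, injury_date, event_category) and the sample rows are assumptions made for illustration; they are not taken from the dataset, so check the real field names in the package's data dictionary before adapting it.

```python
import csv
import io
from collections import Counter
from datetime import date

# Hypothetical column names; confirm them against the package's data dictionary.
SOURCE_COL = "source_dataset"
DATE_COL = "injury_date"

# Tiny made-up sample standing in for the harmonized table (not real records).
SAMPLE_CSV = """source_dataset,injury_date,event_category
SIR,2019-03-14,Falls
ITA,2019-07-02,Contact with objects
OR_WC,2020-01-21,Overexertion
NY_WC,2020-11-05,Falls
SIR,2020-05-30,Transportation
"""


def count_by_source_and_year(rows):
    """Count records per (source, injury year), skipping rows without a usable date."""
    counts = Counter()
    for row in rows:
        try:
            source = row[SOURCE_COL]
            year = date.fromisoformat(row[DATE_COL]).year
        except (KeyError, TypeError, ValueError):
            # Missing column, empty cell, or a date that is not YYYY-MM-DD.
            continue
        counts[(source, year)] += 1
    return counts


counts = count_by_source_and_year(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for (source, year), n in sorted(counts.items()):
    print(source, year, n)
```

To run this against the real file, replace io.StringIO(SAMPLE_CSV) with an opened file handle. The counts describe reporting volume per source, not injury risk.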
Strengths
Harmonizes four distinct public U.S. data sources into a single table for constrained cross-dataset analysis.
Includes comprehensive supporting documentation such as a data dictionary, taxonomy mapping files, and processing provenance.
Covers 2015 through 2024, which is ten calendar years if both endpoint years are included in full.
Limitations
Row count and file size are not stated, which makes it hard to assess suitability before download.
Column-level documentation is not shown in this listing; field semantics must be confirmed against the packaged schema and data dictionary after download.
The authors caution against direct injury-rate comparisons across sources without additional adjustments.
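The caution about rate comparisons can be illustrated with a short sketch. It assumes a per-100,000 rate computed from an injury count and a count of covered workers. The function name, the two placeholder sources, and every number are invented for illustration; the dataset itself does not supply denominators.

```python
def rate_per_100k(injury_count, covered_workers):
    """Injuries per 100,000 covered workers; the denominator must be positive."""
    if covered_workers <= 0:
        raise ValueError("covered_workers must be positive")
    return injury_count * 100_000 / covered_workers


# Made-up numbers for illustration only; none of them come from the dataset.
examples = {
    "source_a": {"injuries": 500, "covered_workers": 2_000_000},
    "source_b": {"injuries": 500, "covered_workers": 250_000},
}

rates = {
    name: rate_per_100k(v["injuries"], v["covered_workers"])
    for name, v in examples.items()
}
for name, rate in rates.items():
    print(name, rate)
```

In this sketch the two placeholder sources have identical counts but rates of 25 and 200 per 100,000, so raw counts alone say nothing about relative risk. Real comparisons would also need to account for differences in what each source requires to be reported.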
Provenance
Source
Four public U.S. sources: OSHA Severe Injury Reports (SIR), OSHA Injury Tracking Application (ITA), Oregon Workers’ Compensation (OR_WC), New York State Workers’ Compensation (NY_WC).
Collection Method
Value-added harmonization of incident- and claim-level records, documented in a processing notebook.
Time Range
2015–2024
Freshness
Last updated 2026-04-26 23:10:42; freshness should be verified.
Geography
United States; the two workers’ compensation sources are specific to Oregon and New York State.
License is unknown. Users must not use the data for direct injury-rate or population-risk comparisons across sources without additional denominators and jurisdiction-specific interpretation.