COVID-19 Case Surveillance: Deidentified U.S. Patient-Level Public Health Data
arff
Available on 1 platform
Description
The COVID-19 Case Surveillance Public Use Dataset contains individual-level data reported to U.S. states, territories, and other jurisdictions and voluntarily shared with the CDC. The deidentified data includes demographic characteristics, exposure history, disease severity indicators, outcomes, clinical data, laboratory test results, and comorbidities. Data collection was formalized by a Council of State and Territorial Epidemiologists position statement in April 2020.
Use Cases
Analyzing trends in COVID-19 cases and deaths by reported demographic characteristics.
Modeling disease severity and outcomes based on clinical data and comorbidities.
Studying exposure history patterns to understand transmission routes.
Evaluating laboratory diagnostic test results in relation to patient outcomes.
Strengths
Data is sourced from a national public health surveillance system coordinated by the CDC.
Includes a wide range of deidentified patient-level features such as demographics, clinical data, and outcomes.
Limitations
Row count and last update date are not stated, and the only format indication is the listing's arff tag, which makes suitability hard to assess before download.
Column-level documentation is absent; after download, field semantics must be inferred from the case report form.
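Because field documentation is missing, a reasonable first step after download is to list the declared fields and their types. The sketch below assumes the file really is ARFF, as the listing's tag suggests, and reads only the header. The helper name `list_arff_attributes` and the sample field names (`age_group`, `death_yn`) are hypothetical illustrations, not the dataset's documented schema, and the parser is a minimal one rather than a full ARFF implementation.

```python
import io


def list_arff_attributes(stream):
    """Return (name, type) pairs declared in the header of an ARFF text stream.

    Minimal sketch: reads @attribute lines and stops at @data.
    Assumes well-formed declarations; not a full ARFF parser.
    """
    attributes = []
    for raw in stream:
        line = raw.strip()
        if not line or line.startswith("%"):  # skip blank lines and comments
            continue
        keyword = line.split(None, 1)[0].lower()
        if keyword == "@data":  # header ends where the data section begins
            break
        if keyword != "@attribute":
            continue
        rest = line[len("@attribute"):].strip()
        if rest[0] in "'\"":
            # Quoted attribute name: take everything up to the closing quote.
            end = rest.index(rest[0], 1)
            name, type_ = rest[1:end], rest[end + 1:].strip()
        else:
            name, type_ = rest.split(None, 1)
        attributes.append((name, type_))
    return attributes


# Hypothetical header for illustration only; real field names are undocumented here.
sample = io.StringIO(
    "% illustrative header, not the real schema\n"
    "@relation covid_cases\n"
    "@attribute age_group {'0 - 9 Years','10 - 19 Years'}\n"
    "@attribute 'death_yn' {Yes,No,Missing,Unknown}\n"
    "@data\n"
    "'0 - 9 Years',No\n"
)
fields = list_arff_attributes(sample)
for name, type_ in fields:
    print(name, "->", type_)
```

On a downloaded file, the same function can be called on `open(path, encoding="utf-8")`, where `path` is wherever the file was saved. The nominal value sets it reports can then be matched against the answer options on the case report form.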
Provenance
Source
U.S. Centers for Disease Control and Prevention (CDC).
Collection Method
Voluntarily reported case notifications from U.S. states, territories, and other jurisdictions.
Time Range
Data collection was formally initiated in April 2020.
Freshness
Last update date is unknown; freshness is unverified.
Geography
United States, including states, territories, New York City, and the District of Columbia.