This dataset contains deidentified individual patient data from a regional retrospective cohort in the endemic southern region of Argentina, used to develop and evaluate a case-fatality risk assessment model for New World hantavirus infection. It includes clinical, epidemiologic, laboratory, and radiologic predictors, an outcome variable, and risk scores. A data dictionary and analysis code are provided by the author, Fernando Tortosa, via Harvard Dataverse.
Use Cases
- Developing predictive models for hantavirus pulmonary syndrome case-fatality based on clinical and laboratory predictors.
- Validating evidence-informed risk scores using observed risk categories and outcome data.
- Conducting descriptive epidemiological analyses of hantavirus infection in a regional cohort.
- Reproducing score calculation and analysis workflows using the provided data dictionary and code.
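The score-validation use case can be sketched as a comparison of observed case-fatality across risk categories. The following is a minimal illustration, not the author's analysis code: the field names `risk_category` and `died` are hypothetical placeholders (the real names must be taken from the data dictionary), and the rows are synthetic.

```python
from collections import defaultdict


def fatality_by_category(rows, category_key="risk_category", outcome_key="died"):
    """Return {category: (deaths, total, observed_case_fatality)}.

    `category_key` and `outcome_key` are hypothetical field names; replace
    them with the names given in the dataset's data dictionary.
    """
    counts = defaultdict(lambda: [0, 0])  # category -> [deaths, total]
    for row in rows:
        category = row[category_key]
        counts[category][0] += int(row[outcome_key])  # outcome coded 1 = died
        counts[category][1] += 1
    return {c: (d, n, d / n) for c, (d, n) in counts.items()}


# Synthetic rows for illustration only; these are not values from the dataset.
example_rows = [
    {"risk_category": "low", "died": 0},
    {"risk_category": "low", "died": 0},
    {"risk_category": "high", "died": 1},
    {"risk_category": "high", "died": 0},
]

summary = fatality_by_category(example_rows)
for category, (deaths, total, rate) in sorted(summary.items()):
    print(f"{category}: {deaths}/{total} deaths, observed case-fatality {rate:.2f}")
```

If the score separates risk well, observed case-fatality should rise from the lowest to the highest category. With a small regional cohort, per-category counts may be low, so the rates are best reported with their denominators.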
Strengths
- Includes a data dictionary and analysis code to support reproducibility of score calculation.
- Contains multiple predictor types: clinical, epidemiologic, laboratory, and radiologic variables.
- Focuses on a specific infectious disease (New World hantavirus) in a defined endemic region.
Limitations
- Row count is not reported, so sample size and suitability for modeling cannot be assessed before download.
- Column-level documentation is not included in this summary; field semantics must be checked against the provided data dictionary after download.
- Data may reflect geographic bias inherent to the regional cohort from southern Argentina.
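Because the row count and column details are not known in advance, a first step after download is a quick structural check. The sketch below assumes the data file is comma-delimited text, which is unconfirmed (Dataverse tabular files are often tab-delimited, in which case pass `delimiter="\t"`). The sample content and the column names `age` and `died` are hypothetical.

```python
import csv
import io


def describe_table(handle, delimiter=","):
    """Return (row_count, column_names, blank_counts) for a delimited text file.

    `handle` is any open text file object. `blank_counts` maps each column
    name to the number of rows where that field is empty.
    """
    reader = csv.DictReader(handle, delimiter=delimiter)
    columns = list(reader.fieldnames or [])
    blank_counts = {name: 0 for name in columns}
    row_count = 0
    for row in reader:
        row_count += 1
        for name in columns:
            if (row.get(name) or "").strip() == "":
                blank_counts[name] += 1
    return row_count, columns, blank_counts


# Synthetic stand-in for the downloaded file; not real dataset content.
sample = io.StringIO("age,died\n34,0\n,1\n")
row_count, columns, blank_counts = describe_table(sample)
print(row_count, columns, blank_counts)
```

For the real file, open it with `open(path, newline="", encoding="utf-8")` and pass the handle to `describe_table`, then compare the column names against the data dictionary.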
Provenance
- Source: Harvard Dataverse, author Fernando Tortosa.
- Collection Method: Derived from a regional retrospective cohort of patients.
- Freshness: Last updated 2026-05-12 17:07:44; freshness should be verified.
- Geography: Endemic southern region of Argentina.