Loading...
Loading...
Disease surveillance, vaccination data, epidemiology, health system capacity, mortality statistics
2,873 datasets
Alan Fernihough's mfx package estimates marginal effects, odds ratios, and incidence rate ratios for generalized linear models. The output likely contains calculated statistical measures for models including probit, logit, Poisson, negative binomial, and beta regression. The methodology is based on a textbook introduction from Greene (2008, pp. 780-7).
U.S. Centers for Disease Control and Prevention (CDC) data documents recent outbreaks of syphilis among men who have sex with men (MSM) and increases in newly diagnosed HIV infections among MSM and heterosexuals. The dataset likely contains epidemiological indicators and programmatic information related to HIV prevention strategies launched in 2003. It was authored by the US Department of Health and Human Services and CDC.
Country-level data related to vaccination. The dataset is hosted on Kaggle, a platform for data science and machine learning projects. The specific source, collection method, and temporal coverage are not detailed in the provided metadata.
Civil birth registration data from the City of New York ranks baby names by frequency, segmented by sex and ethnic group. The dataset tracks name popularity over time, with rankings derived from official vital statistics. Users should exercise caution when a name's frequency count is near 10, as its rank may fluctuate annually.
Subhayu Bandyopadhyay's research paper analyzes the determinants of official development assistance in the post-Cold War era. It estimates the relationship between aid and recipient country metrics like per capita income, infant mortality, civil/political rights, and government effectiveness. The study was published in the Federal Reserve Bank of St. Louis Review in November/December 2007.
A dataset for developing fraud detection models in health insurance. The data is synthetic, likely generated to simulate patterns of fraudulent and legitimate claims. It is hosted on Kaggle, but the specific author, size, and creation date are unknown.
Daily aggregated counts of confirmed COVID-19 cases and deaths reported to the World Health Organization. The data covers the period from 31 December 2019 to 18 October 2020, with a reported total of 39,596,858 confirmed cases and 1,107,374 deaths as of the last update. The WHO Emergency Health Panel compiled the data from official communications, ministry of health websites, and social media accounts.
A time series of cumulative coronavirus cases compiled by The New York Times from state and local governments. The data covers the United States at national, state, and county geographic levels. The specific snapshot in this deposit was scraped by ICUSR staff on November 22, 2020.
A microsimulation analysis commissioned by the Greater London Authority in late 2020 to assess the impact of the Covid-19 crisis and policy responses on Londoners. Researchers from the University of Essex used the UKMOD tax-benefit model to evaluate emergency policies and counterfactual options like the £20 weekly uplift in Universal Credit. The dataset was last updated on the platform in March 2026.
IFF Research's final report from May 2022 covers the evaluation of the Adult Education Budget (AEB) in the 2020-21 Academic Year. The evaluation highlights the Mayor's management of London's AEB in its first two years of delegation, a period heavily affected by the Covid-19 pandemic. The dataset is associated with the Greater London Authority and was last updated on the platform in March 2026.
Reports from 2020-2021 to 2025-2026 analyze annual public health grants allocated to London's local authorities. The data is produced by City Intelligence at the Greater London Authority, examining funding following the 2013/14 devolution of public health responsibility. It focuses on local authorities' role in improving population health and reducing health inequalities.
socialmixr is an R package providing methods for sampling social contact matrices from diary survey data, as used in infectious disease modeling. The package is authored by Sebastian Funk and implements the methodology discussed in the foundational Mossong et al. (2008) study. The dataset's specific size, columns, and temporal coverage are not detailed in the provided metadata.
EpiEstim is a tool for estimating time-varying reproduction numbers (Rt) from epidemic incidence time series. It implements methods described in Cori et al. (2013) and Wallinga and Teunis (2004). The dataset's author is Anne Cori, and it is hosted on the paperswithcode platform.
A 1994 CDC-convened workshop addressed the public health threat of waterborne cryptosporidiosis. Representatives from 40 states, regulatory agencies, water utilities, and advocacy groups discussed surveillance, public health responses, impacts on immunocompromised persons, and water sampling methods. The report summarizes work group conclusions on current knowledge and strategies for managing these problems.
ACDC is a dataset of cardiac MRI images, likely containing scans of the heart. It is hosted on Kaggle, a platform for data science competitions and projects. The dataset's specific size, collection date, and originating organization are not provided in the available metadata.
A list of public events sponsored or supported by the Chicago Department of Public Health. The dataset is provided by the City of Chicago and was last updated on March 22, 2026. It is available in multiple formats including RDF, JSON, CSV, and XML.
BioGPT-Mortality is a dataset published on Kaggle. Its title suggests it contains text data likely related to biomedical language modeling and mortality outcomes. Metadata is minimal; actual content requires verification after download.
Mortality data related to drug use in the state of Maine. The dataset was authored by Marcella H Sorg and covers the period from 1997 to 2002. Its specific content and structure require verification after download.
2001-2003 survey data on mental health epidemiology in the United States. The dataset is associated with the Collaborative Psychiatric Epidemiology Surveys (CPES) and was published on paperswithcode. The author is listed as Margarita Alegría.
Federal data indicates the uninsured rate has been rising since 2016 and increased again in 2019. Analyses show consumers struggled with coverage affordability before the COVID-19 pandemic, with reports suggesting a deepening crisis in 2020. The dataset likely contains time-series metrics on insurance coverage and affordability.