Hospitalized Breast Cancer Patient Records from Zhuhai, China (2004-2024)
by Ruixin Fan·Updated 3mo ago
1.4 MB1files
Available on 1 platform
Sign in to view source links and access this dataset
Description
Zhuhai, China, provides the geographic scope for this retrospective study of 5,052 hospitalized breast cancer patients from 2004 to 2024. The dataset includes demographics, clinical characteristics, treatment patterns, costs, and outcomes, revealing trends like a 406% hospitalization surge in 2022 and a significant decrease in median length of stay. It was authored by Ruixin Fan and published on figshare.
Use Cases
Analyze treatment patterns over time, such as the increase in breast-conserving surgery rates to 21.7% in 2024 and trends in neoadjuvant chemotherapy usage.
Model the relationship between clinical characteristics (e.g., tumor stage, age) and treatment outcomes, using logistic regression results like the OR of 5.87 for adjuvant chemotherapy predicting cure.
Examine healthcare cost dynamics, including the peak total cost of ¥31,201 in 2019 and the subsequent 58.9% decrease, alongside shifts in out-of-pocket expense percentages.
Investigate demographic and insurance coverage patterns, such as the 70.3% of patients covered by rural resident basic medical insurance and the 99.5% female cohort.
Study hospitalization trends, including the 406% surge in 2022 and the reduction in median length of stay from 16.9 to 4.8 days over the study period.
Strengths
Contains records for 5,052 patients, providing a substantial sample for analysis.
Covers a long-term, 20-year time range from 2004 to 2024, enabling longitudinal trend analysis.
Includes multiple analytically rich data facets: demographics, clinical staging, treatment types, costs, and patient outcomes.
Published under a permissive CC BY 4.0 license, allowing for broad reuse and sharing.
Limitations
The dataset is specific to a single hospital in Zhuhai, China, limiting generalizability to other regions or healthcare systems.
Exact column names and the total number of rows within the spreadsheet file are unknown, which may complicate initial data exploration.
As a 1.4 MB XLSX file, the dataset is relatively small in scale, which may limit the complexity of models that can be trained directly on it.
Provenance
Source
figshare, authored by Ruixin Fan.
Collection Method
Retrospective analysis of hospital records.
Time Range
2004 to 2024.
Freshness
The dataset metadata was last updated on 2026-03-25, indicating recent availability.
Geography
A regional medical center in Zhuhai, Southern China.
The specific column structure within the XLSX file is unknown; users should inspect the file upon download to understand the data schema. The 1.4 MB file size indicates a small-scale dataset.