Titanic: Passenger Survival Data for Machine Learning
Available on 1 platform
Sign in to view source links and access this dataset
Description
A dataset from Kaggle's introductory machine learning competition. The data likely contains passenger information from the Titanic disaster, such as name, age, class, and survival status, to predict which passengers survived. The platform tags indicate it is a tabular dataset with a train-test split.
Use Cases
Train a binary classifier to predict passenger survival (inferred from domain, verify after download)
Practice feature engineering on categorical and numerical passenger attributes (inferred from domain, verify after download)
Benchmark and compare different classification algorithms (inferred from domain, verify after download)
Strengths
Published on Kaggle, a platform with established data practices.
Platform tags confirm it is a structured, tabular dataset with a train-test split.
Limitations
Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and license are unknown, which may limit suitability assessment.
Provenance
Source
Kaggle
Collection Method
Likely compiled from historical records for a machine learning competition.
Time Range
Data pertains to the 1912 Titanic disaster.
Freshness
Last updated date is unknown; freshness unverified.
Geography
Primarily concerns passengers aboard the RMS Titanic.
License is unknown; users should verify terms before commercial use.