This dataset aggregates 1,365 observations for a predictive modeling exercise on income mobility across three generations, using data from the Panel Study of Income Dynamics (PSID). The task is to predict log income in generation 3 from the log incomes of the prior generations, education levels, race, and sex. Separate learning and holdout files are provided for model training and evaluation.
The dataset is structured for a class exercise, with separate files for students (learning.csv, holdout_public.csv) and instructors (holdout_private.csv). To preserve the integrity of the predictive task, students must not access the private holdout file. The original data processing code is available in for_replication.zip.
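The train-then-predict workflow described above could be sketched as follows. This is a minimal baseline, not the exercise's prescribed method: it fits ordinary least squares on the learning data and predicts for the holdout rows. The column names (`g3_log_income`, `g1_log_income`, `g2_log_income`, `g3_educ`, `race`, `sex`) are assumptions made for illustration, since the actual headers of the CSV files are not listed here; substitute the real ones before use.

```python
import numpy as np
import pandas as pd

# Hypothetical column names: replace with the actual headers in learning.csv.
TARGET = "g3_log_income"
NUMERIC = ["g1_log_income", "g2_log_income"]
CATEGORICAL = ["g3_educ", "race", "sex"]


def design_matrix(df, columns=None):
    """Build an intercept + numeric + one-hot-encoded design matrix.

    If `columns` is given, the result is aligned to it, so a holdout frame
    gets exactly the columns that were seen when fitting.
    """
    X = pd.get_dummies(df[NUMERIC + CATEGORICAL], columns=CATEGORICAL, dtype=float)
    X.insert(0, "intercept", 1.0)
    if columns is not None:
        # Categories missing from this frame become all-zero columns;
        # categories unseen during fitting are dropped.
        X = X.reindex(columns=columns, fill_value=0.0)
    return X


def fit_ols(learning):
    """Fit least squares on the learning data; return named coefficients."""
    X = design_matrix(learning)
    y = learning[TARGET].to_numpy(dtype=float)
    # The full one-hot encoding is collinear with the intercept; lstsq returns
    # the minimum-norm solution, which yields the same fitted values.
    beta, *_ = np.linalg.lstsq(X.to_numpy(dtype=float), y, rcond=None)
    return pd.Series(beta, index=X.columns)


def predict(coefs, df):
    """Predict generation-3 log income for the rows of `df`."""
    X = design_matrix(df, columns=coefs.index)
    return X.to_numpy(dtype=float) @ coefs.to_numpy(dtype=float)


def mean_squared_error(y_true, y_pred):
    """Mean squared prediction error on the log-income scale."""
    diff = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.mean(diff ** 2))
```

With the files in the working directory, one would call `fit_ols(pd.read_csv("learning.csv"))` and pass the resulting coefficients together with `pd.read_csv("holdout_public.csv")` to `predict`. Whether `mean_squared_error` can be computed locally depends on whether the public holdout file includes the outcome column, which is not stated here.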