Enhancing the Census Income Prediction Dataset: Social Justice in Machine Learning Pedagogy
The UCI "Adult" dataset was created in 1996 and yet is still used to this day to teach machine learning. While it is clean and unfussy, it perpetuates some outdated ideas about income and race. We created a new open-source income prediction dataset along with supplemental materials that include conversations about equity and data.