This is the homepage of the Data Science in Higher Education book. Here you’ll find the datasets for each of the chapters that involve a different machine learning method, and (possibly) updates/errata as needed.


Multiple Linear Regression:

  • Pre-Placement Scores With Classes (CSV)
  • Pre-Placement Scores Without Classes (CSV)

Logistic Regression:

  • Prevention Program Data (CSV)

Naive Bayes Classifier:

  • Marketing Data (CSV)