Psst! You can find my new site over at!

Data Science Book

This is the homepage of the Data Science in Higher Education book. Here you’ll find the datasets for each of the chapters that involve a different machine learning method, and (possibly) updates/errata as needed.


  • Multiple Linear Regression:
    • Pre-Placement Scores With Classes (CSV)
    • Pre-Placement Scores Without Classes (CSV)
  • Logistic Regression:
    • Prevention Program Data (CSV)
  • Naive Bayes Classifier:
    • Marketing Data (CSV)