Data Science and Machine Learning


Data Science

Machine Learning

Python

pandas

numpy

scikit-learn

Date A Scientist

Mining OkCupid profile data for insights

This is my capstone project for Codecademy’s Machine Learning Fundamentals course. Using OkCupid profile data, I built regression and classification models to predict users' age and body type responses. I used a Jupyter Notebook to process, clean, and explore the data and to build the prediction models, and I created a PDF presentation that summarizes my process, research questions, and findings. The models performed poorly, which I attribute to poor feature selection, low correlation between the features, and the limited number of data points left after dropping rows with not-a-number (NaN) values. Even so, the project received a final score of 18/20, and I was awarded a certificate of completion for the course.
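The workflow described above (drop NaN rows, then fit one regression model for age and one classification model for body type) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the notebook's actual code: the data is randomly generated, the column names `height`, `income`, `age`, and `body_type` are placeholders, and `LinearRegression` and `KNeighborsClassifier` are stand-ins for whichever scikit-learn models the project used.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the profile data; every column and value is invented.
rng = np.random.default_rng(0)
n = 200
profiles = pd.DataFrame({
    "height": rng.normal(68, 4, n),
    "income": rng.choice([20000.0, 50000.0, 80000.0, np.nan], n),
    "age": rng.integers(18, 60, n).astype(float),
    "body_type": rng.choice(["average", "fit", "thin", None], n),
})

# Hypothetical feature choice, for illustration only.
features = ["height", "income"]

# Drop every row with a missing value in a column the models need.
# This is the step that shrinks the usable data set.
clean = profiles.dropna(subset=features + ["age", "body_type"])

# One split shared by both tasks, so the two models see the same rows.
X_train, X_test, age_train, age_test, body_train, body_test = train_test_split(
    clean[features], clean["age"], clean["body_type"],
    test_size=0.25, random_state=0,
)

# Regression: predict age (a number) from the features.
regressor = LinearRegression().fit(X_train, age_train)
r2 = regressor.score(X_test, age_test)  # R^2 on held-out rows

# Classification: predict body type (a category) from the same features.
classifier = KNeighborsClassifier(n_neighbors=5).fit(X_train, body_train)
accuracy = classifier.score(X_test, body_test)  # share of correct labels

print(f"rows kept after dropna: {len(clean)} of {len(profiles)}")
print(f"age regression R^2: {r2:.3f}")
print(f"body type accuracy: {accuracy:.3f}")
```

Because the synthetic features are generated independently of the targets, both scores come out low here, which loosely mirrors the weak-signal problem described above. In a real run, scaling the features before the nearest-neighbors step would likely matter, since `income` and `height` sit on very different scales.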