Welcome to my data science projects repository! This repository contains a collection of my data science projects, showcasing my skills and expertise in the field. Each project demonstrates different aspects of data analysis, machine learning, and visualization.
- Description : This project predicts passenger survival in the Titanic disaster from socio-economic passenger data (classification problem).
- Tools used : data cleaning, feature engineering, one-hot encoding, feature selection and classifier fitting.
- Results : The best classifier is a Random Forest, with a training accuracy of 0.98 and an F1-score of 0.98. The Kaggle submission scores 0.77 on the test set.
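
To make the one-hot encoding step and the reported metrics concrete, here is a minimal standard-library sketch. It is not the project's code: the function names and the toy values are hypothetical, chosen only for illustration. In practice, scikit-learn's `OneHotEncoder` and `f1_score` cover the same ground.

```python
def one_hot(values):
    """Turn categorical values into 0/1 indicator rows (columns sorted by category)."""
    categories = sorted(set(values))
    rows = [[1 if v == c else 0 for c in categories] for v in values]
    return categories, rows


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy data, not taken from the project.
ports = ["S", "C", "S", "Q"]
categories, encoded = one_hot(ports)

y_true = [1, 0, 1, 1, 0, 0]  # 1 = survived
y_pred = [1, 0, 0, 1, 1, 0]

acc = accuracy(y_true, y_pred)
f1 = f1_score(y_true, y_pred)
```

Computing the same two metrics on held-out data as well as on the training data is what makes a gap like 0.98 versus 0.77 visible.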
- Description : This project predicts the sale price of residential homes in Ames, Iowa, USA, based on various house characteristics (regression problem). A Streamlit app supports exploratory data analysis.
- Tools used :
- Results :