rahkooy/Car_Price_Prediction

German second-hand car price prediction using six regression models

1-Summary

In this project, six different models are used to predict the price of second-hand cars from the following selected features: year, power (kW), mileage (km) and brand. The models are: Linear Regression, Decision Tree, Bagging, AdaBoost, K-Nearest Neighbours (KNN) and Random Forest.
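As a rough illustration, the six models could be set up with scikit-learn along the following lines; the file name, the column names (`year`, `power_kw`, `mileage_km`, `brand`) and the hyperparameters are assumptions for this sketch, not taken from the repository.

```python
# Minimal sketch of the six regression models (file and column names are assumptions).
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.tree import DecisionTreeRegressor
from sklearn.neighbors import KNeighborsRegressor
from sklearn.ensemble import (BaggingRegressor, AdaBoostRegressor,
                              RandomForestRegressor)

df = pd.read_csv("autos_cleaned.csv")  # hypothetical output of the cleaning/EDA step
X = pd.get_dummies(df[["year", "power_kw", "mileage_km", "brand"]], columns=["brand"])
y = df["price"]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

models = {
    "Linear Regression": LinearRegression(),
    "Decision Tree": DecisionTreeRegressor(random_state=42),
    "Bagging": BaggingRegressor(random_state=42),
    "AdaBoost": AdaBoostRegressor(random_state=42),
    "KNN": KNeighborsRegressor(n_neighbors=5),
    "Random Forest": RandomForestRegressor(n_estimators=100, random_state=42),
}
for name, model in models.items():
    model.fit(X_train, y_train)
    print(f"{name}: test R^2 = {model.score(X_test, y_test):.3f}")
```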

The chosen dataset consists of more than 250,000 samples of used cars in Germany. The project starts with cleaning the data and performing Exploratory Data Analysis (eda.ipynb). The raw data contains a lot of noise and discrepancies, and after EDA the remaining data is roughly half of the original.
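The actual cleaning lives in eda.ipynb; a simplified sketch of the kind of filtering involved might look as follows (the column names and value thresholds are illustrative assumptions).

```python
# Illustrative cleaning sketch (column names and value ranges are assumptions).
import pandas as pd

raw = pd.read_csv("autos.csv")  # hypothetical raw export
df = raw.drop_duplicates()
df = df.dropna(subset=["year", "power_kw", "mileage_km", "brand", "price"])
# Drop implausible listings, which account for much of the noise.
df = df[df["price"].between(500, 150_000)
        & df["year"].between(1990, 2023)
        & df["power_kw"].between(20, 500)]
df.to_csv("autos_cleaned.csv", index=False)
print(f"kept {len(df)} of {len(raw)} rows")
```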

Several error metrics are measured for each model on training and test data using cross-validation. The metrics computed for all models are mean absolute error, median absolute error, and the R-squared score. Accuracy is also computed for certain relevant models.
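A sketch of how such metrics can be collected with scikit-learn's cross-validation, continuing from the model sketch above (the metric keys are standard scikit-learn scoring strings).

```python
# Cross-validated errors for each model (reuses `models`, `X_train`, `y_train` from above).
from sklearn.model_selection import cross_validate

scoring = {
    "mae": "neg_mean_absolute_error",
    "medae": "neg_median_absolute_error",
    "r2": "r2",
}
for name, model in models.items():
    cv = cross_validate(model, X_train, y_train, cv=5, scoring=scoring)
    print(f"{name}: MAE={-cv['test_mae'].mean():.0f}, "
          f"MedAE={-cv['test_medae'].mean():.0f}, R^2={cv['test_r2'].mean():.3f}")
```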

For visualisation, scatter plots comparing test (actual) and predicted prices are produced for all models.
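Such plots could be produced roughly as follows, continuing from the fitted models above; the layout details are assumptions.

```python
# Actual-vs-predicted scatter plots, one panel per model (continues from the sketches above).
import matplotlib.pyplot as plt

fig, axes = plt.subplots(2, 3, figsize=(15, 8), sharex=True, sharey=True)
for ax, (name, model) in zip(axes.ravel(), models.items()):
    y_pred = model.predict(X_test)
    ax.scatter(y_test, y_pred, s=2, alpha=0.3)
    lims = [y_test.min(), y_test.max()]
    ax.plot(lims, lims, "r--")  # ideal prediction line
    ax.set_title(name)
    ax.set_xlabel("actual price")
    ax.set_ylabel("predicted price")
fig.tight_layout()
plt.show()
```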

2-Model Comparison

The best performance is achieved by KNN and Random Forest, judging by their R-squared scores as well as their mean absolute and median absolute errors. This may be due to the diversity and scatter of the data: linear regression, decision tree regression, bagging and AdaBoost draw on a large share of the samples when forming a prediction, whereas KNN and Random Forest rely on the nearest (most similar) samples, effectively filtering out many of the samples the other models use.

3-Recommendations and Future Work

-Feature Engineering via additional features and feature interactions.

-Data Augmentation via web scraping of online used-car platforms such as immoscout.

-Model Optimization: hyperparameter tuning by grid search or random search with cross-validation (see the sketch after this list).

-Using other ensemble methods such as Gradient Boosting, XGBoost, or LightGBM.

-Advanced Machine Learning Techniques such as Neural Networks.
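For the hyperparameter tuning mentioned above, a grid search with cross-validation could look like this, shown for Random Forest only; the parameter grid is an illustrative assumption, and `X_train`/`y_train` are taken from the model sketch above.

```python
# Grid search with 5-fold cross-validation (parameter grid values are assumptions).
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV

param_grid = {
    "n_estimators": [100, 200, 400],
    "max_depth": [None, 10, 20],
    "min_samples_leaf": [1, 5, 10],
}
search = GridSearchCV(RandomForestRegressor(random_state=42), param_grid,
                      cv=5, scoring="neg_mean_absolute_error", n_jobs=-1)
search.fit(X_train, y_train)
print("best parameters:", search.best_params_)
print("best CV MAE:", -search.best_score_)
```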
