A classification model capable of accurately predicting the price of secondhand cars

Last update: Sep 13, 2022

Overview

Title: Secondhand-Car-Price-Predictor

-- Project Status: [Completed ✅ ]

Project Intro/Objective

The purpose of this project is create a classification model capable of accurately predicting the price of secondhand cars. The data used for model building is open source and has been added to this repository. Most packages used are usually pre-installed in most developed environments and tools like collab, jupyter, etc. This can be useful for people looking to enhance the way the code their predicitve models and efficient ways to deal with tabular data!

Methods Used

Inferential Statistics
Machine Learning
Feature Engineering
Predictive Modeling
Deep Learning
Data Visualization
Classification

Technologies

Python
Pandas, TensorFlow, SkLearn
Collab

Project Description

This Notebook is based off an open source dataset available on www.kaggle.com where I have created models to predict selling price of second hand cars on the basis of various parameters and attributes! The best score was 92.57% with the best MSE being around 4900
All models are subject to betterment with more stringent hyper-parameter tuning. This can be achieved by random selection, brute force methods, etc. Various other classifiers can also be used, but the most standard classifiers have been considered in this notebook.
Recommend standard practices for data transformation, outlier detection, and null value substitution have been incorporated in this notebook.
Good visualizations have also been shown in the notebook for explaining the importance and significance of certain parameters. It can be easily understood by people coming from non-technical backgrounds. Various parameter tuning and scaling methods are shown that helped me achieve enhanced results!
Recommend standard practices for data transformation, outlier detection, and null value substitution have been incorporated in this notebook.
This code has been UPVOTED by 10 People, Including Kaggle Grandmasters (Highly recognised people for their achievements in the data science Community). I have received a bronze medal for my code in the community.

Getting Started

One can simply download the notebook and dataset, open in platforms like Jupyter, Collab, and Run each cell to see results! This Python 3 environment comes with many helpful analytics libraries installed It is defined by the kaggle/python Docker image: https://github.com/kaggle/docker-python For example, here's several helpful packages to load

import numpy as np # linear algebra import pandas as pd # data processing, CSV file I/O (e.g. pd.read_csv)

Input data files are available in the read-only "../input/" directory For example, running this (by clicking run or pressing Shift+Enter) will list all files under the input directory

import os for dirname, _, filenames in os.walk('/kaggle/input'): for filename in filenames: print(os.path.join(dirname, filename))

Contact

Feel free to contact team leads with any questions or if you are interested in contributing!
https://www.linkedin.com/in/akarshsinghh/

A classification model capable of accurately predicting the price of secondhand cars

Related tags

Overview

Title: Secondhand-Car-Price-Predictor

-- Project Status: [Completed ✅ ]

Project Intro/Objective

Methods Used

Technologies

Project Description

Getting Started

Contact

Owner

Akarsh Singh

A unified framework for machine learning with time series

Titanic Traveller Survivability Prediction

ML Kaggle Titanic Problem using LogisticRegrission

Scikit-learn compatible wrapper of the Random Bits Forest program written by (Wang et al., 2016)

This is a public repo where code samples are stored for the book Practical MLOps.

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。它的特点包括: 效果出色、简单易用、通用、自动化、灵活。

Python factor analysis library (PCA, CA, MCA, MFA, FAMD)

Iris species predictor app is used to classify iris species created using python's scikit-learn, fastapi, numpy and joblib packages.

List of Data Science Cheatsheets to rule the world

Exemplary lightweight and ready-to-deploy machine learning project

QML: A Python Toolkit for Quantum Machine Learning

Machine learning algorithms implementation

ml4h is a toolkit for machine learning on clinical data of all kinds including genetics, labs, imaging, clinical notes, and more

Greykite: A flexible, intuitive and fast forecasting library

PySurvival is an open source python package for Survival Analysis modeling

Python bindings for MPI

ThunderGBM: Fast GBDTs and Random Forests on GPUs

Skforecast is a python library that eases using scikit-learn regressors as multi-step forecasters

A basic Ray Tracer that exploits numpy arrays and functions to work fast.

Neighbourhood Retrieval (Nearest Neighbours) with Distance Correlation.

A classification model capable of accurately predicting the price of secondhand cars

Related tags

Overview

Title: Secondhand-Car-Price-Predictor

-- Project Status: [Completed ✅ ]

Project Intro/Objective

Methods Used

Technologies

Project Description

Getting Started

Contact

Owner

Akarsh Singh

A unified framework for machine learning with time series

Titanic Traveller Survivability Prediction

ML Kaggle Titanic Problem using LogisticRegrission

Scikit-learn compatible wrapper of the Random Bits Forest program written by (Wang et al., 2016)

This is a public repo where code samples are stored for the book Practical MLOps.

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。 它的特点包括: 效果出色、简单易用、通用、自动化、灵活。

Python factor analysis library (PCA, CA, MCA, MFA, FAMD)

Iris species predictor app is used to classify iris species created using python's scikit-learn, fastapi, numpy and joblib packages.

List of Data Science Cheatsheets to rule the world

Exemplary lightweight and ready-to-deploy machine learning project

QML: A Python Toolkit for Quantum Machine Learning

Machine learning algorithms implementation

ml4h is a toolkit for machine learning on clinical data of all kinds including genetics, labs, imaging, clinical notes, and more

﻿Greykite: A flexible, intuitive and fast forecasting library

PySurvival is an open source python package for Survival Analysis modeling

Python bindings for MPI

ThunderGBM: Fast GBDTs and Random Forests on GPUs

Skforecast is a python library that eases using scikit-learn regressors as multi-step forecasters

A basic Ray Tracer that exploits numpy arrays and functions to work fast.

Neighbourhood Retrieval (Nearest Neighbours) with Distance Correlation.

AutoX是一个高效的自动化机器学习工具，它主要针对于表格类型的数据挖掘竞赛。它的特点包括: 效果出色、简单易用、通用、自动化、灵活。

Greykite: A flexible, intuitive and fast forecasting library