Program that predicts the NBA MVP based on data from previous years.

Last update: Jan 21, 2022

NBA MVP Predictor

A machine learning model using RandomForest regression that predicts NBA MVPs from player data.

About The Project

This project uses a RandomForest regression model to predict the NBA MVP. Picking an MVP may look like a classification problem, but our approach instead predicts a numerical variable called MVP win share. Within each season, the player with the highest predicted MVP win share is taken as the predicted MVP. Framing the problem this way makes regression the natural fit.
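
The selection step described above can be sketched in a few lines. This is a minimal illustration, not the notebook's actual code: the function name, the tuple layout, and the numbers are all assumptions made for the example.

```python
def pick_mvp_per_season(predictions):
    """Return {season: player} for the highest predicted MVP win share per season.

    `predictions` is an iterable of (season, player, predicted_share) tuples.
    The layout is hypothetical and chosen only for this sketch.
    """
    best = {}
    for season, player, share in predictions:
        # Keep the player with the largest predicted share seen so far.
        if season not in best or share > best[season][1]:
            best[season] = (player, share)
    return {season: player for season, (player, _) in best.items()}


# Made-up values for illustration only; not real predictions.
example = [
    (2011, "Player A", 0.71),
    (2011, "Player B", 0.35),
    (2012, "Player C", 0.12),
    (2012, "Player B", 0.64),
]
print(pick_mvp_per_season(example))
```

The regression model only has to rank players sensibly within a season; the final MVP call is a per-season maximum over its outputs.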

Our machine learning model is trained on data from 1980-2010 and then used to predict the MVPs for the 2011-2021 seasons.
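
A split by season like the one above might look roughly as follows with scikit-learn. The data here is synthetic and the three feature columns are placeholders; the real notebook's columns and hyperparameters are not shown in this README, so treat every name and setting below as an assumption.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Synthetic stand-in for the real dataset: one row per player-season.
n_rows = 400
seasons = rng.integers(1980, 2022, size=n_rows)  # 1980..2021 inclusive
features = rng.random((n_rows, 3))               # three placeholder stats
share = 0.6 * features[:, 0] + 0.1 * rng.random(n_rows)  # fake target

# Temporal split: fit on 1980-2010, hold out 2011-2021.
train_mask = seasons <= 2010
test_mask = ~train_mask

# n_estimators and random_state are illustrative choices, not the author's.
model = RandomForestRegressor(n_estimators=100, random_state=0)
model.fit(features[train_mask], share[train_mask])

# Predicted MVP win share for every held-out player-season.
predicted_share = model.predict(features[test_mask])
print(predicted_share.shape)
```

Splitting by year rather than at random keeps later seasons out of training, so the held-out predictions resemble forecasting seasons the model has not seen.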

Built With

Examples of Graphs Used

Usage

To run this model on your system, download the Jupyter notebook and the data. Then, in the notebook, change the URL assigned to the raw_mvp_data variable to the path where the data is located on your system.
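
One optional way to make the path change less error-prone is to validate it before loading anything. The helper below is a hypothetical addition, not part of the notebook, and the file name in the usage note is a placeholder.

```python
from pathlib import Path


def resolve_data_path(location):
    """Expand '~' and return an absolute path, failing early if the file is missing.

    Hypothetical helper for illustration; the notebook does not define it.
    """
    path = Path(location).expanduser().resolve()
    if not path.is_file():
        raise FileNotFoundError(f"Data file not found: {path}")
    return path
```

For example, raw_mvp_data = resolve_data_path("~/Downloads/mvp_data.csv") would raise a clear error if the file is not where you expect it, where "mvp_data.csv" stands in for whatever the downloaded data file is actually called.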

Results

The model achieved an R^2 value of 0.6127, with 8 out of 10 of its MVP predictions correct.
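
For readers unfamiliar with the metric, the sketch below computes R^2 by hand, assuming the reported value is the usual coefficient of determination (one minus the ratio of residual to total sum of squares). The function is illustrative only and does not reproduce the 0.6127 figure.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    # Squared error between true values and the model's predictions.
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    # Squared error of a baseline that always predicts the mean.
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


print(r_squared([1.0, 2.0, 3.0], [1.0, 2.0, 3.0]))  # perfect fit
print(r_squared([1.0, 2.0, 3.0], [2.0, 2.0, 2.0]))  # no better than the mean
```

R^2 measures how closely the predicted win shares track the true ones across all players. The count of correct MVP picks depends only on who ranks first in each season, so the two numbers can move independently.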

Acknowledgements

Inspiration from this article: https://towardsdatascience.com/predicting-the-next-nba-mvp-using-machine-learning-62615bfcff75

Owner

Muhammad Rabee

ELFXtract is an automated analysis tool used for enumerating ELF binaries

Very useful and necessary functions that simplify working with data

This program analyzes a DNA sequence and outputs snippets of DNA that are likely to be protein-coding genes.

PyPDC is a Python package for calculating asymptotic Partial Directed Coherence estimations for brain connectivity analysis.

Learn machine learning the fun way, with Oracle and RedBull Racing

Repository created with LinkedIn profile analysis project done

BioMASS - A Python Framework for Modeling and Analysis of Signaling Systems

PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai)

MIR Cheatsheet - Survival Guidebook for MIR Researchers in the Lab

Handle, manipulate, and convert data with units in Python

The official pytorch implementation of ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias

Streamz helps you build pipelines to manage continuous streams of data

Powerful, efficient particle trajectory analysis in scientific Python.

Demonstrate a Dataflow pipeline that saves data from an API into BigQuery table

Tuplex is a parallel big data processing framework that runs data science pipelines written in Python at the speed of compiled code

Lale is a Python library for semi-automated data science.

Python package for analyzing behavioral data for Brain Observatory: Visual Behavior

A crude Hy handle on Pandas library

peptides.py is a pure-Python package to compute common descriptors for protein sequences

Statistical package in Python based on Pandas