Preprossing-loan-data-with-NumPy - In this project, I have cleaned and pre-processed the loan data that belongs to an affiliate bank based in the United States.

Last update: Jan 03, 2022

Overview

Preprossing-loan-data-with-NumPy

In this project, I have cleaned and pre-processed the loan data that belongs to an affiliate bank based in the United States. This cleaning process is done using the NumPy library of Python. This cleaned data will be used to create a credit risk model which estimates the probability of default for every personal account. When we're measuring credit worthiness, we need to be extremely risk averse and distrustful of any unavailable data. That's why the consensus in the field is that missing information suggests foul play because loan applications are self-reported to elaborate since candidates fill out their loan applications manually. There is an incentive to withhold information which can lower their chances of getting a loan. Of course, we prefer to give out loans to applicants who can repay them so that the information isn't available we will just assume the worst and mark that field as ‘red_flag’ for further analysis and convenience in the model building phase. However, what is worst varies from one column to the next. All the values are in dollars, so we need to provide their euro equivalents. Every categorical variable must be quantified. So we need to change any text columns into numbers based on the information they contain. All the other tasks are explained in the jupyter notebook.

Preprossing-loan-data-with-NumPy - In this project, I have cleaned and pre-processed the loan data that belongs to an affiliate bank based in the United States.

Related tags

Overview

Preprossing-loan-data-with-NumPy

Owner

Dhawal Chitnavis

Performant, differentiable reinforcement learning

Re-implememtation of MAE (Masked Autoencoders Are Scalable Vision Learners) using PyTorch.

An efficient implementation of GPNN

simple artificial intelligence utilities

Pytorch Implementation for Dilated Continuous Random Field

GLIP: Grounded Language-Image Pre-training

Modular Gaussian Processes

The code is the training example of AAAI2022 Security AI Challenger Program Phase 8: Data Centric Robot Learning on ML models.

Implementations for the ICLR-2021 paper: SEED: Self-supervised Distillation For Visual Representation.

Aquarius - Enabling Fast, Scalable, Data-Driven Virtual Network Functions

ACL'2021: LM-BFF: Better Few-shot Fine-tuning of Language Models

DeepLM: Large-scale Nonlinear Least Squares on Deep Learning Frameworks using Stochastic Domain Decomposition (CVPR 2021)

Modifications of the official PyTorch implementation of StyleGAN3. Let's easily generate images and videos with StyleGAN2/2-ADA/3!

Codes for the paper Contrast and Mix: Temporal Contrastive Video Domain Adaptation with Background Mixing

Official code for paper "Optimization for Oriented Object Detection via Representation Invariance Loss".

Code for "Learning Canonical Representations for Scene Graph to Image Generation", Herzig & Bar et al., ECCV2020

Minimalistic PyTorch training loop

ML course - EPFL Machine Learning Course, Fall 2021

ICRA 2021 - Robust Place Recognition using an Imaging Lidar

Source code release of the paper: Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation.