Employee Turnover Analysis

Last update: Feb 13, 2022

Overview

Employee Turnover Analysis

Submission to the DataCamp competition "Can you help reduce employee turnover?" (https://app.datacamp.com/workspace/w/e9089f70-9c9a-4b5e-b1c2-94144ac12cd4)

Background

You work for the human capital department of a large corporation. The Board is worried about the relatively high turnover, and your team must look into ways to reduce the number of employees leaving the company. The team needs to understand better the situation, which employees are more likely to leave, and why. Once it is clear what variables impact employee churn, you can present your findings along with your ideas on how to attack the problem.

Content

The report covers the following questions:

Which department has the highest employee turnover? Which one has the lowest?
Which variables seem to be better predictors of employee departure?
How can the board reduce employee turnover?

Owner

Jannik Wiedenhaupt

MS in Data Science at Columbia University

GitHub Repository

Data-sets from the survey and analysis

bachelor-thesis "Umfragewerte.xlsx" contains the orginal survey results. "umfrage_alle.csv" contains the survey results but one participant is cancele

1 Jan 26, 2022

Python library for creating data pipelines with chain functional programming

PyFunctional Features PyFunctional makes creating data pipelines easy by using chained functional operators. Here are a few examples of what it can do

2.1k Jan 05, 2023

This repo contains a simple but effective tool made using python which can be used for quality control in statistical approach.

This repo contains a powerful tool made using python which is used to visualize, analyse and finally assess the quality of the product depending upon the given observations

8 Oct 18, 2022

Employee Turnover Analysis

Related tags

Overview

Employee Turnover Analysis

Background

Content

Owner

Jannik Wiedenhaupt

Data-sets from the survey and analysis

Python library for creating data pipelines with chain functional programming

This repo contains a simple but effective tool made using python which can be used for quality control in statistical approach.

Open-source Laplacian Eigenmaps for dimensionality reduction of large data in python.

Template for a Dataflow Flex Template in Python

Single-Cell Analysis in Python. Scales to >1M cells.

Implementation in Python of the reliability measures such as Omega.

Yet Another Workflow Parser for SecurityHub

Python Library for learning (Structure and Parameter) and inference (Statistical and Causal) in Bayesian Networks.

MidTerm Project for the Data Analysis FT Bootcamp, Adam Tycner and Florent ZAHOUI

Hatchet is a Python-based library that allows Pandas dataframes to be indexed by structured tree and graph data.

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

PySpark Structured Streaming ROS Kafka ApacheSpark Cassandra

Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.

NumPy and Pandas interface to Big Data

Desafio proposto pela IGTI em seu bootcamp de Cloud Data Engineer

A Numba-based two-point correlation function calculator using a grid decomposition

Data pipelines built with polars

Calculate multilateral price indices in Python (with Pandas and PySpark).

Analyze the Gravitational wave data stored at LIGO/VIRGO observatories