✨ Real-life Data Analysis and Model Training Workshop by Global AI Hub.

Overview

🎓 Data Analysis and Model Training Course by Global AI Hub

Syllabus:

Day 1

  • What is Data?

  • Multimedia

  • Structured and Unstructured Data

  • Data Types

  • Data Visualization

    • What is Visualization?
    • Tufte's 6 Principle
    • Visualization Types
      • Line Plot
      • Scatter Plot
      • Bar Plot
      • Histogram
      • Pie Charts
      • Heatmap
      • Box Plot
      • Kartil Nedir? Nasıl Hesaplanır?
      • Joint Plot
      • KDE(Kernel Density Estimate)
  • Statistics

    • Descriptive Statistics Concepts
    • The Concept of Skewness
    • Correlation and Correlation Matrix
    • The Simpsons Paradox
    • Anscombe Quartet
    • Data Distribution and Hypothesis Testing
  • Data Distribution

    • Data and Distribution
    • Gaussian(Normal) Distribution
    • t-Distribution
    • Degrees of Freedom
    • Bernoulli's Distribution
    • Exponential Distribution
  • Application

    • Pandas Revision
    • Introduction to Data Preprocessing with Pandas

Day 2

  • Hypothesis Tests

    • Basic Hypothesis testing
    • P value
    • T test
    • Z test
    • Chi-square (Chi-Square) Test
    • Errors in Hypothesis Testing
  • Data Cleaning

    • The 68-95-99.7 Rule and 3 Sigma
    • Outlier, Missing and Duplicate Data and their Detection
    • Z-Score
    • Handling missing values
    • Null vs NaN
    • Pandas Functions for missing values
    • Dimensionality Reduction
    • PCA (Principal Component Analysis)
    • Collinearity (Multiple Linear Connection
  • Data Transformation

    • Data Conversion Techniques
      • round
      • Scaling
      • Label Encoding
      • One Hot Encoding
      • Stack
      • melt
      • Shorts
      • Feature Engineering
  • Data Augmentation

    • Aggregation Functions
  • Application

    • Data Visualization with Seaborn
    • Data Preprocessing with Pandas

Day 3

  • ML Review

    • What is Machine Learning?
    • Supervised Learning
    • Unsupervised Learning
    • Errors That May Be Encountered in Model Training
    • Tools Used in Data Analysis and Machine Learning
    • End-to-End Machine Learning Project Steps
  • Application

    • Training An End-to-End ML Model with a Real Dataset

Certification

The course completion is certified.

Owner
Global AI Hub
Where AI Talent Meets Opportunity
Global AI Hub
Swagger Documentation Generator for Django REST Framework: deprecated

Django REST Swagger: deprecated (2019-06-04) This project is no longer being maintained. Please consider drf-yasg as an alternative/successor. I haven

Marc Gibbons 2.6k Jan 03, 2023
Repository for tutorials, examples and starter scripts for using the MTU HPC cluster

MTU-HPC-Starter Repository for tutorials, examples and starter scripts for using the MTU HPC cluster Connecting to the MTU HPC cluster Within the coll

1 Jan 31, 2022
Fully reproducible, Dockerized, step-by-step, tutorial on how to mock a "real-time" Kafka data stream from a timestamped csv file. Detailed blog post published on Towards Data Science.

time-series-kafka-demo Mock stream producer for time series data using Kafka. I walk through this tutorial and others here on GitHub and on my Medium

Maria Patterson 26 Nov 15, 2022
Legacy python processor for AsciiDoc

AsciiDoc.py This branch is tracking the alpha, in-progress 10.x release. For the stable 9.x code, please go to the 9.x branch! AsciiDoc is a text docu

AsciiDoc.py 178 Dec 25, 2022
The OpenAPI Specification Repository

The OpenAPI Specification The OpenAPI Specification is a community-driven open specification within the OpenAPI Initiative, a Linux Foundation Collabo

OpenAPI Initiative 25.5k Dec 29, 2022
Parser manager for parsing DOC, DOCX, PDF or HTML files

Parser manager Description Parser gets PDF, DOC, DOCX or HTML file via API and saves parsed data to the database. Implemented in Ruby 3.0.1 using Acti

Эдем 4 Dec 04, 2021
Data-Scrapping SEO - the project uses various data scrapping and Google autocompletes API tools to provide relevant points of different keywords so that search engines can be optimized

Data-Scrapping SEO - the project uses various data scrapping and Google autocompletes API tools to provide relevant points of different keywords so that search engines can be optimized; as this infor

Vibhav Kumar Dixit 2 Jul 18, 2022
AiiDA plugin for the HyperQueue metascheduler.

aiida-hyperqueue WARNING: This plugin is still in heavy development. Expect bugs to pop up and the API to change. AiiDA plugin for the HyperQueue meta

AiiDA team 3 Jun 19, 2022
A curated list of python programming language blogs

Python Blogs A curated list of python programming language blogs Contribute Companies/Organization # A B C D E F G H I J K L M N O P Q R S T U V W X Y

Rizky D. Onto 48 Nov 15, 2022
A Sublime Text plugin to select a default syntax dialect

Default Syntax Chooser This Sublime Text 4 plugin provides the set_default_syntax_dialect command. This command manipulates a syntax file (e.g.: SQL.s

3 Jan 14, 2022
Numpy's Sphinx extensions

numpydoc -- Numpy's Sphinx extensions This package provides the numpydoc Sphinx extension for handling docstrings formatted according to the NumPy doc

NumPy 234 Dec 26, 2022
Soccerdata - Efficiently scrape soccer data from various sources

SoccerData is a collection of wrappers over soccer data from Club Elo, ESPN, FBr

Pieter Robberechts 195 Jan 04, 2023
Python syntax highlighted Markdown doctest.

phmdoctest 1.3.0 Introduction Python syntax highlighted Markdown doctest Command line program and Python library to test Python syntax highlighted cod

Mark Taylor 16 Aug 09, 2022
Automatically open a pull request for repositories that have no CONTRIBUTING.md file

automatic-contrib-prs Automatically open a pull request for repositories that have no CONTRIBUTING.md file for a targeted set of repositories. What th

GitHub 8 Oct 20, 2022
An interview engine for businesses, interview those who are actually qualified and are worth your time!

easyInterview V0.8B An interview engine for businesses, interview those who are actually qualified and are worth your time! Quick Overview You/the com

Vatsal Shukla 1 Nov 19, 2021
Python solutions to solve practical business problems.

Python Business Analytics Also instead of "watching" you can join the link-letter, it's already being sent out to about 90 people and you are free to

Derek Snow 357 Dec 26, 2022
Main repository for the Sphinx documentation builder

Sphinx Sphinx is a tool that makes it easy to create intelligent and beautiful documentation for Python projects (or other documents consisting of mul

5.1k Jan 04, 2023
This repository outlines deploying a local Kubeflow v1.3 instance on microk8s and deploying a simple MNIST classifier using KFServing.

Zero to Inference with Kubeflow Getting Started This repository houses all of the tools, utilities, and example pipeline implementations for exploring

Ed Henry 3 May 18, 2022
A simple flask application to collect annotations for the Turing Change Point Dataset, a benchmark dataset for change point detection algorithms

AnnotateChange Welcome to the repository of the "AnnotateChange" application. This application was created to collect annotations of time series data

The Alan Turing Institute 16 Jul 21, 2022
A tool that allows for versioning sites built with mkdocs

mkdocs-versioning mkdocs-versioning is a plugin for mkdocs, a tool designed to create static websites usually for generating project documentation. mk

Zayd Patel 38 Feb 26, 2022