SRA's seminar on Introduction to Computer Vision Fundamentals

Overview

Introduction to Computer Vision

This repository includes basics to :

  1. Python
  2. Numpy: A python library
  3. Git
  4. Computer Vision.

The aim of this repository is to provide:

  • A brief idea of algorithms involved in Computer Vision .
  • Introduction to Version Control System: Git and GitHub.
  • Computer Vision and Image Processing basics, idea of implementation of various algorithms involved using numpy (instead of any dedicated image processing library like OpenCV.)
  • Introduction to a commonly used Image Processing Library: OpenCV

Demonstration

Table Of Contents

Comments
  • Add suboptimal 2D convolution

    Add suboptimal 2D convolution

    This pull request intends to add a suboptimal implementation of generic 2D convolution. This is done for the purpose of giving a rough idea to Fys about how to work with python arrays/loops, etc. Fys will be asked to improve this implementation and complete tasks related to convolution on top of it.

    opened by meshtag 5
  • Morphology notes updated.

    Morphology notes updated.

    I have added images for dilation and erosion, replaced the previous gif of dilation and erosion with new ones and added a few lines explaining morphology.

    opened by Aryaman22102002 2
  • Updated cv-basics/

    Updated cv-basics/

    Optimised code and flow as discussed in:

    1. cv-basics/5_opencv_overview.ipynb
    2. python-numpy-basics/7_classes_and_objects.ipynb

    Added an image :

    1. cv-basics/image/bcci.png
    opened by dhairyashah1 1
  • Port to C++ : Assignments related to PIXELS seminar

    Port to C++ : Assignments related to PIXELS seminar

    Is your feature request related to a problem? Please describe. This feature request is created to keep a record of porting and potential addition of new assignments related to the seminar in C++ as discussed in this thread.

    Describe the solution you'd like

    • Create a separate main folder for containing all assignments. Individual assignments related to specific topics might be grouped together inside the main parent folder of assignments.
    • You might chose to add reference links in individual questions, which may provide additional material on a related topic for that question (this is suggested solely for the purpose of providing more (potentially real world) info related to the topic asked in original question and hence, should not in any way lead to the solution).
    enhancement 
    opened by meshtag 0
  • Add Content: Interpolations.

    Add Content: Interpolations.

    Is your feature request related to a problem? Please describe. As discussed in the thread, concepts of interpolation can also be added.

    Describe the solution you'd like

    • Create a implementations of interpolation from scratch using necessary OpenCV C++ API.
    • Add a Makefile to compile and build executables.
    • Add a .md file to explain the theory of interpolations and instructions to build and run the executables.

    Additional context Reference: Ancient Secrets of computer vision.

    Note: Content is not finalised and open for discussion

    enhancement 
    opened by amanchhaparia 0
  • Add Content: Image Storing Formats.

    Add Content: Image Storing Formats.

    Is your feature request related to a problem? Please describe. As discussed in the thread, It is important to have a familiarity of how images are store.

    Describe the solution you'd like

    • Add the theory of basic image storing formats such as .bmp, .tiff, .jpg, png etc.
    • Implement a .cpp file on how image can be read from the bmp format.
      • Consider only 8 bit grayscale BitMap image (Since they are easy to read and contains only 2D form of data).
      • Use simple posix read() api to read the image bitmap file.
      • Directly storing the values of various attributes of image in struct is suggested.
    • A similar example can be added to demonstrate how to edit/write a grayscale bitmap image.
    • Add a Makefile to compile and build the executable.
    • Add a .md file explaining the theory and instructions to build and run the executables.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 2
  • Add Content: Build Systems

    Add Content: Build Systems

    Is your feature request related to a problem? Please describe. As discussed in the thread, Concepts of Build System should be added.

    Describe the solution you'd like

    • Content should be added for manual creating and linking the object files.
    • Importance of build systems.
    • Add the contents for Makefile.
    • Add contents for Cmake.

    Additional context Can refer from here: Embedded Study Group Week 2.

    Note: Content is not finalised and open for discussion.

    enhancement Build-Systems 
    opened by amanchhaparia 0
  • Add Content: C++ basic concepts for seminar.

    Add Content: C++ basic concepts for seminar.

    Is your feature request related to a problem? Please describe. Since the seminar is being ported to C++ as discussed in this thread, it is important to teach some important C++ concepts.

    Describe the solution you'd like

    • Some advance concepts of C++ like handling 2D arrays/vector, pointer etc.

    Note: Content is not finalised and open for discussion.

    enhancement C++ 
    opened by amanchhaparia 1
Releases(v1.0)
  • v1.0(Sep 7, 2022)

    This release contains the 1st version of the PIXELS Seminar conducted in 2021. The content of this release is implemented in Python and uses numpy and OpenCV Python API.

    • This release can be used as a reference to basic Image Processing using Python.
    • Contains a tutorial for necessary numpy methods.
    • Tutorials on commonly used OpenCV functions in Python.
    • Implementation of blob detection a very commonly used algorithm in Python.
    Source code(tar.gz)
    Source code(zip)
Owner
Society of Robotics and Automation
The Society of Robotics and Automation is a society for VJTI students. As the name suggests, we deal with Robotics, Machine Vision and Automation .
Society of Robotics and Automation
OpenGait is a flexible and extensible gait recognition project

A flexible and extensible framework for gait recognition. You can focus on designing your own models and comparing with state-of-the-arts easily with the help of OpenGait.

Shiqi Yu 335 Dec 22, 2022
list all open dataset about ocr.

ocr-open-dataset list all open dataset about ocr. printed dataset year Born-Digital Images (Web and Email) 2011-2015 COCO-Text 2017 Text Extraction fr

hongbomin 95 Nov 24, 2022
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.

hocr-tools About About the code Installation System-wide with pip System-wide from source virtualenv Available Programs hocr-check -- check the hOCR f

OCRopus 285 Dec 08, 2022
Scene text recognition

AttentionOCR for Arbitrary-Shaped Scene Text Recognition Introduction This is the ranked No.1 tensorflow based scene text spotting algorithm on ICDAR2

777 Jan 09, 2023
Vietnamese Language Detection and Recognition

Table of Content Introduction (Khôi viết) Dataset (đổi link thui thành 3k5 ảnh mình) Getting Started (An Viết) Requirements Usage Example Training & E

6 May 27, 2022
Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and limited )

GTA-5-Lane-detection Just a script for detecting the lanes in any car game (not just gta 5) with specific resolution and road design ( very basic and

Danciu Georgian 4 Aug 01, 2021
A small C++ implementation of LSTM networks, focused on OCR.

clstm CLSTM is an implementation of the LSTM recurrent neural network model in C++, using the Eigen library for numerical computations. Status and sco

Tom 794 Dec 30, 2022
YOLOv5 in DOTA with CSL_label.(Oriented Object Detection)(Rotation Detection)(Rotated BBox)

YOLOv5_DOTA_OBB YOLOv5 in DOTA_OBB dataset with CSL_label.(Oriented Object Detection) Datasets and pretrained checkpoint Datasets : DOTA Pretrained Ch

1.1k Dec 30, 2022
Programa que viabiliza a OCR (Optical Character Reading - leitura óptica de caracteres) de um PDF.

Este programa tem o intuito de ser um modificador de arquivos PDF. Os arquivos PDFs podem ser 3: PDFs verdadeiros - em que podem ser selecionados o ti

Daniel Soares Saldanha 2 Oct 11, 2021
Play the Namibian game of Owela against a terrible AI. Built using Django and htmx.

Owela Club A Django project for playing the Namibian game of Owela against a dumb AI. Built following the rules described on the Mancala World wiki pa

Adam Johnson 18 Jun 01, 2022
Rotational region detection based on Faster-RCNN.

R2CNN_Faster_RCNN_Tensorflow Abstract This is a tensorflow re-implementation of R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detecti

UCAS-Det 581 Nov 22, 2022
Tensorflow-based CNN+LSTM trained with CTC-loss for OCR

Overview This collection demonstrates how to construct and train a deep, bidirectional stacked LSTM using CNN features as input with CTC loss to perfo

Jerod Weinman 489 Dec 21, 2022
Pixie - A full-featured 2D graphics library for Python

Pixie - A full-featured 2D graphics library for Python Pixie is a 2D graphics library similar to Cairo and Skia. pip install pixie-python Features: Ty

treeform 65 Dec 30, 2022
Unofficial implementation of "TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images"

TableNet Unofficial implementation of ICDAR 2019 paper : TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from

Jainam Shah 243 Dec 30, 2022
An expandable and scalable OCR pipeline

Overview Nidaba is the central controller for the entire OGL OCR pipeline. It oversees and automates the process of converting raw images into citable

81 Jan 04, 2023
Create single line SVG illustrations from your pictures

Create single line SVG illustrations from your pictures

Javier Bórquez 686 Dec 26, 2022
Tool which allow you to detect and translate text.

Text detection and recognition This repository contains tool which allow to detect region with text and translate it one by one. Description Two pretr

Damian Panek 176 Nov 28, 2022
This repository contains codes on how to handle mouse event using OpenCV

Handling-Mouse-Click-Events-Using-OpenCV This repository contains codes on how t

Happy N. Monday 3 Feb 15, 2022
Natural language detection

Detect the language of text. What’s so cool about franc? franc can support more languages(†) than any other library franc is packaged with support for

Titus 3.8k Jan 02, 2023
Generates a message from the infamous Jerma Impostor image

Generate your very own jerma sus imposter message. Modes: Default Mode: Only supports the characters " ", !, a, b, c, d, e, h, i, m, n, o, p, q, r, s,

Giorno420 1 Oct 27, 2022