SRA's seminar on Introduction to Computer Vision Fundamentals

Last update: Dec 04, 2022

Overview

Introduction to Computer Vision

This repository covers the basics of:

Python
NumPy: a Python library
Git
Computer Vision.

The aim of this repository is to provide:

A brief idea of the algorithms involved in Computer Vision.
Introduction to Version Control System: Git and GitHub.
Computer Vision and Image Processing basics, with an idea of how to implement the algorithms involved using NumPy (instead of a dedicated image processing library like OpenCV).
Introduction to a commonly used Image Processing Library: OpenCV

Demonstration

Comments

Add suboptimal 2D convolution

This pull request intends to add a suboptimal implementation of generic 2D convolution. The purpose is to give FYs a rough idea of how to work with Python arrays, loops, etc. FYs will be asked to improve this implementation and complete convolution-related tasks on top of it.
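
As a rough illustration of what such a deliberately unoptimised implementation might look like, here is a minimal sketch using NumPy arrays and plain Python loops. It is not the code from the pull request: the function name convolve2d_naive, the "valid" output size (no padding), and the float conversion are all assumptions made for this example.

```python
import numpy as np


def convolve2d_naive(image, kernel):
    """Slow, loop-based 2D convolution ('valid' mode, no padding).

    Hypothetical helper for illustration only; not the PR's implementation.
    """
    image = np.asarray(image, dtype=float)
    kernel = np.asarray(kernel, dtype=float)
    ih, iw = image.shape
    kh, kw = kernel.shape

    # True convolution flips the kernel in both axes (unlike correlation).
    flipped = kernel[::-1, ::-1]

    out_h, out_w = ih - kh + 1, iw - kw + 1
    out = np.zeros((out_h, out_w))

    # Four nested loops: easy to read, but far from optimal.
    for i in range(out_h):
        for j in range(out_w):
            acc = 0.0
            for m in range(kh):
                for n in range(kw):
                    acc += image[i + m, j + n] * flipped[m, n]
            out[i, j] = acc
    return out
```

One possible improvement exercise is to replace the two inner loops with a single element-wise product and sum over an image slice.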

opened by meshtag 5
Morphology notes updated.

I have added images for dilation and erosion, replaced the previous GIFs of dilation and erosion with new ones, and added a few lines explaining morphology.
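
For readers unfamiliar with the two operations, the following is a minimal NumPy sketch of binary dilation and erosion with a square structuring element. It is not taken from the updated notes: the names dilate and erode, the odd window size k, and the choice of zero padding at the border are assumptions for illustration.

```python
import numpy as np


def _shifted_views(binary, k):
    """Return the k*k shifted copies of a zero-padded binary image."""
    pad = k // 2
    padded = np.pad(binary, pad, mode="constant", constant_values=False)
    h, w = binary.shape
    return [padded[dy:dy + h, dx:dx + w] for dy in range(k) for dx in range(k)]


def dilate(binary, k=3):
    """A pixel becomes True if ANY pixel in its k x k neighbourhood is True."""
    binary = np.asarray(binary).astype(bool)
    return np.any(_shifted_views(binary, k), axis=0)


def erode(binary, k=3):
    """A pixel stays True only if ALL pixels in its k x k neighbourhood are True.

    With zero padding (an assumption here), pixels on the image border are
    always eroded away.
    """
    binary = np.asarray(binary).astype(bool)
    return np.all(_shifted_views(binary, k), axis=0)
```

Dilating a single foreground pixel grows it into a k x k block, and eroding that block shrinks it back to the single pixel.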

opened by Aryaman22102002 2
Updated cv-basics/
Optimised code and flow as discussed in:

cv-basics/5_opencv_overview.ipynb

python-numpy-basics/7_classes_and_objects.ipynb

Added an image:

cv-basics/image/bcci.png
opened by dhairyashah1 1
Port to C++ : Assignments related to PIXELS seminar
Is your feature request related to a problem? Please describe. This feature request is created to keep a record of porting and potential addition of new assignments related to the seminar in C++ as discussed in this thread.

Describe the solution you'd like

Create a separate main folder for containing all assignments. Individual assignments related to specific topics might be grouped together inside the main parent folder of assignments.

You might choose to add reference links to individual questions, providing additional material on a topic related to that question. This is suggested solely to give more (potentially real-world) information on the topic asked about in the original question, and hence should not in any way lead to the solution.

enhancement
opened by meshtag 0
Add Content: Interpolations.
Is your feature request related to a problem? Please describe. As discussed in the thread, concepts of interpolation can also be added.

Describe the solution you'd like

Create implementations of interpolation from scratch, using the OpenCV C++ API where necessary.

Add a Makefile to compile and build executables.

Add a .md file to explain the theory of interpolations and instructions to build and run the executables.
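
To make the idea of "interpolation from scratch" concrete, here is a minimal bilinear resize sketched in Python with NumPy rather than the C++ requested above. The function name resize_bilinear and the corner-aligned coordinate mapping are assumptions for this example; other mapping conventions exist and give slightly different results.

```python
import numpy as np


def resize_bilinear(image, new_h, new_w):
    """Resize a 2D grayscale image with bilinear interpolation.

    Hypothetical helper; output corners are mapped exactly onto input corners.
    """
    image = np.asarray(image, dtype=float)
    h, w = image.shape

    # Fractional source coordinates for every output row and column.
    ys = np.linspace(0, h - 1, new_h)
    xs = np.linspace(0, w - 1, new_w)

    # Indices of the neighbouring source pixels (clamped at the far edge).
    y0 = np.floor(ys).astype(int)
    x0 = np.floor(xs).astype(int)
    y1 = np.minimum(y0 + 1, h - 1)
    x1 = np.minimum(x0 + 1, w - 1)

    # Interpolation weights along each axis, shaped for broadcasting.
    wy = (ys - y0)[:, None]
    wx = (xs - x0)[None, :]

    # Interpolate horizontally on the two neighbouring rows, then vertically.
    top = image[y0][:, x0] * (1 - wx) + image[y0][:, x1] * wx
    bottom = image[y1][:, x0] * (1 - wx) + image[y1][:, x1] * wx
    return top * (1 - wy) + bottom * wy
```

For example, resizing the 2x2 image [[0, 2], [4, 6]] to 3x3 fills in the midpoints between the original samples.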

Additional context Reference: Ancient Secrets of computer vision.

Note: Content is not finalised and open for discussion
enhancement
opened by amanchhaparia 0
Add Content: Image Storing Formats.
Is your feature request related to a problem? Please describe. As discussed in the thread, it is important to be familiar with how images are stored.

Describe the solution you'd like

Add the theory of basic image storage formats such as .bmp, .tiff, .jpg, .png, etc.

Implement a .cpp file showing how an image can be read from the BMP format.

Consider only 8-bit grayscale bitmap images (since they are easy to read and contain only 2D data).

Use the simple POSIX read() API to read the bitmap image file.

Storing the values of the image's attributes directly in a struct is suggested.

A similar example can be added to demonstrate how to edit/write a grayscale bitmap image.

Add a Makefile to compile and build the executable.

Add a .md file explaining the theory and instructions to build and run the executables.
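
The steps above can be prototyped before writing the C++ version. The sketch below parses an uncompressed 8-bit BMP from a bytes object in Python, using the struct module in place of a C struct and POSIX read(). The function name read_gray_bmp is hypothetical, and the code assumes a 14-byte file header followed by a standard 40-byte info header and an identity grayscale palette, so that pixel bytes can be used directly as intensities.

```python
import struct

import numpy as np


def read_gray_bmp(data: bytes) -> np.ndarray:
    """Decode an uncompressed 8-bit BMP held in memory into a 2D uint8 array.

    Hypothetical helper; assumes the palette maps index i to gray level i.
    """
    # File header: magic, file size, two reserved fields, pixel data offset.
    magic, _size, _r1, _r2, offset = struct.unpack_from("<2sIHHI", data, 0)
    if magic != b"BM":
        raise ValueError("not a BMP file")

    # First fields of the info header, which starts at byte 14.
    _hdr, width, height, _planes, bpp, compression = struct.unpack_from(
        "<IiiHHI", data, 14
    )
    if bpp != 8 or compression != 0:
        raise ValueError("only uncompressed 8-bit BMPs are handled")

    # Each row is padded to a multiple of 4 bytes.
    row_size = (width + 3) & ~3
    rows = abs(height)
    pixels = np.frombuffer(data, dtype=np.uint8, count=row_size * rows, offset=offset)
    image = pixels.reshape(rows, row_size)[:, :width]

    # A positive height means rows are stored bottom-up.
    if height > 0:
        image = image[::-1]
    return image.copy()
```

To read from disk, the file contents can be passed in with open(path, "rb").read(), where path is whatever BMP file is being tested.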

Note: Content is not finalised and open for discussion.
enhancement C++
opened by amanchhaparia 2
Add Content: Build Systems
Is your feature request related to a problem? Please describe. As discussed in the thread, concepts of build systems should be added.

Describe the solution you'd like

Content should be added on manually creating and linking object files.

Importance of build systems.

Add the contents for Makefile.

Add contents for CMake.

Additional context Can refer from here: Embedded Study Group Week 2.

Note: Content is not finalised and open for discussion.
enhancement Build-Systems
opened by amanchhaparia 0
Add Content: C++ basic concepts for seminar.
Is your feature request related to a problem? Please describe. Since the seminar is being ported to C++ as discussed in this thread, it is important to teach some important C++ concepts.

Describe the solution you'd like

Some advanced concepts of C++, such as handling 2D arrays/vectors, pointers, etc.

Note: Content is not finalised and open for discussion.
enhancement C++
opened by amanchhaparia 1

Releases (v1.0)

v1.0 (Sep 7, 2022)
This release contains the first version of the PIXELS Seminar, conducted in 2021. Its content is implemented in Python and uses NumPy and the OpenCV Python API.

This release can be used as a reference for basic Image Processing using Python.

Contains a tutorial on the necessary NumPy methods.

Tutorials on commonly used OpenCV functions in Python.

Implementation of blob detection, a very commonly used algorithm, in Python.


Owner

Society of Robotics and Automation

The Society of Robotics and Automation is a society for VJTI students. As the name suggests, we deal with Robotics, Machine Vision and Automation.

GitHub Repository

MXNet OCR implementation. Including text recognition and detection.

insightocr Text Recognition Accuracy on Chinese dataset by caffe-ocr Network LSTM 4x1 Pooling Gray Test Acc SimpleNet N Y Y 99.37% SE-ResNet34 N Y Y 9

99 Nov 01, 2022

Source Code for AAAI 2022 paper "Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching"

Graph Convolutional Networks with Dual Message Passing for Subgraph Isomorphism Counting and Matching This repository is an official implementation of

13 Sep 08, 2022

Motion Detection Squid Game with OpenCV Python

Motion Detection Squid Game with OpenCV Python. I am a newbie in Python. In this project I made a simple game to follow the trend about the red light gr

17 Nov 22, 2022

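chineseocr/table_line: table line detection model, PyTorch version

table_line_pytorch chineseocr/table_detct: table line detection model table_line, PyTorch version. Original project on GitHub: https://github.com/chineseocr/table-detect 1. Model conversion: download the original project's table_detect model fi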

1 Oct 21, 2021

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

Multi-Type-TD-TSR Check it out on Source Code of our Paper: Multi-Type-TD-TSR Extracting Tables from Document Images using a Multi-stage Pipeline for

178 Dec 27, 2022

code for our ICCV 2021 paper "DeepCAD: A Deep Generative Network for Computer-Aided Design Models"

DeepCAD This repository provides source code for our paper: DeepCAD: A Deep Generative Network for Computer-Aided Design Models Rundi Wu, Chang Xiao,

85 Dec 31, 2022

A PyTorch implementation of ECCV2018 Paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes

TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes A PyTorch implement of TextSnake: A Flexible Representation for Detecting

417 Dec 12, 2022

An implementation of the SegLink algorithm from the paper Detecting Oriented Text in Natural Images by Linking Segments

Tips: A more recent scene text detection algorithm: PixelLink, has been implemented here: https://github.com/ZJULearning/pixel_link Contents: Introduc

484 Dec 07, 2022

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

RepMLP RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition Released the code of RepMLP together with an example o

260 Jan 03, 2023

Code release for Hu et al., Learning to Segment Every Thing. in CVPR, 2018.

Learning to Segment Every Thing This repository contains the code for the following paper: R. Hu, P. Dollár, K. He, T. Darrell, R. Girshick, Learning

417 Oct 03, 2022

Code for the head detector (HeadHunter) proposed in our CVPR 2021 paper Tracking Pedestrian Heads in Dense Crowd.

Head Detector Code for the head detector (HeadHunter) proposed in our CVPR 2021 paper Tracking Pedestrian Heads in Dense Crowd. The head_detection mod

76 Dec 06, 2022

Solution for Problem 1 by team codesquad for AIDL 2020. Uses ML Kit for OCR and OpenCV for image processing

CodeSquad PS1 Solution for Problem Statement 1 for AIDL 2020 conducted by @unifynd technologies. Problem Given images of bills/invoices, the task was

111 Nov 27, 2022

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless. This is the official Roboflow python package that interfaces with the Roboflow API.

52 Dec 23, 2022

Document Layout Analysis Projects

Layout_Analysis Introduction This is an implementation of RLSA and X-Y Cut with OpenCV Dependencies OpenCV 3.0+ How to use Compile with g++ : g++ -std

22 Dec 08, 2022

A real-time dolly zoom camera effect

Dolly-Zoom I've always been amazed by the gradual perspective change of dolly zoom, and I have some experience in python and OpenCV, so I decided to c

52 Dec 08, 2022

Code release for our paper, "SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo"

SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo Thomas Kollar, Michael Laskey, Kevin Stone, Brijen Thananjeyan

68 Dec 14, 2022

Character Segmentation using TensorFlow

Character Segmentation Segment characters and spaces in one text line,from this paper Chinese English mixed Character Segmentation as Semantic Segment

26 Aug 25, 2022

[ICCV, 2021] Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks

Cloud Transformers: A Universal Approach To Point Cloud Processing Tasks This is an official PyTorch code repository of the paper "Cloud Transformers:

27 Dec 15, 2022

This is a GUI for scraping PDFs with the help of optical character recognition, making it easier than ever to scrape PDFs.

pdf-scraper-with-ocr With this tool I am aiming to facilitate the work of those who need to scrape PDFs either by hand or using tools that doesn't imp

75 Oct 21, 2022

PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application in real time.

PyNeuro PyNeuro is designed to connect NeuroSky's MindWave EEG device to Python and provide Callback functionality to provide data to your application

45 Dec 30, 2022