Object Detection and Multi-Object Tracking

Last update: Jan 04, 2023

Overview

Object Detection and Tracking

Object detection is a computer technology related to computer vision and image processing that deals with detecting instances of semantic objects of a certain class (such as humans, buildings, or cars) in digital images and videos.

Environment

Tested on Ubuntu 16.04 and 18.04. The code may work on other systems, but this has not been verified.

[Ubuntu-Deep-Learning-Environment-Setup]

Ubuntu 16.04 / 18.04
ROS Kinetic / Melodic
GTX 1080Ti / RTX 2080Ti
Python 2.7 / 3.6

Installation

Clone the repository

git clone https://github.com/yehengchen/Object-Detection-and-Tracking.git

[OneStage]

YOLO: Real-Time Object Detection and Tracking

How to train a YOLO model on custom images: YOLOv3 - [Link] / YOLOv4 - [Link]

YOLOv4 + Deep_SORT - Pedestrian Counting & Social Distance - [Link]
YOLOv3 + Deep_SORT - Pedestrian & Car Counting - [Link]

YOLOv3 + SORT - Pedestrian Counting - [Link]
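As a rough illustration of how counting can sit on top of a tracker such as SORT or Deep_SORT, the sketch below counts distinct track IDs whose box centre crosses a horizontal line. It is a minimal sketch under stated assumptions, not the code used in the linked projects: the function name `count_line_crossings`, the `(track_id, box)` input layout, and the counting-line rule are all hypothetical.

```python
from typing import Dict, Iterable, List, Set, Tuple

# Assumed (hypothetical) tracker output layout: one list of tracks per frame.
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max) in pixels
Track = Tuple[int, Box]                  # (track_id, box)


def count_line_crossings(frames: Iterable[List[Track]], line_y: float) -> int:
    """Count distinct track IDs whose box centre crosses the line y = line_y."""
    last_side: Dict[int, bool] = {}  # track_id -> was the centre below the line?
    counted: Set[int] = set()        # IDs that have crossed at least once
    for tracks in frames:
        for track_id, (x_min, y_min, x_max, y_max) in tracks:
            centre_y = (y_min + y_max) / 2.0
            below = centre_y > line_y
            previous = last_side.get(track_id)
            # A change of side between two consecutive sightings is a crossing.
            if previous is not None and previous != below:
                counted.add(track_id)
            last_side[track_id] = below
    return len(counted)


# Toy data: track 1 walks downwards across the line, track 2 stays below it.
frames = [
    [(1, (10, 10, 30, 50)), (2, (100, 80, 120, 120))],
    [(1, (10, 40, 30, 80)), (2, (100, 82, 120, 122))],
    [(1, (10, 70, 30, 110)), (2, (100, 84, 120, 124))],
]
crossings = count_line_crossings(frames, line_y=65.0)
print(crossings)
```

Counting by track ID rather than by detection keeps a person who stays in view for many frames from being counted more than once, which is the main reason a tracker is paired with the detector here.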

Darknet_ROS: Real-Time Object Detection and Grasp Detection With ROS

YOLOv3 + ROS Kinetic - For small Custom Data - [Link]

YOLOv3 + ROS Melodic - Robot Grasp Detection - [Link]
Parts-Arrangement-Robot - [Link]

YOLOv3 + OpenCV + ROS Melodic - Object Detection (Rotated) - [Link]

SSD: Single Shot MultiBox Detector

How to train an SSD model on your own images - [Link]

[TwoStage]

R-CNN: Region-based methods

Fast R-CNN / Faster R-CNN / Mask R-CNN

How to train a Mask R-CNN model on your own images - [Link]

Mask R-CNN + ROS Kinetic - [Link]

This project is a ROS package of the Mask R-CNN algorithm for object detection and segmentation.

COCO & VOC Datasets

COCO dataset and Pascal VOC dataset - [Link]
How to convert the COCO dataset to VOC format with coco2voc - [Link]
Convert Dataset2Yolo - COCO / VOC - [Link]
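The core of a COCO / VOC to YOLO conversion is a change of bounding-box convention, which can be sketched as follows. This is a minimal illustration and not the linked Dataset2Yolo code: the function names are hypothetical, and it assumes VOC boxes are given as pixel corners `(x_min, y_min, x_max, y_max)`, COCO boxes as `(x, y, width, height)` with a top-left origin, and YOLO labels as a centre plus size normalised by the image dimensions. Any one-pixel offset convention in the source annotations is ignored.

```python
from typing import Tuple

Box4 = Tuple[float, float, float, float]


def voc_to_yolo(box: Box4, image_size: Tuple[int, int]) -> Box4:
    """Convert (x_min, y_min, x_max, y_max) pixels to normalised (cx, cy, w, h)."""
    x_min, y_min, x_max, y_max = box
    img_w, img_h = image_size
    cx = (x_min + x_max) / 2.0 / img_w
    cy = (y_min + y_max) / 2.0 / img_h
    w = (x_max - x_min) / img_w
    h = (y_max - y_min) / img_h
    return cx, cy, w, h


def coco_to_yolo(box: Box4, image_size: Tuple[int, int]) -> Box4:
    """Convert (x, y, width, height) pixels to normalised (cx, cy, w, h)."""
    x, y, w, h = box
    # Re-express the COCO box as corners, then reuse the VOC conversion.
    return voc_to_yolo((x, y, x + w, y + h), image_size)


# The same box in a 640x480 image, written in both source conventions.
from_voc = voc_to_yolo((160, 120, 480, 360), (640, 480))
from_coco = coco_to_yolo((160, 120, 320, 240), (640, 480))
print(from_voc, from_coco)
```

In a YOLO label file each converted box would be written on its own line, preceded by the integer class index.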

CV & Robotics Paper List (3D object detection & 6D pose estimation) - [Link]

PapersWithCode: Browse > Computer Vision > Object Detection - [Link]

Object Detection: Two-stage vs One-stage Detectors - [Link]

Object Detection: mAP & IoU - [Link]
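For readers new to the metric, IoU (intersection over union) for two axis-aligned boxes can be computed as in the sketch below. The function name `iou` and the `(x_min, y_min, x_max, y_max)` box layout are assumptions made for illustration. mAP builds on this value by treating a detection as correct when its IoU with a ground-truth box reaches a chosen threshold.

```python
from typing import Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def iou(box_a: Box, box_b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap, clamped to zero when the boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Two 10x10 boxes overlapping in a 5x5 square: 25 / (100 + 100 - 25).
overlap = iou((0, 0, 10, 10), (5, 5, 15, 15))
print(round(overlap, 4))
```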

Owner

Bobby Chen
