Libtorch yolov3 deepsort

Last update: Dec 13, 2022

Overview

It is for my undergrad thesis in Tsinghua University.

There are four modules in the project:

Detection: YOLOv3
Tracking: SORT and DeepSORT
Processing: Run detection and tracking, then display and save the results (a compressed video, a few snapshots for each target)
GUI: Display the results

YOLOv3

A Libtorch implementation of the YOLO v3 object detection algorithm, written with modern C++.

The code is based on the walktree.

The config file in .\models can be found at Darknet.

SORT

I also merged SORT to do tracking.

A similar software in Python is here, which also rewrite form the most starred version and SORT

DeepSORT

Recently I reimplement DeepSORT which employs another CNN for re-id. It seems it gives better result but also slows the program a bit. Also, a PyTorch version is available at ZQPei, thanks!

Performance

Currently on a GTX 1060 6G it consumes about 1G RAM and have 37 FPS.

The video I test is TownCentreXVID.avi.

GUI

With wxWidgets, I developed the GUI module for visualization of results.

Previously I used Dear ImGui. However, I do not think it suits my purpose.

Pre-trained network

This project uses pre-trained network weights from others

How to build

This project requires LibTorch, OpenCV, wxWidgets and CMake to build.

LibTorch can be easily integrated with CMake, but there are a lot of strange things...

On Ubuntu 16.04, I use apt install to install the others. Everything is fine. On Windows 10 + Visual Studio 2017, I use the latest stable version of the others from their official websites.

Snapshots

Here are some intermediate output from detection and tracking module:

Here is the snapshot of processing module:

Here is the snapshot of GUI module:

Libtorch yolov3 deepsort

Related tags

Overview

Overview

YOLOv3

SORT

DeepSORT

Performance

GUI

Pre-trained network

How to build

Snapshots

Owner

Xu Wei

Python版OpenCVのTracking APIのサンプルです。DaSiamRPNアルゴリズムまで対応しています。

Code for EMNLP'21 paper "Types of Out-of-Distribution Texts and How to Detect Them"

An implementation of the BADGE batch active learning algorithm.

Point cloud processing tool library.

Evaluating AlexNet features at various depths

Sub-tomogram-Detection - Deep learning based model for Cyro ET Sub-tomogram-Detection

OCR Post Correction for Endangered Language Texts

MassiveSumm: a very large-scale, very multilingual, news summarisation dataset

Reproducible research and reusable acyclic workflows in Python. Execute code on HPC systems as if you executed them on your personal computer!

PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.

Rational Activation Functions - Replacing Padé Activation Units

Framework that uses artificial intelligence applied to mathematical models to make predictions

This is the dataset for testing the robustness of various VO/VIO methods

Veri Setinizi Yolov5 Formatına Dönüştürün

A simple API wrapper for Discord interactions.

Fully Convolutional DenseNet (A.K.A 100 layer tiramisu) for semantic segmentation of images implemented in TensorFlow.

CHERRY is a python library for predicting the interactions between viral and prokaryotic genomes

When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset of 53,000+ Legal Holdings

Auto-updating data to assist in investment to NEPSE

Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)