YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

Last update: Jan 01, 2023

Related tags

Deep Learning yoltv5

Overview

YOLTv5

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks.

YOLTv5 builds upon YOLT and SIMRDWN, and updates these frameworks to use the YOLOv5 version of the YOLO object detection family. This repository has generally similar performance to the Darknet-based YOLTv4 repository. For those users who prefer a PyTorch backend, however, we provide YOLTv5.

Below, we provide examples of how to use this repository with the open-source SpaceNet dataset.

Running YOLTv5

0. Installation (Preliminary)

YOLTv5 is built to execute on a GPU-enabled machine.

cd yoltv5/yolov5
pip install -r requirements.txt 

# update with geo packages
conda install -c conda-forge gdal
conda install -c conda-forge osmnx=0.12 
conda install  -c conda-forge scikit-image
conda install  -c conda-forge statsmodels
pip install torchsummary
pip install utm
pip install numba
pip install jinja2==2.10

1. Train

Training preparation is accomplished via prep_train.py. To train a model, run:

cd /yoltv5
python yolov5/train.py --img 640 --batch 16 --epochs 100 --data yoltv5_train_vehicles_8cat.yaml --weights yolov5l.pt

2. Test

Simply edit yoltv5_test_vehicles_8cat.yaml to point to the appropriate locations, then run the test.sh script:

cd yoltv5
./test.sh ../configs/yoltv5_test_vehicles_8cat.yaml

Outputs will look something like the figure below:

YOLTv5 rapidly detects objects in arbitrarily large aerial or satellite images that far exceed the ~600×600 pixel size typically ingested by deep learning object detection frameworks

Related tags

Overview

YOLTv5

Running YOLTv5

0. Installation (Preliminary)

1. Train

2. Test

Owner

Adam Van Etten

An MQA (Studio, originalSampleRate) identifier for lossless flac files written in Python.

image scene graph generation benchmark

[ICCV 2021] Learning A Single Network for Scale-Arbitrary Super-Resolution

Rule Based Classification Project

codes for IKM (arXiv2021, Submitted to IEEE Trans)

PySOT - SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.

Automatic number plate recognition using tech: Yolo, OCR, Scene text detection, scene text recognation, flask, torch

UNAVOIDS: Unsupervised and Nonparametric Approach for Visualizing Outliers and Invariant Detection Scoring

Model Quantization Benchmark

Code to reproduce the experiments in the paper "Transformer Based Multi-Source Domain Adaptation" (EMNLP 2020)

Voice Conversion by CycleGAN (语音克隆/语音转换)：CycleGAN-VC3

ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm

This repository collects project-relevant Isabelle/HOL formalizations.

Implementation of Graph Transformer in Pytorch, for potential use in replicating Alphafold2

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

Utilizes Pose Estimation to offer sprinters cues based on an image of their running form.

An end-to-end library for editing and rendering motion of 3D characters with deep learning [SIGGRAPH 2020]

Least Square Calibration for Peer Reviews

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

PyTorch for Semantic Segmentation