YOLOv5 in DOTA with CSL_label (Oriented Object Detection) (Rotation Detection) (Rotated BBox)

Last update: Dec 30, 2022

Overview

YOLOv5_DOTA_OBB

YOLOv5 on the DOTA_OBB dataset with CSL_label (Oriented Object Detection).

Datasets and pretrained checkpoint

Datasets: DOTA
Pretrained checkpoints and demo files:
- train,detect_and_evaluate_demo_files.(6666)
- yolov5x.pt.(6666)
- yolov5l.pt.(6666)
- yolov5m.pt.(6666)
- yolov5s.pt.(6666)
- YOLOv5_DOTAv1.5_OBB.pt.(6666)

Function

train.py: Train the model.
detect.py: Run detection, visualize the results, and write them to txt files.
evaluation.py: Merge the detection results, visualize the merged results, and evaluate the detector.

Installation (Linux recommended, Windows not recommended)

1. Python 3.8 or later with all requirements.txt dependencies installed, including torch>=1.7. To install run:

$   pip install -r requirements.txt

2. Install swig

$   cd  \.....\yolov5_DOTA_OBB\utils
$   sudo apt-get install swig

3. Build the C++ extension for Python

$   swig -c++ -python polyiou.i
$   python setup.py build_ext --inplace
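
A quick way to confirm that the build produced an importable module is to look it up from Python. This is a minimal sketch, assuming the extension is exposed under the module name polyiou (inferred from polyiou.i, not confirmed by this README) and that the check is run from the utils directory. The helper extension_available is hypothetical and not part of the repository.

```python
import importlib.util


def extension_available(module_name: str = "polyiou") -> bool:
    """Return True if `module_name` can be located on the current import path."""
    # "polyiou" is an assumed module name, inferred from the polyiou.i file.
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    if extension_available():
        print("polyiou found")
    else:
        print("polyiou not found; re-run the swig and build_ext steps")
```

If the module is not found, re-running the two commands above from the utils directory is the first thing to try.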

More detailed explanation

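For implementation details and the underlying principles, see my Zhihu article:
YOLOv5_DOTAv1.5 (rotated object detection in remote sensing, a complete record of the pitfalls).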

Usage Example

1. 'Get Dataset'

Split the DOTA_OBB images and labels, then convert the DOTA format to the YOLO longside format.
You can refer to hukaixuan19970627/DOTA_devkit_YOLO.
The oriented YOLO longside format is:

$  classid    x_c   y_c   longside   shortside    Θ    Θ∈[0, 180)


* longside: The longer side of the oriented rectangle.

* shortside: The other side of the oriented rectangle.

* Θ: The angle between the longside and the x-axis, measured by rotating the x-axis clockwise until it first meets the longside.
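
To make the format concrete, the sketch below parses one label line and recovers the four corners of the oriented rectangle. It is an illustration under stated assumptions, not the repository's own code: the function names are hypothetical, coordinates are assumed to share one unit on both axes, and Θ is interpreted in image coordinates (y pointing down), where a clockwise rotation of the x-axis by Θ gives the direction (cos Θ, sin Θ).

```python
import math
from typing import List, Tuple


def parse_longside_label(line: str) -> Tuple[int, float, float, float, float, float]:
    """Parse 'classid x_c y_c longside shortside theta' and validate the ranges."""
    fields = line.split()
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}")
    class_id = int(fields[0])
    x_c, y_c, longside, shortside, theta = map(float, fields[1:])
    if not 0 <= theta < 180:
        raise ValueError("theta must lie in [0, 180)")
    if longside < shortside:
        raise ValueError("longside must not be shorter than shortside")
    return class_id, x_c, y_c, longside, shortside, theta


def longside_to_corners(x_c: float, y_c: float, longside: float,
                        shortside: float, theta_deg: float) -> List[Tuple[float, float]]:
    """Return the 4 corners, assuming image coordinates with y pointing down."""
    t = math.radians(theta_deg)
    ux, uy = math.cos(t), math.sin(t)  # unit vector along the long side (assumed convention)
    vx, vy = -uy, ux                   # unit vector along the short side
    hl, hs = longside / 2.0, shortside / 2.0
    return [
        (x_c + hl * ux + hs * vx, y_c + hl * uy + hs * vy),
        (x_c + hl * ux - hs * vx, y_c + hl * uy - hs * vy),
        (x_c - hl * ux - hs * vx, y_c - hl * uy - hs * vy),
        (x_c - hl * ux + hs * vx, y_c - hl * uy + hs * vy),
    ]
```

For example, the line "3 10 10 4 2 0" describes an axis-aligned box of width 4 and height 2 centered at (10, 10), so the corners come out as (12, 11), (12, 9), (8, 9), and (8, 11). If the coordinates are normalized, this geometry only holds when both axes are normalized by the same length.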

WARNING: IMAGES MUST BE SQUARE (HEIGHT = WIDTH)

2. 'train.py'

Usage is the same as ultralytics/yolov5. It is best to train on the demo files first before training on your custom dataset.
Single GPU training:

$ python train.py  --batch-size 4 --device 0

Multi-GPU training (DistributedDataParallel mode):

$ python -m torch.distributed.launch --nproc_per_node 4 train.py --sync-bn --device 0,1,2,3

3. 'detect.py'

Download the demo files.
Then run the demo to visualize the detection results and get the result txt files.

$  python detect.py

4. 'evaluation.py'

Run the detect.py demo first. Then change the paths to your own:

evaluation(
        detoutput=r'/....../DOTA_demo_view/detection',
        imageset=r'/....../DOTA_demo_view/row_images',
        annopath=r'/....../DOTA_demo_view/row_DOTA_labels/{:s}.txt'
)
draw_DOTA_image(
        imgsrcpath=r'/...../DOTA_demo_view/row_images',
        imglabelspath=r'/....../DOTA_demo_view/detection/result_txt/result_merged',
        dstpath=r'/....../DOTA_demo_view/detection/merged_drawed'
)

Run the evaluation.py demo to get the evaluation results and visualize the merged detection results.

$  python evaluation.py
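
Since all six paths hang off the same demo directory, they can be derived from a single root rather than edited one by one. The following is a minimal sketch, assuming the directory layout shown above; demo_eval_paths and the example root /data/DOTA_demo_view are hypothetical, and the assumption that the {:s} placeholder is later filled with an image name comes from the pattern of the path, not from this README.

```python
from pathlib import PurePosixPath


def demo_eval_paths(demo_root: str) -> dict:
    """Build the keyword arguments for both calls from one demo root directory."""
    root = PurePosixPath(demo_root)
    detection = root / "detection"
    return {
        "evaluation": {
            "detoutput": str(detection),
            "imageset": str(root / "row_images"),
            # "{:s}" is kept as a literal placeholder for the image name.
            "annopath": str(root / "row_DOTA_labels" / "{:s}.txt"),
        },
        "draw_DOTA_image": {
            "imgsrcpath": str(root / "row_images"),
            "imglabelspath": str(detection / "result_txt" / "result_merged"),
            "dstpath": str(detection / "merged_drawed"),
        },
    }


if __name__ == "__main__":
    paths = demo_eval_paths("/data/DOTA_demo_view")  # hypothetical location
    for call_name, kwargs in paths.items():
        print(call_name, kwargs)
```

Inside evaluation.py the two dictionaries could then be unpacked as evaluation(**paths["evaluation"]) and draw_DOTA_image(**paths["draw_DOTA_image"]), assuming both functions accept exactly the keyword names shown above.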

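Feedback

If you run into any problems while using this project, feel free to let me know through the following channels:

Zhihu (@略略略)
For code problems, please open an issue; for other questions, please contact me on Zhihu.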

Acknowledgements

Thanks to the following projects, listed in no particular order.

About the author

  Name  : "胡凯旋"
  About me: "Just a salted fish (a slacker)"

