CvT-ASSD: Convolutional vision-Transformer Based Attentive Single Shot MultiBox Detector (ICTAI 2021, a CCF-C conference: The 33rd IEEE International Conference on Tools with Artificial Intelligence)

Last update: Mar 07, 2022

Related tags

Deep Learning CvT-ASSD

Overview

CvT-ASSD

The repository also includes the CvT, CvT-SSD, and VGG-ASSD models.

original-code-website:

https://github.com/albert-jin/CvT-SSD

new-code-website:

https://github.com/albert-jin/CvT-ASSD

In response to the call for open source, this project was officially open-sourced on 2021-7-12...

project architecture:

Mentions

You will probably need an Anaconda environment that contains all of the following packages:
- pytorch 1.9.0 py3.7_cuda10.2_cudnn7_0 pytorch
- cudatoolkit 10.2.89 h74a9793_1
- opencv-python 4.5.2.54 pypi_0 pypi
- visdom 0.1.8.9 pypi_0 pypi
- yacs 0.1.8 pypi_0 pypi
- jupyter 1.0.0 pypi_0 pypi
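To confirm that an environment matches the list above, a small version check can help. This is only a sketch: the function name `check_environment` and the `EXPECTED` table are illustrative and not part of the repository, and it assumes the conda `pytorch` package is visible to Python under the distribution name `torch`. `importlib.metadata` requires Python 3.8 or later; on Python 3.7 the `importlib_metadata` backport offers the same calls.

```python
from importlib import metadata  # Python 3.8+; on 3.7 use the importlib_metadata backport

# Versions taken from the package list above (names are pip distribution names).
EXPECTED = {
    "torch": "1.9.0",
    "opencv-python": "4.5.2.54",
    "visdom": "0.1.8.9",
    "yacs": "0.1.8",
    "jupyter": "1.0.0",
}


def check_environment(expected=EXPECTED):
    """Return {name: (wanted, found, ok)} for every expected package."""
    report = {}
    for name, wanted in expected.items():
        try:
            found = metadata.version(name)
        except metadata.PackageNotFoundError:
            found = None
        # startswith tolerates local build suffixes such as "1.9.0+cu102".
        ok = found is not None and found.startswith(wanted)
        report[name] = (wanted, found, ok)
    return report


if __name__ == "__main__":
    for name, (wanted, found, ok) in check_environment().items():
        status = "OK" if ok else "MISMATCH"
        print(f"{name}: wanted {wanted}, found {found} [{status}]")
```

A mismatch does not necessarily mean the code will fail; it only flags a difference from the versions listed here.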
For training, an NVIDIA GPU is strongly recommended for speed. We used two NVIDIA GTX 1080 Ti cards, but we recommend GPUs with more memory, such as the Tesla V100 or RTX 3090.
Before running the code for self-study or to reproduce the results of the "CvT-ASSD" paper, mark the CvT_SSD/model/ directory as a Sources Root so that it is on the import path, because much of the code refers to modules inside the model directory.
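If you run the scripts outside an IDE, the same effect as marking a Sources Root can be approximated by putting the directory on `sys.path` before importing project modules. The helper below is a minimal sketch; `add_source_root` is a hypothetical name, and the `model` subdirectory is assumed to sit directly under the repository root you pass in.

```python
import sys
from pathlib import Path


def add_source_root(repo_root, subdir="model"):
    """Put <repo_root>/<subdir> at the front of sys.path (once) and return it."""
    path = str(Path(repo_root).resolve() / subdir)
    if path not in sys.path:
        sys.path.insert(0, path)
    return path


if __name__ == "__main__":
    # Assumes the script is started from the CvT_SSD checkout.
    print("import path extended with:", add_source_root("."))
```

Setting the `PYTHONPATH` environment variable to the same directory is an equivalent alternative.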
Download the PyTorch parameter files (with the ".pth" suffix) and move them into models/CvT/weights, as shown in 项目结构.PNG (project structure).
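Object detection benchmarks (following the original SSD paper) generally use the VOC2007 test data as the test set. The training set can be one of the following combinations:
- 07: VOC2007 trainval (training and validation sets)
- 07+12: VOC2007 trainval + VOC2012 trainval (training and validation sets)
- 07+12+COCO: pre-trained on COCO trainval35k, then fine-tuned on 07+12
The evaluation metric mAP uses the VOC07MApMetric provided by mxnet: the recall range is divided into 10 equal intervals (11 recall levels from 0 to 1.0), the precision values at those levels are averaged, and the result is then averaged over classes. For details, see https://blog.csdn.net/u014203453/article/details/77598997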
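The averaging described above can be sketched in a few lines of plain Python. This is an illustration of the 11-point VOC07 scheme, not the mxnet implementation itself; the function names and the input layout (per-class recall and precision lists, computed from detections sorted by descending score) are assumptions made for the example.

```python
def voc07_average_precision(recall, precision):
    """11-point interpolated AP for one class.

    recall and precision are parallel lists of cumulative values.
    """
    ap = 0.0
    for i in range(11):
        threshold = i / 10.0  # recall levels 0.0, 0.1, ..., 1.0
        # Best precision among points whose recall reaches the threshold.
        candidates = [p for r, p in zip(recall, precision) if r >= threshold]
        ap += max(candidates) if candidates else 0.0
    return ap / 11.0


def mean_average_precision(per_class_curves):
    """Average the per-class AP; per_class_curves maps class -> (recall, precision)."""
    aps = [voc07_average_precision(r, p) for r, p in per_class_curves.values()]
    return sum(aps) / len(aps)


if __name__ == "__main__":
    curves = {
        "cat": ([0.5, 1.0], [1.0, 1.0]),  # every object found, no false positives
        "dog": ([0.5], [1.0]),            # only half of the objects found
    }
    print(mean_average_precision(curves))
```

In the toy data, the "dog" class never reaches recall above 0.5, so the five recall levels from 0.6 to 1.0 contribute zero precision to its AP.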


Owner

金伟强, a humble AI beginner at Shanghai University
