Reproduces ResNet-V3 with PyTorch

Last update: Dec 23, 2022

Overview

ResNeXt.pytorch

Reproduces ResNet-V3, i.e. ResNeXt (Aggregated Residual Transformations for Deep Neural Networks), in PyTorch.

Download

git clone https://github.com/prlz77/resnext.pytorch
cd resnext.pytorch
# git checkout R4.0 or R3.0 for backwards compatibility (not recommended).

Usage

To train on CIFAR-10 using 2 GPUs:

python train.py ~/DATASETS/cifar.python cifar10 -s ./snapshots --log ./logs --ngpu 2 --learning_rate 0.05 -b 128

It should reach an error rate of ~3.65% on CIFAR-10 and ~17.77% on CIFAR-100.

After training, you can evaluate the saved model with the test script (thanks to @AppleHolic).

To test on CIFAR-10 using 2 GPUs:

python test.py ~/DATASETS/cifar.python cifar10 --ngpu 2 --load ./snapshots/model.pytorch --test_bs 128

Configurations

From the original paper:

cardinality	base_width	parameters	error CIFAR-10 (%)	error CIFAR-100 (%)	default
8	64	34.4M	3.65	17.77	x
16	64	68.1M	3.58	17.31	

Update: widen_factor has been disentangled from base_width because the coupling was confusing. widen_factor is now fixed at 4, and base_width is the same as in the original paper.
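
As a rough sanity check of the parameter column above, the weights can be counted by hand. The sketch below is not code from this repository: `resnext29_weight_count` is a hypothetical helper that assumes the CIFAR ResNeXt-29 layout from the paper (a 3x3 stem with 64 channels, three stages of three bottleneck blocks, and a 1x1 projection shortcut on the first block of each stage). It counts only convolution and classifier weights and ignores batch-norm parameters.

```python
def resnext29_weight_count(cardinality, base_width, widen_factor=4,
                           num_classes=10, blocks_per_stage=3):
    """Approximate weight count for an assumed ResNeXt-29 CIFAR layout.

    Hypothetical helper for illustration; batch-norm parameters are ignored.
    """
    total = 3 * 64 * 3 * 3  # stem: 3x3 conv, 3 -> 64 channels
    in_ch = 64
    for stage in range(3):
        # Block output channels: 256, 512, 1024 when widen_factor is 4.
        out_ch = 64 * widen_factor * 2 ** stage
        # Channels of the grouped 3x3 conv: cardinality groups of
        # base_width channels, doubling at every stage.
        width = cardinality * base_width * 2 ** stage
        for _ in range(blocks_per_stage):
            total += in_ch * width                     # 1x1 reduce
            total += 9 * width * width // cardinality  # 3x3 grouped conv
            total += width * out_ch                    # 1x1 expand
            if in_ch != out_ch:
                total += in_ch * out_ch                # 1x1 projection shortcut
            in_ch = out_ch
    total += in_ch * num_classes + num_classes         # final linear layer
    return total


for cardinality in (8, 16):
    n = resnext29_weight_count(cardinality, base_width=64)
    print(f"cardinality={cardinality}, base_width=64: {n / 1e6:.1f}M weights")
```

Under these assumptions the counts land within about 0.1M of the 34.4M and 68.1M listed in the table. Because the grouped 3x3 convolution costs `9 * width * width / cardinality` weights, doubling the cardinality at a fixed base_width roughly doubles the model size.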

Trained models and curves

Link to trained models corresponding to the following curves:

Update: several commits have been pushed since the models on Mega were trained, so it is recommended to revert to commit e10c37d8cf7a958048bc0f58cd86c3e8ac4e707d when using them.

Other frameworks

Torch (@facebookresearch). Original implementation; CIFAR and ImageNet
Caffe (@terrychenism). ImageNet
MXNet (@dmlc). ImageNet

Cite

@article{xie2016aggregated,
  title={Aggregated residual transformations for deep neural networks},
  author={Xie, Saining and Girshick, Ross and Doll{\'a}r, Piotr and Tu, Zhuowen and He, Kaiming},
  journal={arXiv preprint arXiv:1611.05431},
  year={2016}
}

Owner

Pau Rodriguez
