Over9000 optimizer

Last update: Nov 27, 2022

Related tags

Overview

Optimizers and tests

Every result is avg of 20 runs.

Dataset	LR Schedule	Imagenette size 128, 5 epoch	Imagewoof size 128, 5 epoch
Adam - baseline	OneCycle	0.8493	0.6125
RangerLars (RAdam + LARS + Lookahead)	Flat and anneal	0.8732	0.6523
Ralamb (RAdam + LARS)	Flat and anneal	0.8675	0.6367
Ranger (RAdam + Lookahead)	Flat and anneal	0.8594	0.5946
Novograd	Flat and anneal	0.8711	0.6126
Radam	Flat and anneal	0.8444	0.537
Lookahead	OneCycle	0.8578	0.6106
Lamb	OneCycle	0.8400	0.5597
DiffGrad	OneCycle	0.8527	0.5912
AdaMod	OneCycle	0.8473	0.6132

Owner

Mikhail Grankin

GitHub Repository

Deep Learning tutorials in jupyter notebooks.

DeepSchool.io Sign up here for Udemy Course on Machine Learning (Use code DEEPSCHOOL-MARCH to get 85% off course). Goals Make Deep Learning easier (mi

1.8k Dec 28, 2022

Resources complimenting the Machine Learning Course led in the Faculty of mathematics and informatics part of Sofia University.

Machine Learning and Data Mining, Summer 2021-2022 How to learn data science and machine learning? Programming. Learn Python. Basic Statistics. Take a

8 Oct 04, 2022

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation Zhaoyun Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li

25 Dec 16, 2022

Code of PVTv2 is released! PVTv2 largely improves PVTv1 and works better than Swin Transformer with ImageNet-1K pre-training.

Updates (2020/06/21) Code of PVTv2 is released! PVTv2 largely improves PVTv1 and works better than Swin Transformer with ImageNet-1K pre-training. Pyr

1.3k Jan 04, 2023

MT3: Multi-Task Multitrack Music Transcription

MT3: Multi-Task Multitrack Music Transcription MT3 is a multi-instrument automatic music transcription model that uses the T5X framework. This is not

867 Dec 29, 2022

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Multilingual Unsupervised Sentence Simplification Code and pretrained models to reproduce experiments in "MUSS: Multilingual Unsupervised Sentence Sim

81 Dec 29, 2022

[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

DAB-DETR This is the official pytorch implementation of our ICLR 2022 paper DAB-DETR. Authors: Shilong Liu, Feng Li, Hao Zhang, Xiao Yang, Xianbiao Qi

336 Dec 25, 2022

FIRM-AFL is the first high-throughput greybox fuzzer for IoT firmware.

FIRM-AFL FIRM-AFL is the first high-throughput greybox fuzzer for IoT firmware. FIRM-AFL addresses two fundamental problems in IoT fuzzing. First, it

356 Dec 23, 2022

Brax is a differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators

Brax is a differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators. It's also a suite of learning algorithms to train agents to operate in these enviro

1.5k Jan 02, 2023

A python software that can help blind people find things like laptops, phones, etc the same way a guide dog guides a blind person in finding his way.

GuidEye A python software that can help blind people find things like laptops, phones, etc the same way a guide dog guides a blind person in finding h

0 Aug 09, 2022

Pytorch code for our paper "Feedback Network for Image Super-Resolution" (CVPR2019)

Feedback Network for Image Super-Resolution [arXiv] [CVF] [Poster] Update: Our proposed Gated Multiple Feedback Network (GMFN) will appear in BMVC2019

539 Jan 06, 2023

An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics.

Sketch Simulator An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics. See

12 Dec 18, 2022

Over9000 optimizer

Related tags

Overview

Optimizers and tests

Owner

Mikhail Grankin

Deep Learning tutorials in jupyter notebooks.

Resources complimenting the Machine Learning Course led in the Faculty of mathematics and informatics part of Sofia University.

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Code of PVTv2 is released! PVTv2 largely improves PVTv1 and works better than Swin Transformer with ImageNet-1K pre-training.

MT3: Multi-Task Multitrack Music Transcription

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

FIRM-AFL is the first high-throughput greybox fuzzer for IoT firmware.

Brax is a differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators

A python software that can help blind people find things like laptops, phones, etc the same way a guide dog guides a blind person in finding his way.

Pytorch code for our paper "Feedback Network for Image Super-Resolution" (CVPR2019)

An architecture that makes any doodle realistic, in any specified style, using VQGAN, CLIP and some basic embedding arithmetics.

Conversational text Analysis using various NLP techniques

Implementation of Monocular Direct Sparse Localization in a Prior 3D Surfel Map (DSL)

CVPR '21: In the light of feature distributions: Moment matching for Neural Style Transfer

Build a small, 3 domain internet using Github pages and Wikipedia and construct a crawler to crawl, render, and index.

Official repository of Semantic Image Matting

Unofficial PyTorch implementation of Guided Dropout

GUI for TOAD-GAN, a PCG-ML algorithm for Token-based Super Mario Bros. Levels.

This project uses Template Matching technique for object detecting by detection of template image over base image.