Optimizers-visualized - Visualization of different optimizers on local minimas and saddle points.

Last update: Jan 01, 2022

Overview

Optimizers Visualized

Visualization of how different optimizers handle mathematical functions for optimization.

Installation
Usage
Functions for optimization
Visualization of optimizers
Links
TODO

Installation of libraries

pip install -r requirements.txt

NOTE: The optimizers used in this project are the pre-written ones in the pytorch module.

Usage

python main.py

The project is designed to be interactive, making it easy for the user to change any default values simply using stdin.

Functions for optimization

Matyas' Function

This is a relatively simple function for optimization.

Source: https://en.wikipedia.org/wiki/File:Matyas_function.pdf

Himmelblau's Function

A complex function, with multiple global minimas.

Source: https://en.wikipedia.org/wiki/File:Himmelblau_function.svg

Visualization of optimizers

All optimizers were given 100 iterations to find the global minima, from a same starting point. Learning rate was set to 0.1 for all instances, except when using SGD for minimizing Himmelblau's function.

Stochastic Gradient Descent

The vanilla stochastic gradient descent optimizer, with no additional functionalities:

theta_t = theta_t - lr * gradient

SGD on Matyas' function

We can see that SGD takes an almost direct path downwards, and then heads towards the global minima.

SGD on Himmelblau's function

SGD on Himmelblau's function fails to converge even when the learning rate is reduced from 0.1 to 0.03.

It only converges when the learning rate is further lowered to 0.01, still overshooting during the early iterations.

Root Mean Square Propagation

RMSProp with the default hyperparameters, except the learning rate.

RMSProp on Matyas' function

RMSProp first reaches a global minima in one dimension, and then switches to minimizing another dimension. This can be hurtful if there are saddle points in the function which is to be minimized.

RMSProp on Himmelblau's function

By trying to minimize one dimension first, RMSProp overshoots and has to return back to the proper path. It then minimizes the next dimension.

Adaptive Moment Estimation

Adam optimizer with the default hyperparameters, except the learning rate.

Adam on Matyas' function

Due to the momentum factor and the exponentially weighted average factor, Adam shoots past the minimal point, and returns back.

Adam on Himmelblau's function

Adam slides around the curves, again mostly due to the momentum factor.

Todos

Add more optimizers
Add more complex functions
Test out optimizers in saddle points

Optimizers-visualized - Visualization of different optimizers on local minimas and saddle points.

Related tags

Overview

Optimizers Visualized

Contents

Installation of libraries

Usage

Functions for optimization

Matyas' Function

Himmelblau's Function

Visualization of optimizers

Stochastic Gradient Descent

SGD on Matyas' function

SGD on Himmelblau's function

Root Mean Square Propagation

RMSProp on Matyas' function

RMSProp on Himmelblau's function

Adaptive Moment Estimation

Adam on Matyas' function

Adam on Himmelblau's function

Links

Todos

Owner

Gautam J

Simple PyTorch hierarchical models.

This is a deep learning-based method to segment deep brain structures and a brain mask from T1 weighted MRI.

Company clustering with K-means/GMM and visualization with PCA, t-SNE, using SSAN relation extraction

This repo is a PyTorch implementation for Paper "Unsupervised Learning for Cuboid Shape Abstraction via Joint Segmentation from Point Clouds"

[NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.

AI assistant built in python.the features are it can display time,say weather,open-google,youtube,instagram.

Predictive AI layer for existing databases.

Replication attempt for the Protein Folding Model

Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"

Prototype python implementation of the ome-ngff table spec

Code for reproducing experiments in "Improved Training of Wasserstein GANs"

Extracts essential Mediapipe face landmarks and arranges them in a sequenced order.

Code for the Paper: Alexandra Lindt and Emiel Hoogeboom.

Pmapper is a super-resolution and deconvolution toolkit for python 3.6+

Impelmentation for paper Feature Generation and Hypothesis Verification for Reliable Face Anti-Spoofing

Code for the ECCV2020 paper "A Differentiable Recurrent Surface for Asynchronous Event-Based Data"

This is code of book "Learn Deep Learning with PyTorch"

Lyapunov-guided Deep Reinforcement Learning for Stable Online Computation Offloading in Mobile-Edge Computing Networks

RoboDesk A Multi-Task Reinforcement Learning Benchmark

yolov5 deepsort 行人 车辆 跟踪 检测 计数

yolov5 deepsort 行人车辆跟踪检测计数