Experiments for distributed optimization algorithms

Last update: Dec 04, 2022

Overview

Network-Distributed Algorithm Experiments

This repository contains a set of optimization algorithms and objective functions, and all code needed to reproduce experiments in:

"DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization" [PDF]. (code is in this file [link])
"Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction" [PDF]. (code is in the previous version of this repo [link])

Due to the random data generation procedure, resulting graphs may be slightly different from those appeared in the paper, but conclusions remain the same.

If you find this code useful, please cite our papers:

@article{li2021destress,
  title={DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization},
  author={Li, Boyue and Li, Zhize and Chi, Yuejie},
  journal={arXiv preprint arXiv:2110.01165},
  year={2021}
}

@article{li2020communication,
  title={Communication-Efficient Distributed Optimization in Networks with Gradient Tracking and Variance Reduction},
  author={Li, Boyue and Cen, Shicong and Chen, Yuxin and Chi, Yuejie},
  journal={Journal of Machine Learning Research},
  volume={21},
  pages={1--51},
  year={2020}
}

Implemented objective functions

The gradient implementations of all objective functions are checked numerically.

Linear regression

Linear regression with random generated data. The objective function is $f(w) = \frac{1}{N} \sum_i (y_i - x_i^\top w)^2$

Logistic regression

Logistic regression with $l$-2 or nonconvex regularization with random generated data or the Gisette dataset or datasets from libsvmtools. The objective function is $$ f(w) = - \frac{1}{N} * \Big(\sum_i y_i \log \frac{1}{1 + exp(w^T x_i)} + (1 - y_i) \log \frac{exp(w^T x_i)}{1 + exp(w^T x_i)} \Big) + \frac{\lambda}{2} | w |_2^2 + \alpha \sum_j \frac{w_j^2}{1 + w_j^2} $$

One-hidden-layer fully-connected neural netowrk

One-hidden-layer fully-connected neural network with softmax loss on the MNIST dataset.

Implemented optimization algorithms

Centralized optimization algorithms

Gradient descent
Stochastic gradient descent
Nesterov's accelerated gradient descent
SVRG
SARAH

Distributed optimization algorithms (i.e. with parameter server)

ADMM
DANE

Decentralized optimization algorithms

Decentralized gradient descent
Decentralized stochastic gradient descent
Decentralized gradient descent with gradient tracking
EXTRA
NIDS
Network-DANE/SARAH/SVRG
GT-SARAH
DESTRESS

Experiments for distributed optimization algorithms

Related tags

Overview

Network-Distributed Algorithm Experiments

Implemented objective functions

Linear regression

Logistic regression

One-hidden-layer fully-connected neural netowrk

Implemented optimization algorithms

Centralized optimization algorithms

Distributed optimization algorithms (i.e. with parameter server)

Decentralized optimization algorithms

Owner

Boyue Li

A state-of-the-art semi-supervised method for image recognition

This program presents convolutional kernel density estimation, a method used to detect intercritical epilpetic spikes (IEDs)

Code release for Universal Domain Adaptation(CVPR 2019)

A TensorFlow 2.x implementation of Masked Autoencoders Are Scalable Vision Learners

A simple Tensorflow based library for deep and/or denoising AutoEncoder.

Grammar Induction using a Template Tree Approach

Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation

Global Pooling, More than Meets the Eye: Position Information is Encoded Channel-Wise in CNNs, ICCV 2021

PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and reinforcement learning

Official repo for our 3DV 2021 paper "Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized Refinements".

GluonMM is a library of transformer models for computer vision and multi-modality research

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Implementation for "Domain-Specific Bias Filtering for Single Labeled Domain Generalization"

Deep learning toolbox based on PyTorch for hyperspectral data classification.

This is a Pytorch implementation of paper: DropEdge: Towards Deep Graph Convolutional Networks on Node Classification

Implementation of Heterogeneous Graph Attention Network

Deep learning based hand gesture recognition using LSTM and MediaPipie.

DeepFashion2 is a comprehensive fashion dataset.

The code for MM2021 paper "Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning"

Learning Compatible Embeddings, ICCV 2021