Advantage Actor Critic (A2C): jax + flax implementation

Last update: Jan 23, 2022

Related tags

Overview

Advantage Actor Critic (A2C): jax + flax implementation

Current version supports only environments with continious action spaces and was tested on mujoco 1.50 environments.
Algorithm uses wandb logging.

A2C uses a diagonal gaussian policy with state-independent action distribution variance.

HalfCheetah-v3

Two runs with different seeds. Run with lower score (blue) arrived at a relatively rare local optimum.

Owner

Andrey

GitHub Repository

Adversarial-Information-Bottleneck - Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck (NeurIPS21)

NeurIPS 2021 Title: Distilling Robust and Non-Robust Features in Adversarial Exa

35 Dec 26, 2022

A small library for doing fluid simulation with neural networks.

Neural Fluid Fields This is a small library for doing fluid simulation with neural fields. Check out our review paper, Neural Fields in Visual Computi

23 Jun 23, 2022

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding, where the hidden data can be utilized for various management purposes, including hyper-linking, annotation

1 Nov 17, 2021

A set of examples around hub for creating and processing datasets

Examples for Hub - Dataset Format for AI A repository showcasing examples of using Hub Uploading Dataset Places365 Colab Tutorials Notebook Link Getti

11 Dec 14, 2022

This is the repository for CVPR2021 Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales

Intro This is the repository for CVPR2021 Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales Vehicle Sam

39 Jul 21, 2022

Unofficial keras(tensorflow) implementation of MAE model from Masked Autoencoders Are Scalable Vision Learners

MAE-keras Unofficial keras(tensorflow) implementation of MAE model described in 'Masked Autoencoders Are Scalable Vision Learners'. This work has been

11 Jun 12, 2022

LBBA-boosted WSOD

LBBA-boosted WSOD Summary Our code is based on ruotianluo/pytorch-faster-rcnn and WSCDN Sincerely thanks for your resources. Newer version of our code

20 Sep 19, 2022

PyTorch for Semantic Segmentation

PyTorch for Semantic Segmentation This repository contains some models for semantic segmentation and the pipeline of training and testing models, impl

1.7k Jan 06, 2023

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

Xcessiv Xcessiv is a tool to help you create the biggest, craziest, and most excessive stacked ensembles you can think of. Stacked ensembles are simpl

1.3k Nov 17, 2022

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

G2LTex This repository contains the implementation of "Texture Mapping for 3D Reconstruction with RGB-D Sensor (CVPR2018)" based on mvs-texturing. Due

129 Dec 30, 2022

Code for MSc Quantitative Finance Dissertation

MSc Dissertation Code ReadMe Sector Volatility Prediction Performance Using GARCH Models and Artificial Neural Networks Curtis Nybo MSc Quantitative F

2 Dec 01, 2022

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

SELF-ATTENTIVE VAD: CONTEXT-AWARE DETECTION OF VOICE FROM NOISE (ICASSP 2021) Pytorch implementation of SELF-ATTENTIVE VAD | Paper | Dataset Yong Rae

97 Dec 23, 2022

MutualGuide is a compact object detector specially designed for embedded devices

Introduction MutualGuide is a compact object detector specially designed for embedded devices. Comparing to existing detectors, this repo contains two

103 Dec 13, 2022

This tutorial aims to learn the basics of deep learning by hands, and master the basics through combination of lectures and exercises

2021-Deep-learning This tutorial aims to learn the basics of deep learning by hands, and master the basics through combination of paper and exercises.

108 Feb 24, 2022

Advantage Actor Critic (A2C): jax + flax implementation

Related tags

Overview

Advantage Actor Critic (A2C): jax + flax implementation

HalfCheetah-v3

Owner

Andrey

Adversarial-Information-Bottleneck - Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck (NeurIPS21)

A small library for doing fluid simulation with neural networks.

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

A set of examples around hub for creating and processing datasets

This is the repository for CVPR2021 Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales

Unofficial keras(tensorflow) implementation of MAE model from Masked Autoencoders Are Scalable Vision Learners

LBBA-boosted WSOD

PyTorch for Semantic Segmentation

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

Code for MSc Quantitative Finance Dissertation

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

MutualGuide is a compact object detector specially designed for embedded devices

This tutorial aims to learn the basics of deep learning by hands, and master the basics through combination of lectures and exercises

Visualizer for neural network, deep learning, and machine learning models

Rate-limit-semaphore - Semaphore implementation with rate limit restriction for async-style (any core)

PyTorch implementation for OCT-GAN Neural ODE-based Conditional Tabular GANs (WWW 2021)

Implementation of various Vision Transformers I found interesting

Age and Gender prediction using Keras

Imaging, analysis, and simulation software for radio interferometry