Admin Panels
Algorithms
Asset Management
Audio
Authentication
More Categories
Boilerplate Build Tools Caching CMS Code Analysis Code Refactoring Code review tool Command-line Interface Development Command-line Tools Communication Computer Vision Concurrency and Parallelism Configuration Cryptography Data Analysis Data Containers Data Serialization Data Structures Data Validation Data Visualization Database Database Drivers Date & Time Utilities Debugging Tools Deep Learning Deep Learning Model Explanation DevOps Tools Distributed Computing Distribution Django Documentation Downloader E-commerce Editor Plugins Email Environment Management FastAPI Projects FastAPI Utilities Feature Engineering File & Path Utilities Finance Flask Forms Functional Programming Game Development General Utilities Geolocation GPU Utilities GraphQL GUI Development Hardware HTML Manipulation HTTP Clients IDE Image Processing Implementations of Python Internationalization Interpreter Job Scheduler JSON Linters & Style Checkers Logging Machine Learning Markdown/YAML Microsoft Windows Miscellaneous Monitoring Network Virtualization Networking Office Files Processing Organization ORM Package Management Payment Processing PDF Files Processing Performance optimization Pipelines Process Utilities Productivity PyTorch Learning Resources Pytorch Utilities Recommender Systems Reinforcement Learning RESTful API RPC Servers Science SCM Search Security related resources Serialization Serverless Frameworks Sklearn Utilities Specific Formats Processing Static Site Generator Storage Task Queues Template Engine Testing Text Data & NLP Text Processing Third-party APIs Wrappers URL Manipulation Video Web Asset Management Web Content Extracting Web Crawling Web Frameworks WebSocket WSGI Servers
Popular Repo
Latest Repo
Resources
All Article News Book Tutorial

Overview
Comments 1
Releases

Reinforcement Learning Theory Book (rus)

Last update: Nov 27, 2022

Related tags

Deep Learning RL-Theory-book

Overview

Reinforcement Learning Theory Book (rus)

Full book on Arxiv: https://arxiv.org/abs/2201.09746

Ch. 1: Introduction
Ch. 2: Meta-heuristics
- NEAT, WANN
- CEM, OpenAI-ES, CMA-ES
Ch. 3: Classic theory
- Bellman equations
- RPI, policy improv. theorem
- Value Iteration, Generalized Policy Iteration
- Temporal Difference, Q-learning, SARSA
- Eligibility Traces, TD-lambda, Retrace
Ch. 4: Value-based
- DQN
- Double DQN, Dueling DQN, PER, Noisy DQN, Multi-step DQN
- c51, QR-DQN, IQN, Rainbow DQN
Ch. 5: Policy Gradient
- REINFORCE, A2C, GAE
- TRPO, PPO
Ch. 6: Continuous Control
- DDPG, TD3
- SAC
Ch. 7: Model-based
- Bandits
- MCTS, AlphaZero, MuZero
- LQR
Ch. 8: Next Stage
- Imitation Learning / Inverse Reinforcement Learning
- Intrinsic Motivation
- Multi-Task and Hindsight
- Hierarchical RL
- Partial observability
- Multi-Agent RL

Owner

qbrick

qbrick

GitHub Repository

Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples

Welcome to the cuQuantum repository! This public repository contains two sets of files related to the NVIDIA cuQuantum SDK: samples: All C/C++ sample

147 Dec 27, 2022

Informal Persian Universal Dependency Treebank

Informal Persian Universal Dependency Treebank (iPerUDT) Informal Persian Universal Dependency Treebank, consisting of 3000 sentences and 54,904 token

0 Jan 05, 2022

A minimalist implementation of score-based diffusion model

sdeflow-light This is a minimalist codebase for training score-based diffusion models (supporting MNIST and CIFAR-10) used in the following paper "A V

89 Dec 20, 2022

Source code for Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning

Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning Official implementation of ACC, described in the paper "Adaptively Calibrated C

3 Sep 16, 2022

Deep Learning Datasets Maker is a QGIS plugin to make datasets creation easier for raster and vector data.

Deep Learning Dataset Maker Deep Learning Datasets Maker is a QGIS plugin to make datasets creation easier for raster and vector data. How to use Down

25 Dec 15, 2022

Using OpenAI's CLIP to upscale and enhance images

CLIP Upscaler and Enhancer Using OpenAI's CLIP to upscale and enhance images Based on nshepperd's JAX CLIP Guided Diffusion v2.4 Sample Results Viewpo

5 Jun 14, 2022

MVP Benchmark for Multi-View Partial Point Cloud Completion and Registration

MVP Benchmark: Multi-View Partial Point Clouds for Completion and Registration [NEWS] 2021-07-12 [NEW 🎉 ] The submission on Codalab starts! 2021-07-1

93 Dec 21, 2022

CURL: Contrastive Unsupervised Representations for Reinforcement Learning

CURL Rainbow Status: Archive (code is provided as-is, no updates expected) This is an implementation of CURL: Contrastive Unsupervised Representations

46 Dec 12, 2022

Franka Emika Panda manipulator kinematics&dynamics simulation

pybullet_sim_panda Pybullet simulation environment for Franka Emika Panda Dependency pybullet, numpy, spatial_math_mini Simple example (please check s

0 Jan 20, 2022

A library for uncertainty quantification based on PyTorch

Torchuq [logo here] TorchUQ is an extensive library for uncertainty quantification (UQ) based on pytorch. TorchUQ currently supports 10 representation

96 Dec 12, 2022

Scale-aware Automatic Augmentation for Object Detection (CVPR 2021)

SA-AutoAug Scale-aware Automatic Augmentation for Object Detection Yukang Chen, Yanwei Li, Tao Kong, Lu Qi, Ruihang Chu, Lei Li, Jiaya Jia [Paper] [Bi

182 Dec 29, 2022

Feup-csr - Repository holding my group's submission to the CSR project competition

CSR Competições de Swarm Robotics Swarm Robotics Competitions This repository holds the files submitted for the CSR project competition. Project group

1 Jan 04, 2022

A code generator from ONNX to PyTorch code

onnx-pytorch Generating pytorch code from ONNX. Currently support onnx==1.9.0 and torch==1.8.1. Installation From PyPI pip install onnx-pytorch From

94 Jan 06, 2023

Automates Machine Learning Pipeline with Feature Engineering and Hyper-Parameters Tuning :rocket:

MLJAR Automated Machine Learning Documentation: https://supervised.mljar.com/ Source Code: https://github.com/mljar/mljar-supervised Table of Contents

2.4k Dec 31, 2022

Rainbow DQN implementation that outperforms the paper's results on 40% of games using 20x less data 🌈

Rainbow 🌈 An implementation of Rainbow DQN which outperforms the paper's (Hessel et al. 2017) results on 40% of tested games while using 20x less dat

31 Dec 21, 2022

Distributed Asynchronous Hyperparameter Optimization better than HyperOpt.

UltraOpt : Distributed Asynchronous Hyperparameter Optimization better than HyperOpt. UltraOpt is a simple and efficient library to minimize expensive

98 Aug 16, 2022

Library for converting from RGB / GrayScale image to base64 and back.

Library for converting RGB / Grayscale numpy images from to base64 and back. Installation pip install -U image_to_base_64 Conversion RGB to base 64 b

16 Aug 28, 2022

Official code for the ICLR 2021 paper Neural ODE Processes

Neural ODE Processes Official code for the paper Neural ODE Processes (ICLR 2021). Abstract Neural Ordinary Differential Equations (NODEs) use a neura

50 Oct 28, 2022

Tensorflow2.0 🍎🍊 is delicious, just eat it! 😋😋

How to eat TensorFlow2 in 30 days ? 🔥 🔥 Click here for Chinese Version（中文版）《10天吃掉那只pyspark》 🚀 github项目地址: https://github.com/lyhue1991/eat_pyspark

9.7k Jan 01, 2023

A crash course in six episodes for software developers who want to become machine learning practitioners.

Featured code sample tensorflow-planespotting Code from the Google Cloud NEXT 2018 session "Tensorflow, deep learning and modern convnets, without a P

2.6k Jan 08, 2023

2022.PythonRepo

About
Contact Us
DMCA
Disclaimer
Privacy Policy