[Reinforcement Learning Made Easy] 1 Introduction
2022-04-22 22:44:00 【Zhao-Jichao】
1.5 Building a reinforcement learning simulation environment
1.5.1 Installing gym and a simple demo
pip3 install gym
The simplest example
import gym  # Import the gym module
env = gym.make('CartPole-v0')  # Create a cart-pole (inverted pendulum on a cart) environment
env.reset()  # Initialize the environment
env.render()  # Render the current state of the environment and display it
These few steps give you a cart-pole (inverted pendulum on a cart) system.

1.5.2 In-depth analysis of how a gym environment is built
reset() function details
Initializes the environment.
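As a rough illustration of what such an initialization could look like, the sketch below draws a small random starting state and returns it as the first observation. It is plain Python and does not use gym; the class name `ToyResetEnv`, the four-value state layout, and the ±0.05 range are assumptions made for this example.

```python
import random


class ToyResetEnv:
    """Hypothetical environment that shows only the reset() logic (not gym's code)."""

    def __init__(self, seed=None):
        self.rng = random.Random(seed)
        self.state = None
        self.steps = 0

    def reset(self):
        # Assumed state layout: cart position, cart velocity,
        # pole angle, pole angular velocity.
        self.state = [self.rng.uniform(-0.05, 0.05) for _ in range(4)]
        # A new episode starts, so the step counter goes back to zero.
        self.steps = 0
        # Return a copy so callers cannot modify the internal state.
        return list(self.state)


env = ToyResetEnv(seed=0)
obs = env.reset()
print(obs)
```

Starting each episode from a slightly different state is a common design choice, because it keeps the agent from memorizing a single trajectory.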
render() function details
Acts as the graphics engine.
Strictly speaking, a reinforcement learning algorithm does not need render(). A graphics engine is still needed, however, to display the objects in the current environment visually.
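To make the separation between drawing and learning concrete, here is a minimal text-only sketch of a renderer. It only reads a state value and produces a picture of it, and never changes the state. The function name `render_text`, the track half-width of 2.4, and the 41-character width are assumptions for illustration; gym's own render() opens a graphics window instead.

```python
def render_text(x, x_limit=2.4, width=41):
    """Return a one-line text picture of a cart at position x on a track."""
    # Clamp the position so the cart is always drawn inside the track.
    x = max(-x_limit, min(x_limit, x))
    # Map the interval [-x_limit, x_limit] onto column indices [0, width - 1].
    col = round((x + x_limit) / (2 * x_limit) * (width - 1))
    cells = ["-"] * width
    cells[col] = "#"  # "#" marks the cart
    return "".join(cells)


frame = render_text(0.0)
print(frame)
```

Because the function has no side effects on the state, a training loop can skip it entirely without affecting the result.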
step() function details
Acts as the physics engine.
This function typically uses the agent's kinematic and dynamic models to compute the next state and the immediate reward, and to determine whether a terminal state has been reached.
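The sketch below shows these three jobs of step() on a deliberately simple toy problem: a point-mass cart pushed left or right along a track. It is plain Python, not gym's CartPole implementation. The class name `ToyPointMassEnv`, all parameter values, and the reward rule are assumptions, and gym's own step() returns additional values that are omitted here.

```python
class ToyPointMassEnv:
    """Hypothetical 1-D cart pushed left or right; not part of gym."""

    def __init__(self, mass=1.0, force_mag=10.0, dt=0.02, x_limit=2.4):
        self.mass = mass
        self.force_mag = force_mag
        self.dt = dt
        self.x_limit = x_limit
        self.x = 0.0  # position
        self.v = 0.0  # velocity

    def reset(self):
        self.x, self.v = 0.0, 0.0
        return (self.x, self.v)

    def step(self, action):
        # Dynamic model: the action (0 = push left, 1 = push right)
        # sets the force, and the force sets the acceleration.
        force = self.force_mag if action == 1 else -self.force_mag
        acc = force / self.mass
        # Kinematic model: explicit Euler update of position and velocity.
        self.x += self.dt * self.v
        self.v += self.dt * acc
        # Termination check: the cart has left the track.
        done = abs(self.x) > self.x_limit
        # Immediate reward (assumed rule): +1 while the cart stays on the track.
        reward = 0.0 if done else 1.0
        return (self.x, self.v), reward, done


env = ToyPointMassEnv()
state = env.reset()
steps = 0
done = False
while not done:
    state, reward, done = env.step(1)  # always push right
    steps += 1
print(steps, state)
```

The loop at the end plays one episode with a fixed action, so the episode ends as soon as the cart runs off the right edge of the track.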
Copyright notice
This article was written by [Zhao-Jichao]. Please include a link to the original when reposting. Thank you.
https://yzsam.com/2022/04/202204222202030814.html