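Train an AI model to play the 合成大西瓜 (Merge Big Watermelon) game with the DQN reinforcement learning algorithm; Keras and PARL (Paddle) versions are provided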

Last update: Dec 17, 2022

Overview

Playing 合成大西瓜 (Merge Big Watermelon) with reinforcement learning

Code: https://github.com/Sharpiless/play-daxigua-using-Reinforcement-Learning

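This project uses the DQN reinforcement learning algorithm to train an AI model that plays the 合成大西瓜 game. Keras, PARL (Paddle), and PyTorch versions are provided.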

Bilibili: https://space.bilibili.com/470550823

CSDN: https://blog.csdn.net/weixin_44936889

AI Studio: https://aistudio.baidu.com/aistudio/personalcenter/thirdview/67156

GitHub: https://github.com/Sharpiless

1. Launch the game:

The 大西瓜 game has been rewritten with pygame and wrapped as code suitable for use as an RL environment.
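To illustrate what "wrapped as an RL environment" usually means, here is a minimal sketch of the common reset/step interface. `ToyDropEnv`, its parameters, and its reward rule are hypothetical and exist only for this illustration; they are not the repository's actual classes or game logic, and no pygame rendering is involved.

```python
class ToyDropEnv:
    """Hypothetical toy environment; NOT the repository's game class.

    The agent picks a column to drop an item into. The episode ends when
    any column reaches max_height.
    """

    def __init__(self, n_columns=8, max_height=5):
        self.n_columns = n_columns
        self.max_height = max_height
        self.heights = [0] * n_columns

    def reset(self):
        # Start a new episode and return the initial observation.
        self.heights = [0] * self.n_columns
        return tuple(self.heights)

    def step(self, action):
        if not 0 <= action < self.n_columns:
            raise ValueError("action out of range")
        # Toy reward (an assumption): 1.0 for dropping into a lowest column.
        reward = 1.0 if self.heights[action] == min(self.heights) else 0.0
        self.heights[action] += 1
        done = max(self.heights) >= self.max_height
        return tuple(self.heights), reward, done
```

An agent would call `reset()` once per episode and then call `step(action)` repeatedly until `done` is true, which is the loop a DQN training script is typically built around.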

Unzip the image assets:

unzip res.zip

Run:

python Main.py

The game then starts.

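2. Train the RL model:

The RL algorithm is DQN. The Keras version uses a simple convolutional neural network to compute Q-values, while the PARL version uses a ResNet.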
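As a reminder of what the network's Q-values are used for in DQN, the sketch below shows the two standard ingredients in plain Python: epsilon-greedy action selection and the one-step target r + gamma * max Q(s', a'). The function names and default values here are assumptions made for illustration; the training scripts in the repository may organize this differently.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon pick a random action, else the highest-Q action."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def td_target(reward, next_q_values, done, gamma=0.99):
    """One-step DQN target: the reward alone on terminal steps, else bootstrapped."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)
```

In a full training loop, the network would be fitted so that its predicted Q-value for the action taken moves toward `td_target(...)`, with `epsilon` typically decayed over time so the agent explores less as training progresses.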

Run:

python train_keras.py

or

python train_paddle.py

or

python train_torch.py

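Training then starts.

Follow my WeChat official account:

If you are interested, follow my WeChat official account, 可达鸭的深度学习教程 (Psyduck's Deep Learning Tutorials).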
