Update for using DQN to play sekiro 2021.2.2（English Version）

This is the code of using DQN to play Sekiro .

I am very glad to tell that I have writen the codes of using DQN to play Sekiro . As is known to all , Supervised learning can only learn skills from the data we provide for it . However , this time by using Reinforcement Learning , we can see a more clever agent playing Sekiro .

Reinforcement Learning can update its network by itself , using the reward feedback , which means we no longer need to collect our own data sets this time . All the data sets come from the real-time interaction between DQN network and the game. By using this DQN network , you can fight any boss you want in the game . There still something you need to know ：

In order to shorten the time of restart the game , we need game modifier .
The link for game modifier : https://patch.ali213.net/showpatch/118405.html
See the tutorial video for specific code usage , link : https://www.bilibili.com/video/BV1by4y1n7pe/

Have fun !

Old version sekiro_tensorflow

Code link for using Supervised learning to play Sekiro : https://github.com/analoganddigital/sekiro_tensorflow

Hello everyone , this is analoganddigital . I use this code to complete an interesting porgram of using machine learning to play Sekiro . You can see the final presentation in https://www.bilibili.com/video/BV1wC4y1s7oa/ . I am a junior student in university , which means I can't spend too much time on this program . What a shame ! On the other hand , many audiences hope me share this code . Thus , I eventually put it on the GitHub . This is an interesting program , and I hope everyone can enjoy it. In addition , I really welcome you to improve this program , to make this AI more smart ! There still something you need to konw:

The window size I set is 96*86 , you can change it by yourselves .
I finally collected 300M training data , if you want better result , maybe you need to collect more data .
I use Alexnet to finish the training . This program is depend on Supervised learning.
I have no idea about using Reinforcement learning yet , so I will really appreciate it if someone can help me to overcome this difficulty.(already finished)
See the tutorial video for specific code usage , link : https://www.bilibili.com/video/BV1bz4y1R7kB

Reference ： https://github.com/Sentdex/pygta5/blob/master/LICENSE

更新——强化学习DQN打只狼 2021.2.2（中文说明）

我非常高兴地告诉大家，我最近又开发出了用DQN强化学习打只狼的代码。众所周知，监督学习只能学习到我们所提供的数据集的相关技能，但是利用强化学习，我们将看到一个完全不一样的只狼。

强化学习会根据reward奖励进行判断并且自己学习一种打斗方法。更重要的是，我们这次不再需要自己收集数据集了，所有更新数据均来自于DQN网络与游戏的实时交互。利用这个DQN代码（链接见下方），你可以挑战只狼中任何一个boss，只要boss的血条位置不变即可（因为我采用的是图像抓取的方式获取只狼的血量与boss的血量进行reward判断）。然后还有一些注意事项：

为了缩短只狼复活周期，在这个项目训练中，我们需要采用只狼的24项修改器，让只狼能够原地复活继续训练。
修改器下载地址：https://patch.ali213.net/showpatch/118405.html
具体代码使用方法请见我在b站上发布的DQN打只狼的教程视频，链接：https://www.bilibili.com/video/BV1by4y1n7pe/

祝各位玩得愉快！

旧版本用机器学习打只狼

旧版本的利用监督学习打只狼的代码链接： https://github.com/analoganddigital/sekiro_tensorflow 。

各位观众大家好，我GitHub用户名是analoganddigital。我用这个程序完成了机器学习打只狼这个项目。最终效果视频可以看b站https://www.bilibili.com/video/BV1wC4y1s7oa/ 。我是一个大三学生，真的非常抱歉没能长时间更新这个项目，所以我把它放到了GitHub上面，之前很多观众也是私信我想要代码。总之我还是希望大家能喜欢这个小项目吧。当然，我非常希望大家能帮忙完善这个程序，万分感激，大家共同讨论我们会获益更多，这其实就是开源的意义。现在由于代码比较基础，所以训练效果不太好。我相信大家会有更多的点子，如果能更新一点算法，我们将会看到一个更机智的AI。我很感谢大家对之前视频的支持（受宠若惊），也十分期待大家有趣的优化，就算没有优化直接用也可以。还有一些细节我这声明一下：

我截取的图像大小是96*86的，各位可以根据自身情况选择。
我最终只收集了300M的数据，如果你想训练效果更好的话，可能要收集更多。
我用的神经网络是Alexnet，基于监督学习完成的。
由于我能力有限，我还没想好如何用强化学习优化算法，所以如果有大佬能分享一下自己的才华，那将十分感谢。（目前已经实现）
具体代码使用方法请见我在b站上发布的机器学习打只狼的教程视频，链接： https://www.bilibili.com/video/BV1bz4y1R7kB

部分参考代码： https://github.com/Sentdex/pygta5/blob/master/LICENSE

This is the code of using DQN to play Sekiro .

Related tags

Overview

Update for using DQN to play sekiro 2021.2.2（English Version）

Old version sekiro_tensorflow

更新——强化学习DQN打只狼 2021.2.2（中文说明）

旧版本用机器学习打只狼

Owner

Code for Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid

Pytorch Implementation of LNSNet for Superpixel Segmentation

Official code release for ICCV 2021 paper SNARF: Differentiable Forward Skinning for Animating Non-rigid Neural Implicit Shapes.

Semantic Segmentation of images using PixelLib with help of Pascalvoc dataset trained with Deeplabv3+ framework.

Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021)

Near-Optimal Sparse Allreduce for Distributed Deep Learning (published in PPoPP'22)

Computer Vision and Pattern Recognition, NUS CS4243, 2022

Code for Multinomial Diffusion

Evaluating AlexNet features at various depths

Repository for the semantic WMI loss

Code for reproducible experiments presented in KSD Aggregated Goodness-of-fit Test.

Fashion Entity Classification

Proof of concept GnuCash Webinterface

Music Source Separation; Train & Eval & Inference piplines and pretrained models we used for 2021 ISMIR MDX Challenge.

CVNets: A library for training computer vision networks

PiRapGenerator - Make anyone rap the digits of pi

Code repository for the work "Multi-Domain Incremental Learning for Semantic Segmentation", accepted at WACV 2022

Scaling and Benchmarking Self-Supervised Visual Representation Learning

DCSL - Generalizable Crowd Counting via Diverse Context Style Learning

Indoor Panorama Planar 3D Reconstruction via Divide and Conquer