【深度強化學習教程】高質量PyTorch實現集錦

【導讀】包含用PyTorch語言編寫的深度強化學習算法的高質量實現。

作者：這些IPython筆記本的目的主要是幫助我練習和理解我讀過的論文；因此，在某些情況下，我將選擇可讀性而不是效率。首先，我會上傳論文的實現，然後是標記來解釋代碼的每一部分。

相關論文

Human Level Control Through Deep Reinforement Learning [Publication] https://deepmind.com/research/publications/human-level-control-through-deep-reinforcement-learning/ [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/01.DQN.ipynb
Multi-Step Learning (from Reinforcement Learning: An Introduction, Chapter 7) [Publication] https://github.com/qfettes/DeepRL-Tutorials/blob/master/01.DQN.ipynb [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/02.NStep_DQN.ipynb
Deep Reinforcement Learning with Double Q-learning [Publication] https://arxiv.org/abs/1509.06461 [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/03.Double_DQN.ipynb
Dueling Network Architectures for Deep Reinforcement Learning [Publication] https://arxiv.org/abs/1511.06581 [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/04.Dueling_DQN.ipynb
Noisy Networks for Exploration [Publication] https://github.com/qfettes/DeepRL-Tutorials/blob/master/04.Dueling_DQN.ipynb [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/05.DQN-NoisyNets.ipynb
Prioritized Experience Replay [Publication] https://arxiv.org/abs/1511.05952?context=cs [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/06.DQN_PriorityReplay.ipynb
A Distributional Perspective on Reinforcement Learning [Publication] https://arxiv.org/abs/1707.06887 [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/07.Categorical-DQN.ipynb
Rainbow: Combining Improvements in Deep Reinforcement Learning [Publication] https://arxiv.org/abs/1710.02298 [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/08.Rainbow.ipynb
Distributional Reinforcement Learning with Quantile Regression [Publication] https://arxiv.org/abs/1710.10044 [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/09.QuantileRegression-DQN.ipynb
Rainbow with Quantile Regression [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/10.Quantile-Rainbow.ipynb
Deep Recurrent Q-Learning for Partially Observable MDPs [Publication] https://arxiv.org/abs/1507.06527 [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/11.DRQN.ipynb
Advantage Actor Critic (A2C) [Publication1] https://arxiv.org/abs/1602.01783 [Publication2] https://blog.openai.com/baselines-acktr-a2c/ [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/12.A2C.ipynb
High-Dimensional Continuous Control Using Generalized Advantage Estimation [Publication] https://arxiv.org/abs/1506.02438 [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/13.GAE.ipynb
Proximal Policy Optimization Algorithms [Publication] https://arxiv.org/abs/1707.06347 [code] https://github.com/qfettes/DeepRL-Tutorials/blob/master/14.PPO.ipynb

PyTorch實現

【深度強化學習教程】高質量PyTorch實現集錦

再談23種設計模式（3）：行爲型模式（學習筆記）

Power Automate Desktop 安裝完，登錄後老是提示one driver 錯誤

微前端學習筆記(4):從微前端到微模塊之EMP與hel-micro方案探索

微前端學習筆記（1）：微前端總體架構概述，從微服務發微

985 碩士程序員，空窗 4 個月沒有 Offer！

一文搞懂 Spring 循環依賴

賽博鬥地主——使用大語言模型扮演Agent智能體玩牌類遊戲。

VScode右鍵打開(添加到右鍵)

記一次 .NET某工控視覺自動化系統卡死分析

WindowsServer--SQL Server搭建主從同步實現讀寫分離 - 事務性分發

【乾貨】計算機也會ps圖片：TL-GAN（附代碼和sildes下載）

【教程】語音識別中的End-to-End模型教程（附178頁PDF全文下載）

《機器學習100天》一份超全機器學習實戰資料，初學者必備！

人工智能產業估值高企

2018年度北京市自然科學基金傑出青年科學基金擬資助項目公佈

Mac下配置sublime實現LaTeX

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結