人工智能是21世紀最激動人心的技術之一。人工智能,就是像人一樣的智能,而人的智能包括感知、決策和認知(從直覺到推理、規劃、意識等)。其中,感知解決what,深度學習已經超越人類水平;決策解決how,強化學習在遊戲和機器人等領域取得了一定效果;認知解決why,知識圖譜、因果推理和持續學習等正在研究。強化學習,採用反饋學習的方式解決序貫決策問題,因此必然是通往通用人工智能的終極鑰匙。
課程和視頻
Reinforcement Learning by David Silver [2015] [youtube] [bilibili]
CS 188: Introduction to Artificial Intelligence [Fall 2012-Spring 2014] [Fall 2018] [Summer 2019] [Spring 2020]
CS 294: Deep Reinforcement Learning by Sergey Levine [Fall 2015] [Spring 2017] [Fall 2017] [Fall 2018]
CS 285: Deep Reinforcement Learning [Fall 2019] [youtube]
Advanced Deep Learning & Reinforcement Learning by DeepMind & UCL [youtube2018]
Deep Reinforcement Learning and Control [Spring 2017]
CS234: Reinforcement Learning [Winter 2019] [youtube]
Deep Reinforcement Learning by 李宏毅 [Spring 2018] [yourube2018]
Reinforcement Learning by 莫煩 [homepage]
書籍
Reinforcement Learning: An Introduction (1st Edition, 1998) [homepage]
Reinforcement Learning: An Introduction (2nd Edition, 2018) [homepage] [bookdraft2018jan1] [2018] [Python Code]
Hands-On Reinforcement Learning With Python (2018) [homepage]
Reinforcement Learning With Open AI TensorFlow and Keras Using Python (2018) [homepage]
Algorithms for Reinforcement Learning (2010) [download]
《神經網絡與深度學習》[download]
代碼
ShangtongZhang/Python Implementation of Reinforcement Learning: An Introduction (2nd Edition) [github]
berkeleydeeprlcourse [github]
tensorlayer/RLzoo github
rlcode/reinforcement-learning [github]
MorvanZhou/Reinforcement-learning-with-tensorflow [github]
dennybritz/reinforcement-learning [github]
教程
OpenAI Spinning Up [英文版] [中文版]
演講
Rich Sutton, 2015, Introduction to Reinforcement Learning with Function Approximation
Andrew Barto, 2018, A history of reinforcement learning
David Silver, Principles of Deep RL
Benjamin Recht, 2018, Optimization Perspectives on Learning to Control
John Schulman, 2017, The Nuts and Bolts of Deep Reinforcement Learning Research
Joelle Pineau, Introduction to Reinforcement Learning
Deep Learning and Reinforcement Learning Summer School, 2018, 2017
Deep Learning Summer School, 2016, 2015
Yisong Yue and Hoang M. Le, Imitation Learning, ICML 2018 Tutorial
綜述
Li, Y. (2017). Deep Reinforcement Learning: An Overview. ArXiv. [paper]
Littman, M. L. (2015). Reinforcement learning improves behaviour from evaluative feedback. Nature, 521:445–451. [paper]
算法
環境
OpenAI Gym
Google Dopamine 2.0
Emo Todorov Mujoco
通用格子世界環境類
框架
OpenAI Baselines
百度 PARL
DeepMind OpenSpiel
研究員
David Silver [homepage]
Pieter Abbeel [homepage]
Sergey Levine [homepage]
李宏毅 [homepage]
會議/期刊
會議:AAAI、NIPS、ICML、ICLR、IJCAI、 AAMAS、IROS等。
期刊:AI、 JMLR、JAIR、 Machine Learning、JAAMAS等。
研究機構
OpenAI
DeepMind
Berkeley Artificial Intelligence Research (BAIR) Lab
博客
Keavnn’Blog
Medium : Reinforcement Learning
StackOverflow : Reinforcement Learning
知乎
公衆號
深度強化學習實驗室
深度學習技術前沿
AI科技評論
新智元
其他
kmario23/deep-learning-drizzle [github] [webpage]
Mr.Jk.Zhang [CSDN]