Why Deep Q Learning Needs A Target Network and Replay Memory | Course Excerpt For Cyber Monday
11.2 هزار بار بازدید -
5 سال پیش
-
The two biggest innovations in
The two biggest innovations in deep Q learning were the introduction of the target network and the replay memory. One would think that simply bolting a deep neural network to the Q learning algorithm would be enough for a robust deep Q learning agent, but that isn't the case. In this video I'll show you how this naive implementation of the deep q learning agent fails, and spectacularly at that.
#DeepQLearning #PyTorch #ReinforcementLearning
This is an excerpt from my new course, Actor Critic Methods from Paper to Code
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to [email protected]
https://www.neuralnet.ai/courses
Or, pickup my Udemy courses here:
Deep Q Learning:
https://www.udemy.com/course/deep-q-l...
Actor Critic Methods:
https://www.udemy.com/course/actor-cr...
Curiosity Driven Deep Reinforcement Learning
https://www.udemy.com/course/curiosit...
Natural Language Processing from First Principles:
https://www.udemy.com/course/natural-...
Reinforcement Learning Fundamentals
https://www.manning.com/livevideo/rei...
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: https://bit.ly/3fXHy8W
Grokking Deep Learning: https://bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: https://bit.ly/2VNAXql
Come hang out on Discord here:
Discord: discord
Need personalized tutoring? Help on a programming project? Shoot me an email! [email protected]
Website: https://www.neuralnet.ai
Github: https://github.com/philtabor
Twitter: Twitter: MLWithPhil
#DeepQLearning #PyTorch #ReinforcementLearning
This is an excerpt from my new course, Actor Critic Methods from Paper to Code
Learn how to turn deep reinforcement learning papers into code:
Get instant access to all my courses, including the new Prioritized Experience Replay course, with my subscription service. $29 a month gives you instant access to 42 hours of instructional content plus access to future updates, added monthly.
Discounts available for Udemy students (enrolled longer than 30 days). Just send an email to [email protected]
https://www.neuralnet.ai/courses
Or, pickup my Udemy courses here:
Deep Q Learning:
https://www.udemy.com/course/deep-q-l...
Actor Critic Methods:
https://www.udemy.com/course/actor-cr...
Curiosity Driven Deep Reinforcement Learning
https://www.udemy.com/course/curiosit...
Natural Language Processing from First Principles:
https://www.udemy.com/course/natural-...
Reinforcement Learning Fundamentals
https://www.manning.com/livevideo/rei...
Here are some books / courses I recommend (affiliate links):
Grokking Deep Learning in Motion: https://bit.ly/3fXHy8W
Grokking Deep Learning: https://bit.ly/3yJ14gT
Grokking Deep Reinforcement Learning: https://bit.ly/2VNAXql
Come hang out on Discord here:
Discord: discord
Need personalized tutoring? Help on a programming project? Shoot me an email! [email protected]
Website: https://www.neuralnet.ai
Github: https://github.com/philtabor
Twitter: Twitter: MLWithPhil
5 سال پیش
در تاریخ 1398/09/09 منتشر شده
است.
11,230
بـار بازدید شده