Let's code a dueling deep q learning agent to beat the lunar lander environment. Dueling Deep Q learning is pretty cool in that it splits the Q network into a value function and an advantage function. This unique twist on the algorithm provides a significant improvement in convergence speed in the lunar lander environment from the OpenAI gym. Dueling Deep networks can also be incorporated into double deep q learning, or really any variant of deep Q learning as the only real change is the splitting of the Q function into a value and an advantage stream.
#DuelingDeepQLearning #PyTorch #OpenAIGym
Simple Deep Q Network w/Pytorch:
Reinforcement Learning Crash Course:
Policy Gradients w/Tensorflow
Deep Q Learning w/Tensorflow
Code Your Own RL Environments
How to Spec a Deep Learning PC:
Deep Q Learning w/ Pytorch:
Machine Learning Freelancing
Website:
Course:
Github:
Twitter:

0 Comments