The policy gradient equation is shown in pseudo code.

12. Then, we update our policy network weights using a policy gradient.
1
1
Alex Movila
Markus Buchholz
·Follow
1 min read·
Mar 17, 2019
--
The policy gradient equation is shown in pseudo code. You can find the best explanation in orgin DeepMind paper. (section 3). It is similar to Reinforce algorithmm which I discussed in my post.
--
--
Written by Markus Buchholz862 Followers
·342 Following
Researcher in underwater robotics
Responses (2)
Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams