Markus Buchholz
1 min read · Mar 17, 2019


The policy gradient equation is shown in the pseudocode. The best explanation is in the original DeepMind paper (Section 3). The approach is similar to the REINFORCE algorithm, which I discussed in my post.
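Since the comparison is with REINFORCE, here is a minimal sketch of the REINFORCE-style gradient term for a single sampled action. It is not the pseudocode from the paper. It assumes a softmax policy over discrete actions, and the names `softmax` and `reinforce_gradient` are hypothetical helpers chosen for illustration. The term computed is the return G multiplied by the gradient of log π(a) with respect to the logits, which for a softmax policy is the one-hot vector of the action minus the action probabilities.

```python
import math


def softmax(logits):
    # Subtract the max logit for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce_gradient(logits, action, ret):
    # Hypothetical helper: returns G * d/d(logits) log pi(action)
    # for a softmax policy, i.e. G * (one_hot(action) - probs).
    probs = softmax(logits)
    return [
        ret * ((1.0 if i == action else 0.0) - p)
        for i, p in enumerate(probs)
    ]


# Two equally likely actions; action 1 was sampled and earned return 2.0.
grad = reinforce_gradient([0.0, 0.0], action=1, ret=2.0)
print(grad)
```

Adding a small multiple of this vector to the logits raises the probability of the sampled action when the return is positive and lowers it when the return is negative.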


Written by Markus Buchholz

Researcher in underwater robotics
