1 min readMar 17, 2019
The policy gradient equation is shown in pseudo code. You can find the best explanation in orgin DeepMind paper. (section 3). It is similar to Reinforce algorithmm which I discussed in my post.
The policy gradient equation is shown in pseudo code. You can find the best explanation in orgin DeepMind paper. (section 3). It is similar to Reinforce algorithmm which I discussed in my post.