Recent Posts

Deep Q-networks

   

This post uses Deep Q Networks to introduce off-policy algorithms

On-Policy Actor-Critic Algorithms

   

This post introduces Actor-Critic Algorithms as an extension of basic policy gradient algorithms such as REINFORCE.