Provides an overview of policy gradient methods in reinforcement learning, focusing on the log-likelihood trick and the transition from batch to online learning.
Explores variance reduction techniques in deep learning, covering gradient descent, stochastic gradient descent, SVRG method, and performance comparison of algorithms.