Discusses Stochastic Gradient Descent and its application in non-convex optimization, focusing on convergence rates and challenges in machine learning.
Provides an overview of policy gradient methods in reinforcement learning, focusing on the log-likelihood trick and the transition from batch to online learning.