Covers model-free prediction methods in reinforcement learning, focusing on Monte Carlo and Temporal-Difference (TD) learning for estimating value functions without knowledge of the transition dynamics.
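As a rough illustration of the two prediction methods named above, the following sketch estimates state values on a small random-walk chain using only sampled episodes. The environment, the function names (`run_episode`, `mc_prediction`, `td0_prediction`), the step size, and the episode counts are all assumptions chosen for this example, not anything specified by the source.

```python
import random

# Hypothetical toy problem: a 5-state random walk. The agent starts in the
# middle, moves left or right with equal probability, and receives reward 1
# only when it exits on the right. The true value of state s is (s + 1) / 6.
N_STATES = 5
GAMMA = 1.0  # undiscounted, episodic task


def run_episode(rng):
    """Sample one episode as a list of (state, reward, next_state).

    next_state is None when the transition ends the episode.
    """
    s = N_STATES // 2
    transitions = []
    while True:
        s_next = s + rng.choice((-1, 1))
        if s_next < 0:
            transitions.append((s, 0.0, None))
            return transitions
        if s_next >= N_STATES:
            transitions.append((s, 1.0, None))
            return transitions
        transitions.append((s, 0.0, s_next))
        s = s_next


def mc_prediction(episodes, rng):
    """Every-visit Monte Carlo: average the full sampled return per state."""
    v = [0.0] * N_STATES
    counts = [0] * N_STATES
    for _ in range(episodes):
        g = 0.0
        # Walk the episode backwards so the return accumulates incrementally.
        for s, r, _s_next in reversed(run_episode(rng)):
            g = r + GAMMA * g
            counts[s] += 1
            v[s] += (g - v[s]) / counts[s]  # running mean of returns
    return v


def td0_prediction(episodes, rng, alpha=0.01):
    """TD(0): bootstrap from the current estimate of the next state."""
    v = [0.5] * N_STATES
    for _ in range(episodes):
        for s, r, s_next in run_episode(rng):
            bootstrap = v[s_next] if s_next is not None else 0.0
            target = r + GAMMA * bootstrap
            v[s] += alpha * (target - v[s])
    return v


if __name__ == "__main__":
    rng = random.Random(0)
    print("MC   :", [round(x, 2) for x in mc_prediction(5000, rng)])
    print("TD(0):", [round(x, 2) for x in td0_prediction(5000, rng)])
```

Neither function touches transition probabilities: Monte Carlo waits for the complete return of each episode, while TD(0) updates after every step using its own estimate of the successor state.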
Explores Stochastic Optimal Control, emphasizing Optimal Consumption and Investment, the Martingale Representation Theorem, and the Verification Theorem.