Skip to main content

Search

Show all results for

Home

Lecture

Introduction to policy gradient

About
Privacy
Disclaimer

Copyright © 2026 EPFL, all rights reserved

Graph Chatbot

Description

This lecture introduces the concept of policy gradients, explaining how actions are associated with observations to optimize rewards parametrically using a gradient method, contrasting it with Q-learning.

Official source

https://mediaspace.epfl.ch/media/0_okkx50f1

About this result

This page is automatically generated and may contain information that is not correct, complete, up-to-date, or relevant to your search query. The same applies to every other page on this website. Please make sure to verify the information with EPFL's official sources.

Related lectures (30)

Perception: Data-Driven Approaches

Explores perception in deep learning for autonomous vehicles, covering image classification, optimization methods, and the role of representation in machine learning.

The Hidden Convex Optimization Landscape of Deep Neural Networks

Explores the hidden convex optimization landscape of deep neural networks, showcasing the transition from non-convex to convex models.

Statistical Physics in Machine Learning: Understanding Deep Learning

Explores the application of statistical physics in understanding deep learning with a focus on neural networks and machine learning challenges.

Reinforcement Learning Concepts

Covers key concepts in reinforcement learning, neural networks, clustering, and unsupervised learning, emphasizing their applications and challenges.

Foundations of Deep Learning: Transformer Architecture Overview

Covers the foundational concepts of deep learning and the Transformer architecture, focusing on neural networks, attention mechanisms, and their applications in sequence modeling tasks.