Markov decision process
Related lectures (32)
Monte Carlo Tree Search and Alpha Zero
Explores Monte Carlo Tree Search and Alpha Zero in deep reinforcement learning.
Optimal Hunting Strategies
Explores optimal hunting strategies, uncertain oil prices, and linear cost-minimization policies.
Bellman Equation: Value Consistency and Optimal Actions
Covers the Bellman equation, Q-values, discount factor, and optimal actions.
Reinforcement Learning: Markov Processes and Policy Optimization
Covers Markov processes, decision rules, and policy optimization techniques in reinforcement learning.
Generation of Markov Processes
Covers the generation of Markov processes and Markov chains, including transition matrices and stochastic matrices.
Interactive Lecture: Reinforcement Learning
Explores advanced reinforcement learning topics, including policies, value functions, Bellman recursion, and on-policy TD control.
Markov Chains: Ergodicity and Stationary Distribution
Explores ergodicity and stationary distribution in Markov chains, emphasizing convergence properties and unique distributions.
Dynamic Programming: Portfolio Optimization
Explores dynamic programming for optimizing portfolio choices and asset pricing theory.
Model-Based Deep RL: Planning and VAST
Covers model-based reinforcement learning, planning, variational state tabulation, and efficient updating of Q- and V-values.
Support Vector Machines: Exercises Solutions
Covers solutions to SVM exercises, discussing optimality conditions, decision functions, and parameter impacts.
Policy Gradient Methods: Convergence and Optimization
Covers the convergence of policy gradient methods and their optimization in reinforcement learning.
Positive recurrence: invariant distributions
Explores positive recurrence and invariant distributions in Markov chains, discussing their relationship and implications.