Custom cover image
Custom cover image

Mathematical foundations of reinforcement learning / Shiyu Zhao

By: Resource type: Ressourcentyp: Buch (Online)Book (Online)Language: English Publisher: Singapore : Springer, [2025]Publisher: [Tsinghua] : Tsinghua University Press, [2025]Copyright date: © 2025Description: 1 Online-Ressource (xvi, 275 Seiten)ISBN:
  • 9789819739448
Subject(s): Additional physical formats: 9789819739431 | 9789819739455 | 9789819739462 | Erscheint auch als: 9789819739431 Druck-Ausgabe | Erscheint auch als: 9789819739455 Druck-Ausgabe | Erscheint auch als: 9789819739462 Druck-AusgabeDDC classification:
  • 006.3 23
DOI: DOI: 10.1007/978-981-97-3944-8Online resources: Summary: 1 Basic Concepts -- 2 State Value and Bellman Equation -- 3 Optimal State Value and Bellman Optimality Equation -- 4 Value Iteration and Policy Iteration -- 5 Monte Carlo Learning -- 6 Stochastic Approximation -- 7 Temporal-Difference Learning -- 8 Value Function Approximation -- 9 Policy Gradient -- 10 Actor-Critic Methods.Summary: This book provides a mathematical yet accessible introduction to the fundamental concepts, core challenges, and classic reinforcement learning algorithms. It aims to help readers understand the theoretical foundations of algorithms, providing insights into their design and functionality. Numerous illustrative examples are included throughout. The mathematical content is carefully structured to ensure readability and approachability. The book is divided into two parts. The first part is on the mathematical foundations of reinforcement learning, covering topics such as the Bellman equation, Bellman optimality equation, and stochastic approximation. The second part explicates reinforcement learning algorithms, including value iteration and policy iteration, Monte Carlo methods, temporal-difference methods, value function methods, policy gradient methods, and actor-critic methods. With its comprehensive scope, the book will appeal to undergraduate and graduate students, post-doctoral researchers, lecturers, industrial researchers, and anyone interested in reinforcement learning.PPN: PPN: 1915753384Package identifier: Produktsigel: ZDB-2-SEB | ZDB-2-SCS | ZDB-2-SXCS
No physical items for this record