Distributional Reinforcement Learning

6h 19m
Marc G. Bellemare, Mark Rowland, Will Dabney
The MIT Press
2023

The first comprehensive guide to distributional reinforcement learning, providing a new mathematical formalism for thinking about decisions from a probabilistic perspective.

Distributional reinforcement learning is a new mathematical formalism for thinking about decisions. Going beyond the common approach to reinforcement learning and expected values, it focuses on the total reward or return obtained as a consequence of an agent's choices—specifically, how this return behaves from a probabilistic perspective. In this first comprehensive guide to distributional reinforcement learning, Marc G. Bellemare, Will Dabney, and Mark Rowland, who spearheaded development of the field, present its key concepts and review some of its many applications. They demonstrate its power to account for many complex, interesting phenomena that arise from interactions with one's environment.

The authors present core ideas from classical reinforcement learning to contextualize distributional topics and include mathematical proofs pertaining to major results discussed in the text. They guide the reader through a series of algorithmic and mathematical developments that, in turn, characterize, compute, estimate, and make decisions on the basis of the random return. Practitioners in disciplines as diverse as finance (risk management), computational neuroscience, computational psychiatry, psychology, macroeconomics, and robotics are already using distributional reinforcement learning, paving the way for its expanding applications in mathematical finance, engineering, and the life sciences. More than a mathematical approach, distributional reinforcement learning represents a new perspective on how intelligent agents make predictions and decisions.

About the Author

Marc G. Bellemare is Senior Staff Research Scientist, Google Research and Adjunct Professor, McGill University.

Will Dabney is Senior Staff Research Scientist, DeepMind.

Mark Rowland is Staff Research Scientist, DeepMind.

In this Book

Introduction
The Distribution of Returns
Learning the Return Distribution
Operators and Metrics
Distributional Dynamic Programming
Incremental Algorithms
Control
Statistical Functionals
Linear Function Approximation
Deep Reinforcement Learning
Two Applications and a Conclusion
Notation
References

FREE ACCESS

Book Mathematical Methods in Data Science, First Edition

Book Federated Learning: Fundamentals and Advances

Book Learning Technology: A Complete Guide for Learning Professionals

Get Started

Sharpen your skills. Upgrade your career. Find the right learning path for you, based on your role and skills. Take part in hands-on practice, study for a certification, and much more - all personalized for you.

*Not included: Compliance, Leadership Development Program content, and Engineering books

Your content + our content + our platform = a path to learning success

Using our learning experience platform, Percipio, your learners can engage in custom learning paths that can feature curated content from all sources.

Learn More

Aspire to something bigger

Aspire Journeys are guided learning paths that set you in motion for career success.

Browse Aspire Journeys

Explore a world of live learning with Global Knowledge

Choose from convenient delivery formats to get the training you and your team need - where, when and how you want it.

Browse Live Learning

IT Skills & Salary Report

ESG Impact Report

Distributional Reinforcement Learning

In this Book

YOU MIGHT ALSO LIKE