Advanced Reinforcement Learning: Implementation

Machine Learning | Intermediate

11 videos | 1h 34m 16s
Includes Assessment
Earns a Badge

From Channel: Machine Learning

From Journey: ML Programmer to ML Architect

In this 11-video course, learners examine the role of reward and discount factors in reinforcement learning, along with the multi-armed bandit problem and approaches to solving it. You will begin by learning how to install the Markov Decision Process (MDP) toolbox and implement a discounted Markov decision process using the policy iteration algorithm. Next, learn about dynamic programming, policy evaluation, policy iteration, value iteration, and the characteristics of the Bellman equation. Then explore reinforcement learning agent components and applications; work with reinforcement learning agents using Keras and OpenAI Gym; describe reinforcement learning algorithms and the reinforcement learning taxonomy defined by OpenAI; and implement deep Q-learning with Keras. Finally, observe how to train deep neural networks (DNNs) with reinforcement learning for time series forecasting. In the closing exercise, you will recall approaches for resolving the multi-armed bandit problem, list reinforcement learning agent components, and implement deep Q-learning by using Keras and OpenAI Gym.

WHAT YOU WILL LEARN

Discover the key concepts covered in this course

Install the Markov Decision Process (MDP) toolbox and implement a discounted Markov decision process using the policy iteration algorithm

Recognize the role of reward and discount factors in reinforcement learning

Describe the multi-armed bandit problem and different approaches to solving it

Describe dynamic programming, policy evaluation, policy iteration, value iteration, and the characteristics of the Bellman equation

List reinforcement learning agent components and reinforcement learning agent applications

Work with reinforcement learning agents using Keras and OpenAI Gym

Describe reinforcement learning algorithms and the reinforcement learning taxonomy defined by OpenAI

Implement deep reinforcement learning or deep Q-learning using Keras and OpenAI Gym

Recognize how to train deep neural networks using reinforcement learning for time series forecasting

Recall approaches for resolving the multi-armed bandit problem, list reinforcement learning agent components, and implement deep Q-learning using Keras and OpenAI Gym

IN THIS COURSE

1m 42s

8m 8s

In this video, you will learn how to install the Markov Decision Process (MDP) toolbox and implement a discounted Markov decision process using the policy iteration algorithm.
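For orientation, here is a minimal sketch of policy iteration on a hypothetical two-state MDP. It is written in plain Python and does not use the toolbox's own API. The transition table `P`, reward table `R`, the discount of 0.9, and the function name are assumptions made for illustration, not material taken from the course.

```python
def policy_iteration(P, R, gamma, tol=1e-10):
    """P[a][s][s2] is a transition probability, R[s][a] an expected reward."""
    n_states = len(R)
    n_actions = len(P)
    policy = [0] * n_states
    V = [0.0] * n_states

    def q_value(s, a):
        # One-step lookahead: immediate reward plus discounted next-state value.
        return R[s][a] + gamma * sum(P[a][s][s2] * V[s2] for s2 in range(n_states))

    while True:
        # Policy evaluation: sweep until the values stop changing.
        while True:
            delta = 0.0
            for s in range(n_states):
                v_new = q_value(s, policy[s])
                delta = max(delta, abs(v_new - V[s]))
                V[s] = v_new
            if delta < tol:
                break
        # Policy improvement: switch to a strictly better action where one exists.
        stable = True
        for s in range(n_states):
            best = max(range(n_actions), key=lambda a: q_value(s, a))
            if q_value(s, best) > q_value(s, policy[s]) + 1e-12:
                policy[s] = best
                stable = False
        if stable:
            return policy, V


# Hypothetical MDP: action 0 stays put, action 1 moves to the other state.
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: move
]
# Only staying in state 1 pays a reward.
R = [[0.0, 0.0], [1.0, 0.0]]

policy, V = policy_iteration(P, R, gamma=0.9)
```

In this toy problem the resulting policy moves out of state 0 and stays in state 1, since state 1 is the only place a reward is collected.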
3. Rewards and Discounts

11m 15s

Upon completion of this video, you will be able to recognize the role of reward and discount factors in reinforcement learning.
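As a rough illustration of how a discount factor weights rewards, the sketch below computes the discounted return of a reward sequence, that is, the sum of each reward multiplied by the discount raised to the number of steps before it is received. The function name and the sample rewards are assumptions for illustration only.

```python
def discounted_return(rewards, gamma):
    """Return r0 + gamma*r1 + gamma^2*r2 + ... for a finite reward sequence."""
    g = 0.0
    # Work backwards so each step is a single multiply-and-add.
    for r in reversed(rewards):
        g = r + gamma * g
    return g


# With gamma = 0.5, later rewards count for progressively less.
example = discounted_return([1.0, 1.0, 1.0], gamma=0.5)  # 1 + 0.5 + 0.25 = 1.75
```

A discount of 0 makes the agent care only about the immediate reward, while a discount of 1 weights every reward in the sequence equally.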
4. Multi-Armed Bandit Problem

10m 35s

After completing this video, you will be able to describe the multi-armed bandit problem and different approaches for solving this problem.
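One commonly used approach to the multi-armed bandit problem is epsilon-greedy action selection; whether the course presents it in this form is an assumption. The sketch below simulates hypothetical Bernoulli arms, and the arm probabilities, step count, epsilon, and seed are illustrative choices.

```python
import random


def epsilon_greedy_bandit(true_probs, steps=5000, epsilon=0.1, seed=0):
    """Simulate epsilon-greedy play on Bernoulli arms; return estimates and pull counts."""
    rng = random.Random(seed)
    n_arms = len(true_probs)
    counts = [0] * n_arms
    estimates = [0.0] * n_arms
    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated reward so far.
            arm = max(range(n_arms), key=lambda a: estimates[a])
        reward = 1.0 if rng.random() < true_probs[arm] else 0.0
        counts[arm] += 1
        # Incremental update of the sample mean for the chosen arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
    return estimates, counts


estimates, counts = epsilon_greedy_bandit([0.1, 0.5, 0.9])
best_arm = max(range(len(estimates)), key=lambda a: estimates[a])
```

The epsilon parameter controls the trade-off: a larger value spends more pulls exploring, a smaller value commits sooner to the arm that currently looks best.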
5. Dynamic Programming and Bellman Equation

6m 2s

After completing this video, you will be able to describe dynamic programming, policy evaluation, policy iteration, value iteration, and characteristics of the Bellman equation.
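To make the Bellman optimality backup concrete, the following sketch runs value iteration on a hypothetical two-state MDP. Each sweep replaces a state's value with the best one-step lookahead over actions. The tables `P` and `R`, the discount, and the function name are assumptions for illustration, not taken from the course.

```python
def value_iteration(P, R, gamma, tol=1e-10):
    """P[a][s][s2] is a transition probability, R[s][a] an expected reward."""
    n_states = len(R)
    n_actions = len(P)
    V = [0.0] * n_states
    while True:
        delta = 0.0
        for s in range(n_states):
            # Bellman optimality backup: best action value from state s.
            best = max(
                R[s][a] + gamma * sum(P[a][s][s2] * V[s2] for s2 in range(n_states))
                for a in range(n_actions)
            )
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            return V


# Hypothetical MDP: action 0 stays put, action 1 moves to the other state.
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: move
]
# Only staying in state 1 pays a reward.
R = [[0.0, 0.0], [1.0, 0.0]]

V = value_iteration(P, R, gamma=0.9)
```

Unlike policy iteration, this loop never stores an explicit policy; a greedy policy can be read off the converged values afterwards.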
6. Reinforcement Learning Agent and Its Components

7m 13s

After completing this video, you will be able to list the components of a reinforcement learning agent and its applications.
7. Reinforcement Learning with OpenAI Gym and Keras

17m 3s

Find out how to work with reinforcement learning agents using Keras and OpenAI Gym.
8. Reinforcement Learning Taxonomy by OpenAI

8m 6s

Upon completion of this video, you will be able to describe reinforcement learning algorithms and the reinforcement learning taxonomy defined by OpenAI.
9. Deep Q-Learning Implementation

9m 48s

In this video, you will learn how to implement deep reinforcement learning or deep Q-learning using Keras and OpenAI Gym.
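The course's Keras code is not reproduced on this page. As a hedged sketch of one piece of deep Q-learning, the function below computes the standard training targets, reward plus the discounted maximum next-state Q-value, with no bootstrap term on terminal transitions. Plain Python lists stand in for network outputs, and the function name and argument layout are assumptions.

```python
def dqn_targets(batch, next_q_values, gamma):
    """Compute one Q-learning target per transition.

    batch: list of (reward, done) pairs.
    next_q_values: for each transition, the per-action Q-values of the next
        state (in a real agent these would come from a neural network).
    """
    targets = []
    for (reward, done), q_next in zip(batch, next_q_values):
        # Terminal transitions have no future value to bootstrap from.
        bootstrap = 0.0 if done else gamma * max(q_next)
        targets.append(reward + bootstrap)
    return targets


targets = dqn_targets(
    batch=[(1.0, False), (1.0, True)],
    next_q_values=[[0.5, 2.0], [3.0, 4.0]],
    gamma=0.9,
)
```

In a full agent, the network's prediction for the action actually taken would be regressed toward these targets.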
10. Training DNN Using Reinforcement Learning

6m 48s

After completing this video, you will be able to recognize how to train deep neural networks using reinforcement learning for time series forecasting.
11. Exercise: Implementing Deep Q-Learning

7m 38s

Upon completion of this video, you will be able to recall approaches for resolving the multi-armed bandit problem, list reinforcement learning agent components, and implement deep Q-learning using Keras and OpenAI Gym.

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft gives you the opportunity to earn a digital badge upon successful completion of some of our courses. Badges can be shared on any social network or business platform.

Digital badges are yours to keep, forever.
