In this brief tutorial you’re going to learn the fundamentals of deep reinforcement learning, and the basic concepts behind actor critic methods. We’ll cover the Markov decision process, the agent’s policy, reward discounting and why it’s necessary, and the actor critic algorithm. We’ll implement an actor critic algorithm using Tensorflow 2 to handle the cart pole environment from the Open AI Gym.

Actor critic methods form the basis for more advanced algorithms such as deep deterministic policy gradients, soft actor critic, and twin delayed deep deterministic policy gradients, among others.

https://youtu.be/LawaN3BdI00

#deep-learning #python #machine-learning #artificial-intelligence #tensorflow

Everything You Need To Master Actor Critic Methods | Tensorflow 2 Tutorial
3.45 GEEK