


For this particular action, the immediate reward you observe in the environment is r. Let’s say our environment is in a particular state s, and upon taking an action a, it changes to state s’. Thus, we learn by experimentation and not imitation. We repeat this process multiple times gaining examples of positive and negative experiences, which acts as our training data. Instead, we interact with our environment and observe the outcome of our interaction. What is Reinforcement Learning (and Deep Q-Learning)?Ĭontrary to Supervised Learning, we do not need to manually label the training data in Reinforcement Learning. Let’s start with understanding the Reinforcement Learning technique and how we can formulate our free kick problem to fit this technique. For this tutorial, I will be trying to teach the bot to take 30-yard free kicks, but you can modify it to play other skill games as well.

Due to this reason, I was unable to train the AI on the full game, but could find a work-around to implement it for skill games in practice mode. A big challenge in implementing this is that we do not have access to the game’s code, so we can only make use of what we see on the game screen. In this article, I’l l provide a short description of what Reinforcement Learning is and how I applied it to this game. Previous article: Building a Deep Neural Network to play FIFA 18
