Today, I’m gonna share with you an amazing algorithm that learns from zero (no needs labeled data) to collecting yellow bananas while avoiding blue bananas. It’s very nice, isn’t? Before we talk about the algorithm and the agent, let’s understand how reinforcement learning works.