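Rewards and Penalties

Reinforcement learning (RL) studies how an agent learns to choose actions in an environment so as to maximize the reward it accumulates over time. Q-Learning is a foundational tabular method: it keeps an estimated value for every state-action pair in a table and refines those estimates from experience.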
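To make the table-based idea concrete, here is a minimal sketch of tabular Q-Learning in Python. The environment is an assumption made up for illustration, not something from the text: a one-dimensional corridor in which the agent moves left or right, pays a small penalty (-0.01) on each ordinary step, and earns a reward (+1) on reaching the rightmost cell. The names `q_update` and `train`, and the values chosen for `alpha`, `gamma`, and `epsilon`, are likewise hypothetical.

```python
import random


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place and return the new estimate."""
    best_next = max(q[next_state])            # value of the greedy next action
    td_target = reward + gamma * best_next    # bootstrapped target
    q[state][action] += alpha * (td_target - q[state][action])
    return q[state][action]


def train(n_states=5, episodes=500, epsilon=0.1, max_steps=100, seed=0):
    """Learn a Q-table for the assumed corridor (action 0 = left, 1 = right)."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]  # one row per state, one column per action
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        for _ in range(max_steps):             # cap episode length
            if state == goal:
                break
            # Epsilon-greedy choice; ties between the two actions are broken at random.
            explore = rng.random() < epsilon or q[state][0] == q[state][1]
            action = rng.randrange(2) if explore else q[state].index(max(q[state]))
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else -0.01  # reward at goal, penalty elsewhere
            q_update(q, state, action, reward, next_state)
            state = next_state
    return q


if __name__ == "__main__":
    q_table = train()
    policy = ["left" if left > right else "right" for left, right in q_table[:-1]]
    print(policy)
```

The goal state's row is never updated, so its values stay at zero and the terminal state contributes nothing to the bootstrapped target. A real task would replace the corridor with its own states, actions, and reward signal.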