Reinforcement Learning Simulator
See how an AI agent learns to optimize traffic signals through trial and error to maximize rewards (i.e., minimize wait times).

Low episodes mean the agent is exploring. High episodes mean it's exploiting what it has learned.