DQN agent trained on a four-way intersection MDP; 31% reduction in average waiting time versus fixed-cycle baseline.