RL environment for diagnosing ML training failures
A custom 2D 8*8 grid RL env with random obstacle spawning