Suppose the agent performs the Suck action at time t then square x y will NOT incur cost due to dirtiness at step t because at the end of this time step square x y is already clean

Showing the single result

Showing the single result