By prompting the model with different language, we can simulate counterfactuals and complex scenes, allowing for the generation of varied scenarios based on the same initial conditions and enabling a robot to interact with objects as if it were in the real world, even if the hardware doesn't support it.