Imitation Learning, Visual Geometry, Diffusion Model