Scene Understanding & Perception
Feed-Forward 3D Reconstruction and Understanding
Visual Localization