Files
NoteNextra-origin/content/CSE5519/CSE5519_G5.md
2025-11-18 13:25:21 -06:00

567 B

CSE5519 Advances in Computer Vision (Topic G: 2025: Correspondence Estimation and Structure from Motion)

MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos

link to paper

  • vanilla Droid-SLAM
  • mono-depth initialization
  • objective movement map prediction
  • two-stage training scheme

Tip

How does the two-stage training scheme help with the robustness of the model? For me, it seems that this paper is just the integration of GeoNet (separated pose and depth) with full regression.