Zheyuan Wu ce180d4b26 update
2025-09-01 23:04:32 -05:00

CSE5519 Topic E.1 2021 and before: Deep Learning for Geometric Computer Vision

Note

This topic is presented by me and will perhaps be the most detailed one for this course.

This topic is mainly about depth estimation from monocular images. (Boring, not even RANSAC)

PoseNet

A Convolutional Network for Real-Time 6-DOF Camera Relocalization (ICCV 2015)

link to the paper

PoseNet trains a convolutional neural network (convnet) to estimate camera pose directly from a monocular image I. The network outputs a pose vector p, given by a 3D camera position x and an orientation represented by a quaternion q:


p = [x, q]

Here x is the 3D camera position (3 values) and q is the orientation quaternion (4 values), so p has 7 components in total.
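To make the pose vector concrete, here is a minimal sketch of how a 7-value output could be unpacked into a position and a rotation matrix. The function names `split_pose` and `quaternion_to_rotation` are hypothetical, and the (w, x, y, z) quaternion ordering is an assumption for illustration; the notes above do not state which ordering is used.

```python
import math


def split_pose(p):
    """Split a pose vector p = [x, q] into a position and a unit quaternion.

    Assumes p has 7 values: 3 for position, then 4 for the quaternion.
    """
    if len(p) != 7:
        raise ValueError("expected 7 values: 3 for position, 4 for quaternion")
    position = list(p[:3])
    quaternion = list(p[3:])
    # A raw regression output is generally not unit length, so normalize it
    # before treating it as a rotation.
    norm = math.sqrt(sum(c * c for c in quaternion))
    if norm == 0.0:
        raise ValueError("quaternion must be non-zero")
    quaternion = [c / norm for c in quaternion]
    return position, quaternion


def quaternion_to_rotation(q):
    """Convert a unit quaternion to a 3x3 rotation matrix.

    Assumes scalar-first ordering (w, x, y, z); this is an illustrative choice.
    """
    qw, qx, qy, qz = q
    return [
        [1 - 2 * (qy * qy + qz * qz), 2 * (qx * qy - qw * qz), 2 * (qx * qz + qw * qy)],
        [2 * (qx * qy + qw * qz), 1 - 2 * (qx * qx + qz * qz), 2 * (qy * qz - qw * qx)],
        [2 * (qx * qz - qw * qy), 2 * (qy * qz + qw * qx), 1 - 2 * (qx * qx + qy * qy)],
    ]
```

Normalizing inside `split_pose` keeps the rotation matrix orthonormal even when the raw quaternion has arbitrary scale, which is why the two steps are kept separate here.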

Unsupervised Learning of Depth and Ego-Motion From Video

(CVPR 2017)

link to the paper

GeoNet

Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose (CVPR 2018)

link to the paper

link to the repository

GeoNet

Rigid structure reconstructor
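One common way to turn a predicted depth and a relative camera motion into per-pixel "rigid flow" is to back-project the pixel, move the 3D point, and re-project it. The sketch below assumes a pinhole camera with intrinsics (fx, fy, cx, cy) and a static scene; the function name `rigid_flow` and its argument layout are hypothetical and are not taken from the GeoNet repository.

```python
def rigid_flow(u, v, depth, intrinsics, rotation, translation):
    """Flow of pixel (u, v) induced purely by camera motion in a static scene.

    intrinsics: (fx, fy, cx, cy) of an assumed pinhole camera.
    rotation: 3x3 nested list, translation: 3 values, mapping target-frame
    points into the source frame.
    """
    fx, fy, cx, cy = intrinsics

    # Back-project the pixel to a 3D point in the target camera frame.
    X = (u - cx) / fx * depth
    Y = (v - cy) / fy * depth
    Z = depth

    # Apply the rigid camera motion.
    Xs = rotation[0][0] * X + rotation[0][1] * Y + rotation[0][2] * Z + translation[0]
    Ys = rotation[1][0] * X + rotation[1][1] * Y + rotation[1][2] * Z + translation[1]
    Zs = rotation[2][0] * X + rotation[2][1] * Y + rotation[2][2] * Z + translation[2]
    if Zs <= 0:
        raise ValueError("point falls behind the source camera")

    # Project into the source image and subtract the original pixel location.
    us = fx * Xs / Zs + cx
    vs = fy * Ys / Zs + cy
    return us - u, vs - v
```

With an identity rotation and zero translation the flow is zero everywhere; a pure sideways translation gives a flow that shrinks as depth grows, which is the parallax cue that ties depth and camera pose together.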

Non-rigid motion localizer

Geometric consistency enforcement