updates?

2025-11-14 11:15:12 -06:00
parent 1b75ef050f
commit 0597afb511
14 changed files with 746 additions and 411 deletions
--- a/content/CSE5519/CSE5519_E5.md
+++ b/content/CSE5519/CSE5519_E5.md
@@ -1,2 +1,15 @@
 # CSE5519 Advances in Computer Vision (Topic E: 2025: Deep Learning for Geometric Computer Vision)

+## VGGT: Visual Geometry Grounded Transformer
+
+[link to paper](https://arxiv.org/pdf/2503.11651)
+
+### Novelty in VGGT
+
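+VGGT encodes the input views with alternating attention: frame-wise self-attention within each image is interleaved with global self-attention across all images.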
+
+> [!TIP]
+>
+> VGGT is a feed-forward neural network that uses alternating attention to directly infer the key 3D attributes of a scene (camera parameters, depth maps, point maps, and 3D point tracks), and it is robust to some non-rigid deformations.
+> 
+> I wonder how this model adapts to different lighting conditions for the same image, how non-Lambertian reflectance is captured, and how the framework could be extended to recover the true color of objects and evaluate their surface properties.