From a9d84cb2bbdcb67d5d38748045758c966e2eac59 Mon Sep 17 00:00:00 2001 From: Zheyuan Wu <60459821+Trance-0@users.noreply.github.com> Date: Mon, 3 Nov 2025 01:30:59 -0600 Subject: [PATCH] updates --- content/CSE5519/CSE5519_E4.md | 17 +++++++++++++++++ content/Math401/Extending_thesis/Math401_R1.md | 4 ++-- content/Math401/Extending_thesis/Math401_S4.md | 2 +- 3 files changed, 20 insertions(+), 3 deletions(-) diff --git a/content/CSE5519/CSE5519_E4.md b/content/CSE5519/CSE5519_E4.md index fad8c7d..fde53c1 100644 --- a/content/CSE5519/CSE5519_E4.md +++ b/content/CSE5519/CSE5519_E4.md @@ -1,2 +1,19 @@ # CSE5519 Advances in Computer Vision (Topic E: 2024: Deep Learning for Geometric Computer Vision) +## DUSt3R: Geometric 3D Vision Made Easy.Links to an external site. + +[link to paper](https://arxiv.org/pdf/2312.14132) + +### Novelty in DUSt3R + +Use point map to represent the 3D scene, combining with the camera intrinsics to estimate the 3D scene. + +Direct-RGB to 3D scene. + +Use ViT to encode the image, and then use two Transformer decoder (with information sharing between them) to decode the two representation of the same scene $F_1$ and $F_2$. Direct regression from RGB to point map and confidence map. + +>[!TIP] +> +> Compared with previous works, this paper directly regresses the point map and confidence map from RGB, producing a more accurate and efficient 3D scene representation. +> +> However, I'm not sure how the information across the two representations is shared in the Transformer decoder. If for a multiview image, there are two pairs of images that don't have any overlapping region, how can the model correctly reconstruct the 3D scene? diff --git a/content/Math401/Extending_thesis/Math401_R1.md b/content/Math401/Extending_thesis/Math401_R1.md index 1eb3166..75b004b 100644 --- a/content/Math401/Extending_thesis/Math401_R1.md +++ b/content/Math401/Extending_thesis/Math401_R1.md @@ -234,7 +234,7 @@ Then the measurable space $(\Omega, \mathscr{B}(\mathbb{C}), \lambda)$ is a meas If $\Omega=\mathbb{R}$, then we denote such measurable space as $L^2(\mathbb{R}, \lambda)$. -
+
#### Probability space @@ -426,7 +426,7 @@ is a pure state. -## Drawing the connection between the space $S^{2n+1}$, $CP^n$, and $\mathbb{R}$ +## Drawing the connection between the space $S^{2n+1}$, $\mathbb{C}P^n$, and $\mathbb{R}$ A pure quantum state of size $N$ can be identified with a **Hopf circle** on the sphere $S^{2N-1}$. diff --git a/content/Math401/Extending_thesis/Math401_S4.md b/content/Math401/Extending_thesis/Math401_S4.md index 629a03d..0c00a32 100644 --- a/content/Math401/Extending_thesis/Math401_S4.md +++ b/content/Math401/Extending_thesis/Math401_S4.md @@ -1,4 +1,4 @@ -# Math 401, Fall 2025: Thesis notes, S4, Bargmann space +# Math 401, Fall 2025: Thesis notes, S4, Complex function spaces and complex manifold ## Bargmann space (original)