updates

2025-10-23 13:39:36 -05:00
parent 5ac36745e2
commit dbb201ef37
7 changed files with 165 additions and 1 deletions
--- a/content/CSE5519/CSE5519_A3.md
+++ b/content/CSE5519/CSE5519_A3.md
@@ -1,2 +1,21 @@
 # CSE5519 Advances in Computer Vision (Topic A: 2023 - 2024: Semantic Segmentation)

+## Segment Anything
+
+[link to the paper](https://arxiv.org/pdf/2304.02643)
+
+### Novelty in Segment Anything
+
+Brute force approach with large scale training data (400x) more
+
+#### Dataset construction
+
+- Model-assisted manual annotation
+- Semi-automatic annotation
+- Automatic annotation (predict mask for 32x32 patches)
+
+> [!TIP]
+>
+> This paper shows a remarkable breakthrough in semantic segmentation with a brute force approach using a large scale training data. The authors use a transformer encoder to get the final segmentation map.
+>
+> I'm really interested in the scalability of the model. Is there any approach to reduce the training data size or the model size with comparable performance via distillation or other techniques?