This commit is contained in:
Zheyuan Wu
2025-10-23 13:39:36 -05:00
parent 5ac36745e2
commit dbb201ef37
7 changed files with 165 additions and 1 deletions

View File

@@ -1,2 +1,21 @@
# CSE5519 Advances in Computer Vision (Topic A: 2023 - 2024: Semantic Segmentation)
## Segment Anything
[link to the paper](https://arxiv.org/pdf/2304.02643)
### Novelty in Segment Anything
Brute force approach with large scale training data (400x) more
#### Dataset construction
- Model-assisted manual annotation
- Semi-automatic annotation
- Automatic annotation (predict mask for 32x32 patches)
> [!TIP]
>
> This paper shows a remarkable breakthrough in semantic segmentation with a brute force approach using a large scale training data. The authors use a transformer encoder to get the final segmentation map.
>
> I'm really interested in the scalability of the model. Is there any approach to reduce the training data size or the model size with comparable performance via distillation or other techniques?