diff --git a/content/CSE5519/CSE5519_I3.md b/content/CSE5519/CSE5519_I3.md
index 0ebf170..3d265a9 100644
--- a/content/CSE5519/CSE5519_I3.md
+++ b/content/CSE5519/CSE5519_I3.md
@@ -8,3 +8,8 @@
 
 VLA, vision-language-action models.
 
+> [!TIP]
+>
+> This paper shows a new way to transfer web knowledge to robotic control. The key is to use a vision-language-action model to transfer the knowledge from the web to the robotic control.
+>
+> I'm considering how this framework could be migrated to two-hand robotic control. In general case, the action is solely done by one hand, but in most real-world applications, the action is done by two hands. I wonder if this framework could be extended to two-hand robotic control?
\ No newline at end of file