updates

2025-10-23 13:39:36 -05:00
parent 5ac36745e2
commit dbb201ef37
7 changed files with 165 additions and 1 deletions
--- a/content/CSE510/CSE510_L17.md
+++ b/content/CSE510/CSE510_L17.md
@@ -55,6 +55,27 @@ Close-loop planning:

 - At each state, iteratively build a search tree to evaluate actions, select the best-first action, and the move the next state.

+Use model as simulator to evaluate actions.
+
+#### MCTS Algorithm Overview
+
+1. Selection: Select the best-first action from the search tree
+2. Expansion: Add a new node to the search tree
+3. Simulation: Simulate the next state from the selected action
+4. Backpropagation: Update the values of the nodes in the search tree
+
+#### Policies in MCTS
+
+Tree policy:
+
+Decision policy:
+
+- Max (highest weight)
+- Robust (most visits)
+- Max-Robust (max of the two)
+
+#### Upper Confidence Bound on Trees (UCT)
+


 #### Continuous Case: Trajectory Optimization