This commit is contained in:
Zheyuan Wu
2025-11-02 20:11:37 -06:00
parent 248051db0d
commit 8917fb124f
15 changed files with 43 additions and 21 deletions

View File

@@ -214,11 +214,13 @@ What is not used:
\newpage
\item [2.4] Bonus (20pt)
% \begin{figure}[H]
% \centering
% \includegraphics[width=0.8\textwidth]{images/p241.png}
% \caption{Learning Curve for Average Return for HalfCheetah with Berkely Parameters}
% \end{figure}
\begin{figure}[H]
\centering
\includegraphics[width=0.8\textwidth]{images/p24.png}
\caption{Learning Curve for Average Return for HalfCheetah with Berkely Parameters}
\end{figure}
Unfortunately, the experiments with Berkely parameters do not converge to the maximum reward of 300. We tested different batch sizes and learning rates, but the results are still not satisfactory. Even with increasing epoch, the performance is still not beyond random movement.
\end{enumerate}