[2025-10-12 00:26:21,193][__main__][INFO] - Training for 50000 timesteps with NormalQNetwork and Prioritized10StepReplayBuffer
[2025-10-12 00:26:37,205][core][INFO] - Step: 2000, Eval mean: 12.2, Eval std: 0.8717797887081347
[2025-10-12 00:26:54,645][core][INFO] - Step: 4000, Eval mean: 9.6, Eval std: 1.2806248474865698
[2025-10-12 00:27:12,056][core][INFO] - Step: 6000, Eval mean: 9.2, Eval std: 0.6
[2025-10-12 00:27:29,842][core][INFO] - Step: 8000, Eval mean: 21.6, Eval std: 5.730619512757761
[2025-10-12 00:27:48,840][core][INFO] - Step: 10000, Eval mean: 227.8, Eval std: 182.14543639630392
[2025-10-12 00:28:09,044][core][INFO] - Step: 12000, Eval mean: 432.6, Eval std: 135.09641001891944
[2025-10-12 00:28:29,323][core][INFO] - Step: 14000, Eval mean: 360.9, Eval std: 121.26866866590068
[2025-10-12 00:28:49,316][core][INFO] - Step: 16000, Eval mean: 368.7, Eval std: 105.315763302556
[2025-10-12 00:29:09,905][core][INFO] - Step: 18000, Eval mean: 370.6, Eval std: 114.3268997218065
[2025-10-12 00:29:30,078][core][INFO] - Step: 20000, Eval mean: 380.4, Eval std: 108.37545847653887
[2025-10-12 00:29:50,352][core][INFO] - Step: 22000, Eval mean: 436.0, Eval std: 89.43489251964247
[2025-10-12 00:30:10,478][core][INFO] - Step: 24000, Eval mean: 437.1, Eval std: 93.34286260877154
[2025-10-12 00:30:30,447][core][INFO] - Step: 26000, Eval mean: 451.8, Eval std: 83.40119903214821
[2025-10-12 00:30:50,912][core][INFO] - Step: 28000, Eval mean: 439.1, Eval std: 85.54115968351142
[2025-10-12 00:31:11,646][core][INFO] - Step: 30000, Eval mean: 454.0, Eval std: 69.80257874892588
[2025-10-12 00:31:32,031][core][INFO] - Step: 32000, Eval mean: 423.2, Eval std: 75.24333857558422
[2025-10-12 00:31:52,381][core][INFO] - Step: 34000, Eval mean: 386.5, Eval std: 103.64386137152552
[2025-10-12 00:32:12,858][core][INFO] - Step: 36000, Eval mean: 456.8, Eval std: 84.32057874564192
[2025-10-12 00:32:33,233][core][INFO] - Step: 38000, Eval mean: 448.7, Eval std: 96.5691979877642
[2025-10-12 00:32:53,194][core][INFO] - Step: 40000, Eval mean: 455.4, Eval std: 83.44123680770797
[2025-10-12 00:33:13,862][core][INFO] - Step: 42000, Eval mean: 431.9, Eval std: 89.02634441557173
[2025-10-12 00:33:34,167][core][INFO] - Step: 44000, Eval mean: 426.1, Eval std: 91.27370924861113
[2025-10-12 00:33:54,320][core][INFO] - Step: 46000, Eval mean: 422.8, Eval std: 84.57635603405954
[2025-10-12 00:34:14,480][core][INFO] - Step: 48000, Eval mean: 439.7, Eval std: 94.10850121003946