[2025-10-11 23:43:53,676][__main__][INFO] - Training for 50000 timesteps with NormalQNetwork and PrioritizedReplayBuffer
[2025-10-11 23:44:10,340][core][INFO] - Step: 2000, Eval mean: 167.4, Eval std: 44.43467114765226
[2025-10-11 23:44:28,836][core][INFO] - Step: 4000, Eval mean: 193.3, Eval std: 37.97380676203006
[2025-10-11 23:44:47,985][core][INFO] - Step: 6000, Eval mean: 100.3, Eval std: 2.7586228448267445
[2025-10-11 23:45:07,025][core][INFO] - Step: 8000, Eval mean: 110.7, Eval std: 4.050925820105819
[2025-10-11 23:45:26,143][core][INFO] - Step: 10000, Eval mean: 116.7, Eval std: 3.28785644455472
[2025-10-11 23:45:45,589][core][INFO] - Step: 12000, Eval mean: 128.9, Eval std: 3.6999999999999997
[2025-10-11 23:46:04,629][core][INFO] - Step: 14000, Eval mean: 102.4, Eval std: 2.4576411454889016
[2025-10-11 23:46:24,888][core][INFO] - Step: 16000, Eval mean: 283.4, Eval std: 24.920674148184673
[2025-10-11 23:46:46,747][core][INFO] - Step: 18000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:47:09,101][core][INFO] - Step: 20000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:47:30,699][core][INFO] - Step: 22000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:47:51,303][core][INFO] - Step: 24000, Eval mean: 142.5, Eval std: 4.5
[2025-10-11 23:48:14,734][core][INFO] - Step: 26000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:48:37,095][core][INFO] - Step: 28000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:48:59,772][core][INFO] - Step: 30000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:49:20,254][core][INFO] - Step: 32000, Eval mean: 105.8, Eval std: 2.638181191654584
[2025-10-11 23:49:41,804][core][INFO] - Step: 34000, Eval mean: 290.0, Eval std: 92.10971718553913
[2025-10-11 23:50:05,661][core][INFO] - Step: 36000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:50:29,141][core][INFO] - Step: 38000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:50:50,699][core][INFO] - Step: 40000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:51:12,136][core][INFO] - Step: 42000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:51:33,089][core][INFO] - Step: 44000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:51:54,989][core][INFO] - Step: 46000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:52:18,438][core][INFO] - Step: 48000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:52:41,758][core][INFO] - Step: 50000, Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:53:06,483][core][INFO] - Final Eval mean: 500.0, Eval std: 0.0
[2025-10-11 23:53:12,682][__main__][INFO] - Finish training with eval mean: 500.0