16 lines
1.4 KiB
Plaintext
16 lines
1.4 KiB
Plaintext
[2025-10-11 23:26:03,897][__main__][INFO] - Training for 50000 timesteps with DuelingQNetwork and NormalReplayBuffer
|
|
[2025-10-11 23:26:17,366][core][INFO] - Step: 2000, Eval mean: 226.0, Eval std: 42.757455490241696
|
|
[2025-10-11 23:26:31,018][core][INFO] - Step: 4000, Eval mean: 106.6, Eval std: 3.49857113690718
|
|
[2025-10-11 23:26:44,765][core][INFO] - Step: 6000, Eval mean: 111.5, Eval std: 3.1064449134018135
|
|
[2025-10-11 23:26:58,783][core][INFO] - Step: 8000, Eval mean: 96.8, Eval std: 2.821347195933177
|
|
[2025-10-11 23:27:13,637][core][INFO] - Step: 10000, Eval mean: 112.4, Eval std: 4.476605857119878
|
|
[2025-10-11 23:27:30,697][core][INFO] - Step: 12000, Eval mean: 257.3, Eval std: 5.692978130996114
|
|
[2025-10-11 23:27:49,873][core][INFO] - Step: 14000, Eval mean: 500.0, Eval std: 0.0
|
|
[2025-10-11 23:28:06,234][core][INFO] - Step: 16000, Eval mean: 113.8, Eval std: 3.515679166249389
|
|
[2025-10-11 23:28:22,582][core][INFO] - Step: 18000, Eval mean: 182.2, Eval std: 8.022468448052631
|
|
[2025-10-11 23:28:41,446][core][INFO] - Step: 20000, Eval mean: 500.0, Eval std: 0.0
|
|
[2025-10-11 23:29:00,468][core][INFO] - Step: 22000, Eval mean: 500.0, Eval std: 0.0
|
|
[2025-10-11 23:29:19,606][core][INFO] - Step: 24000, Eval mean: 500.0, Eval std: 0.0
|
|
[2025-10-11 23:29:39,300][core][INFO] - Step: 26000, Eval mean: 500.0, Eval std: 0.0
|
|
[2025-10-11 23:29:55,951][core][INFO] - Step: 28000, Eval mean: 140.2, Eval std: 3.4871191548325386
|