16 lines
1.4 KiB
Plaintext
16 lines
1.4 KiB
Plaintext
[2025-10-11 22:45:50,761][__main__][INFO] - Training for 50000 timesteps with NormalQNetwork and NormalReplayBuffer
|
|
[2025-10-11 22:46:00,290][core][INFO] - Step: 2000, Eval mean: 9.4, Eval std: 0.66332495807108
|
|
[2025-10-11 22:46:10,843][core][INFO] - Step: 4000, Eval mean: 9.4, Eval std: 0.66332495807108
|
|
[2025-10-11 22:46:21,821][core][INFO] - Step: 6000, Eval mean: 9.2, Eval std: 0.6
|
|
[2025-10-11 22:46:33,067][core][INFO] - Step: 8000, Eval mean: 9.4, Eval std: 0.66332495807108
|
|
[2025-10-11 22:46:44,081][core][INFO] - Step: 10000, Eval mean: 9.2, Eval std: 0.6
|
|
[2025-10-11 22:46:55,719][core][INFO] - Step: 12000, Eval mean: 12.1, Eval std: 2.4269322199023193
|
|
[2025-10-11 22:47:07,463][core][INFO] - Step: 14000, Eval mean: 9.2, Eval std: 0.6
|
|
[2025-10-11 22:47:19,411][core][INFO] - Step: 16000, Eval mean: 9.4, Eval std: 0.66332495807108
|
|
[2025-10-11 22:47:31,631][core][INFO] - Step: 18000, Eval mean: 9.4, Eval std: 0.66332495807108
|
|
[2025-10-11 22:47:44,763][core][INFO] - Step: 20000, Eval mean: 9.4, Eval std: 0.66332495807108
|
|
[2025-10-11 22:47:57,803][core][INFO] - Step: 22000, Eval mean: 10.5, Eval std: 0.9219544457292888
|
|
[2025-10-11 22:48:10,465][core][INFO] - Step: 24000, Eval mean: 9.4, Eval std: 0.66332495807108
|
|
[2025-10-11 22:48:22,940][core][INFO] - Step: 26000, Eval mean: 9.4, Eval std: 0.66332495807108
|
|
[2025-10-11 22:48:35,661][core][INFO] - Step: 28000, Eval mean: 9.4, Eval std: 0.66332495807108
|