Files
CSE5100H2/runs/2025-10-11/22-09-31_/main.log
Zheyuan Wu 250f763f1f done?
2025-10-12 00:55:07 -05:00

8 lines
1.0 KiB
Plaintext

[2025-10-11 22:09:33,607][__main__][INFO] - Training for 50000 timesteps with NormalQNetwork and NormalReplayBuffer
[2025-10-11 22:09:33,902][py.warnings][WARNING] - d:\Documents\Nextcloud\Documents\Project WUSTL\Academic\2025_Fall\CSE5100\Homeworks\hw2\hw2\agent.py:55: UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.detach().clone() or sourceTensor.detach().clone().requires_grad_(True), rather than torch.tensor(sourceTensor).
max_tensor = torch.max(self.target_net(torch.tensor(next_state).to(self.device)).cpu(), dim=1)
[2025-10-11 22:09:33,932][py.warnings][WARNING] - d:\Documents\Nextcloud\Documents\Project WUSTL\Academic\2025_Fall\CSE5100\Homeworks\hw2\hw2\agent.py:57: UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.detach().clone() or sourceTensor.detach().clone().requires_grad_(True), rather than torch.tensor(sourceTensor).
tensor_arr = np.where(done, torch.tensor(reward).to(self.device), torch.tensor(reward).to(self.device) + max_tensor * gamma_tensor)