[2025-10-11 21:56:28,491][__main__][INFO] - Training for 50000 timesteps with NormalQNetwork and NormalReplayBuffer [2025-10-11 21:56:28,783][py.warnings][WARNING] - d:\Documents\Nextcloud\Documents\Project WUSTL\Academic\2025_Fall\CSE5100\Homeworks\hw2\hw2\agent.py:55: UserWarning: To copy construct from a tensor, it is recommended to use sourceTensor.detach().clone() or sourceTensor.detach().clone().requires_grad_(True), rather than torch.tensor(sourceTensor). return torch.tensor(reward)