-
Notifications
You must be signed in to change notification settings - Fork 18
Open
Description
Why we pass (next_obs, next_obs)? It should (obs, next_obs) right? Because you are optimizing for the entropy of
Line 224 in b523c38
| intr_reward = self.compute_apt_reward(next_obs,next_obs) |
Kaixhin, Harimus and Howuhh
Metadata
Metadata
Assignees
Labels
No labels