https://github.com/zj-zhang/AMBER/blob/master/amber/architect/optim/controller/pytorch/generalController.py#L120C18-L120C45 default = True add benchmarking between `rescale_advantage_by_reward=False` vs `rescale_advantage_by_reward=True`