[Algo] Added sac codebase #5

ShahRutav · 2023-01-11T21:39:01Z

No description provided.


vmoens · 2023-01-23T18:13:04Z

scripts/sac_mujoco/sac_loss.py

+    _has_functorch = False
+
+
+class SACLoss(LossModule):


How does that differ from the TorchRL SAC exactly? If there's an extra feature I'd prefer to add it to torchrl directly, wdyt?

I used the local sac_loss because TorchRL's sac.py requires you to pass three networks: actor, qvalue, and value. As far as I remember, SAC doesn't have a value function.

I implemented that by rigorously following what the paper presents, but if it works better with one net only, we can add that as an option.

https://arxiv.org/abs/1801.01290

This is SAC-v1; I think SAC-v2 (https://arxiv.org/abs/1812.05905) is more commonly used. Check out section 4.2 in the paper.

Here's the pseudo code that they have used:

In my opinion, it would be worth adding the v2 implementation of SACLoss. What do you think?

I agree. My point is mainly that rather than coding up a new SAC, we should simply add the v2 to the SAC loss. As it is now, we're sort of saying "TorchRL has everything you need... but they got SAC wrong so here's a patch"

Can you have a look at pytorch/rl#864?
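For readers following this thread, the difference between the two variants can be sketched as the bootstrap target each one uses for the Q-function. This is a minimal scalar sketch, not the TorchRL implementation: the function names, argument names, and default values are assumptions made for illustration. SAC-v1 bootstraps from a separate target value network, while SAC-v2 drops the value network and estimates the soft value of the next state from twin target Q-networks and the policy's log-probability.

```python
def sac_v1_q_target(reward, done, next_value_target, gamma=0.99):
    """SAC-v1 style target (arXiv:1801.01290).

    Bootstraps from a separate target value network evaluated at s'.
    `next_value_target` is that network's output (hypothetical name).
    """
    return reward + gamma * (1.0 - done) * next_value_target


def sac_v2_q_target(reward, done, next_q1_target, next_q2_target,
                    next_log_prob, alpha=0.2, gamma=0.99):
    """SAC-v2 style target (arXiv:1812.05905).

    No value network: the soft value of s' is the minimum of two target
    Q-values at (s', a') minus alpha * log pi(a' | s'), with a' sampled
    from the current policy. All argument names are hypothetical.
    """
    soft_value = min(next_q1_target, next_q2_target) - alpha * next_log_prob
    return reward + gamma * (1.0 - done) * soft_value


if __name__ == "__main__":
    # Toy numbers only, to show the call pattern.
    print(sac_v1_q_target(reward=1.0, done=0.0, next_value_target=2.0))
    print(sac_v2_q_target(reward=1.0, done=0.0, next_q1_target=2.0,
                          next_q2_target=3.0, next_log_prob=-1.0))
```

In a real loss module these would be batched tensor operations, and alpha would typically be a learned temperature; the scalar form is used here only to make the structural difference visible.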

scripts/sac_mujoco/sac.py

vmoens · 2023-01-24T11:24:51Z

rlhive/sim_algos/helpers/rrl_transform.py

+    _has_tv = False
+
+
+class _RRLNet(Transform):


I don't really see why we need a new env for this. We could create R3M with download=False and load the state dict from torchvision, no?

I am not 100% sure whether the architecture of R3M is different from the torchvision ResNet module. Plus, I think this is a cleaner way to do it, but we can switch to loading weights if you prefer.

> I am not 100% sure whether the architecture of R3M is different from the torchvision ResNet module

What would be different? The only thing that pretrained=True does is load a state_dict; the architecture is 100% the same.
Have a look at my PR on torchrl.

Oh cool. I have never tested the R3M backbone against the ResNet backbone, but they might be exactly the same. Thanks! I will take a look and update the code.
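One way to settle the question raised in this thread is to compare parameter names and shapes between the model and the checkpoint: if they match, the weights can be loaded as a plain state dict. The sketch below works on dictionaries mapping parameter names to shapes; the helper names, the `encoder.` prefix, and the toy shapes are all hypothetical and do not describe the actual R3M checkpoint layout. With PyTorch, such a mapping could be built with `{k: tuple(v.shape) for k, v in model.state_dict().items()}`.

```python
def strip_prefix(state_dict, prefix):
    """Return a copy of `state_dict` with `prefix` removed from matching keys."""
    return {
        (key[len(prefix):] if key.startswith(prefix) else key): value
        for key, value in state_dict.items()
    }


def compare_state_dicts(model_shapes, checkpoint_shapes):
    """Compare two {parameter_name: shape} mappings.

    Returns (missing, unexpected, mismatched):
      missing    - names the model expects but the checkpoint lacks
      unexpected - names in the checkpoint the model does not have
      mismatched - shared names whose shapes differ
    """
    model_keys = set(model_shapes)
    ckpt_keys = set(checkpoint_shapes)
    missing = sorted(model_keys - ckpt_keys)
    unexpected = sorted(ckpt_keys - model_keys)
    mismatched = sorted(
        key for key in model_keys & ckpt_keys
        if tuple(model_shapes[key]) != tuple(checkpoint_shapes[key])
    )
    return missing, unexpected, mismatched


if __name__ == "__main__":
    # Toy shapes for illustration only; not read from any real checkpoint.
    model = {"conv1.weight": (64, 3, 7, 7), "fc.weight": (1000, 512)}
    checkpoint = {"encoder.conv1.weight": (64, 3, 7, 7)}
    cleaned = strip_prefix(checkpoint, "encoder.")
    print(compare_state_dicts(model, cleaned))
```

If all three lists come back empty, the two architectures agree at the parameter level. PyTorch's `load_state_dict(..., strict=False)` reports missing and unexpected keys in a similar way.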


Added sac codebase. Works independently.

b3068c9

facebook-github-bot added the CLA Signed label Jan 11, 2023

Added small test codebase.

f37ac89

ShahRutav linked an issue Jan 11, 2023 that may be closed by this pull request

Error in running set_info_dict_reader #6

Closed

vmoens changed the title ~~Added sac codebase.~~ [Algo] Added sac codebase Jan 12, 2023

ShahRutav added 2 commits January 12, 2023 21:51

Merge branch 'dev' into sac_dev

09bad16

test.py updated with another bug

6c03e9c

ShahRutav linked an issue Jan 13, 2023 that may be closed by this pull request

Error in creating transformed env with R3M #8

Closed

ShahRutav and others added 13 commits January 13, 2023 12:59

small change with updated torchrl

50ae2e0

working sac codebase. cleanup

f2d9b43

added installation script. sac configs correct

b576682

Added a new running instruction for SAC+R3M

2f07d0c

Fixed readme

e6067c4

Added redq codebase from torchrl

c6084e8

Merge branch 'sac_dev' of github.com:facebookresearch/rlhive into sac_dev

d39bd0c

updated redq script with robohive env

1f02c30

Added RRLTransform

fab9084

moved rrl_transform inside helpers

76e601a

Updated README with parameter sweep

850c3d9

updated redq with action, state, and obs norms

2a942ab

Merge branch 'sac_dev' of github.com:facebookresearch/rlhive into sac_dev

bd932d3

vmoens reviewed Jan 24, 2023


vmoens and others added 8 commits January 24, 2023 11:38

Merge branch 'dev' into sac_dev

bc49e48

Merge branch 'sac_dev' of https://github.com/facebookresearch/rlhive into sac_dev

e3cd33d

updated the code with torchrl sacloss and rrl transform

5823199

init

e68f917

amend

721394c

amend

47dbc8a

amend

e120d7b

amend

582020c

vmoens and others added 30 commits January 27, 2023 19:00

amend

c85a24d

amend

bbcd73d

rl_env updated for state based experiments

dc68e2e

amend

faa46de

init

e895912

amend

3da5e5c

amend

eee0d4b

minor

2e5e1e6

Some more info in GET_STARTED.md

caa66e1

Fix ref to wandb

c935d24

cleanup

1af25a9

init

ad20206

amend

1bbddd4

amend

fea42b2

amend

a43e2a4

amend

8cb852d

amend

deeb272

amend

ff4895a

amend

3224ec2

amend

4573419

amend

97180ae

amend

7106f01

amend

f71a155

amend

a28404b

amend

1ac5466

amend

a7be171

amend

22d91cb

merged with sac_example

0659cca

moving the sac_loss to local file

1a6e527

updated with rrl,r3m,flatten transforms, added visual hand envs

c521fcd


4 participants
