Replies: 1 comment
-
Thanks for posting this. I will move this to our issues and tag it as an enhancement proposal for the team to consider.
-
Hello,
I would like to discuss and propose some improvements to the SB3 integration with Isaac Lab (as a follow-up to #528).
I've been playing around with IsaacLab and SB3/SBX (SBX = SB3 + JAX), and I have some improvements to suggest and some questions:
To reproduce, using my fork (https://github.com/araffin/IsaacLab, branch /feat/sb3-update):
- Default, runs at 10 000 steps/s
- Fast variant, runs at 35 000 steps/s (SBX) and 30 000 steps/s
- Fastest variant but incorrect (truncation not handled, lower performance): --no--info flag, runs at 40 000 steps/s (similar to rsl_rl)

Side remark: while playing around, I noticed that the rsl_rl PPO hyperparameters don't seem to be tuned for the A1 task. I managed to get much higher rewards with the SB3/SBX implementation (mean reward ~30 vs ~10 for rsl_rl, probably because they don't handle truncation?).
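For readers unfamiliar with the truncation point above: when an episode ends because of a time limit rather than a true terminal state, on-policy algorithms usually bootstrap from the value of the last observation instead of treating the return as zero afterwards. Below is a minimal NumPy sketch of that idea, not the code from the fork or from rsl_rl. The function name, its parameters, and the assumption that per-env value estimates for the final observations are already available are all hypothetical.

```python
import numpy as np


def bootstrap_truncated_rewards(rewards, truncated, terminal_values, gamma=0.99):
    """Add gamma * V(s_final) to the reward of envs that hit a time limit.

    Hypothetical helper for illustration only.
    rewards:         per-env rewards for the current step, shape (n_envs,)
    truncated:       True where the episode ended by timeout, shape (n_envs,)
    terminal_values: critic estimates for the final observations, shape (n_envs,)
    """
    # Copy so the caller's reward buffer is left untouched.
    rewards = np.asarray(rewards, dtype=np.float64).copy()
    truncated = np.asarray(truncated, dtype=bool)
    terminal_values = np.asarray(terminal_values, dtype=np.float64)
    # Only truncated envs get the bootstrap term; real terminations do not.
    rewards[truncated] += gamma * terminal_values[truncated]
    return rewards


if __name__ == "__main__":
    step_rewards = np.array([1.0, 1.0, 1.0])
    timeouts = np.array([False, True, False])
    values = np.array([5.0, 10.0, 0.0])
    print(bootstrap_truncated_rewards(step_rewards, timeouts, values, gamma=0.5))
```

Skipping this step means the final observations (and the extra info needed to identify timeouts) never have to be processed, which is faster. The cost is that timeouts are treated like failures, which may bias the value estimates.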