Discrepancy Between Training and Play Performance in Isaac Sim (SKRL PPO) #2219

RodsCoimbra · 2025-04-01T12:41:39Z

Hi,
I’ve noticed a significant difference between how my agent behaves during training (train.py) and how it performs when I play back the trained policy. In the videos recorded during training, the agents consistently reach the target. When I play back the policy with play.py, however, the behavior is completely different: the agents never reach the target.

Training:
https://github.com/user-attachments/assets/ffb492e8-2c4a-4a12-afba-2ddd48d0c8d6

Playing:
https://github.com/user-attachments/assets/49d8f4c5-59f7-44bc-8fa8-97da556cd8bb

I’m training with SKRL’s PPO in Isaac Sim 4.5 + Isaac Lab 2.0.
Any ideas on what might be causing this, or whether it could be a bug?
Thanks!

RandomOakForest (Maintainer) · 2025-04-01T21:58:34Z

Thank you for posting this. I will move this post to our Discussions section for the team to follow up.

giangdao1402 · 2025-05-13T05:34:44Z

@RodsCoimbra Have you figured out a way to solve this problem? I ran into the same one.
