Fig. 5

Scalability evaluation on PyBullet and Isaac environments. Learning curves (average return vs. environment steps) comparing TDC-λ and SAC on four PyBullet locomotion tasks and two Isaac manipulation tasks. Shaded regions indicate variability across independent runs.