Issues
Reward Shape
#28 opened by QiyaoWei - 6
right_to_left_pad optimization
#26 opened by vwxyzjn - 0
Creating a jax implementation
#5 opened by vwxyzjn - 3
Question about KL divergence computation
#25 opened by Maxtoq - 3
A question about `normalize_after`
#8 opened by liutianlin0121 - 2
Add accelerate to poetry dependencies
#9 opened by liutianlin0121 - 4