the reweard of "cup pouring" task is too low
csufangyu opened this issue · 1 comments
hi, I want to try to reproduce paper results,and I run the eval.sh.But the reward of "pour" only 5.047.
Is it because of my mistake.The results as followed:
`=> loading checkpoint /home/fangyu/robot_learning/isaac_project/oscar/examples/../trained_models/train/Pour/Pour_oscar__seed_1.pth'
reward: 4.269585132598877 steps: 148.0
reward: 1.1434197425842285 steps: 150.0
reward: 0.813079833984375 steps: 150.0
reward: 32.457889556884766 steps: 150.0
reward: 2.117696762084961 steps: 150.0
reward: 1.756449818611145 steps: 150.0
reward: 3.13059663772583 steps: 150.0
reward: 2.1976819038391113 steps: 150.0
reward: 1.295109748840332 steps: 150.0
reward: 1.2966032028198242 steps: 150.0
201.9124493598938
av reward: 5.047811233997345 av steps: 149.8
I believe as per our (offline) discussion, this has been resolved. (The newest Isaac Gym preview version 4 must be used).