Minor bug that removes best performing trajectory in gym experiments

I believe that this line should have a <= rather than a < in order for the code to not cut out the best performing trajectory even when using pct_traj = 1.

decision-transformer/gym/experiment.py

Line 109 in f04280e

while ind >= 0 and timesteps + traj_lens[sorted_inds[ind]] < num_timesteps:

To replicate, use a dataset with 2 trajectories and use pct_traj = 1. and the resulting num_trajectories will just be 1 rather than 2.

Thanks for catching this! I think you're right.