HumanCompatibleAI/imitation

Clean PyTorch implementations of imitation and reward learning algorithms

PythonMIT

Issues

"The observation_space and action_space were not given, can't verify new environments" when loading expert policy
#854 opened 2 months ago by boshi-an
1
Unable to load FeedForward32Policy -- multiple values for keyword argument 'net_arch'
#857 opened 2 months ago by glolichen
1
Robust termination Condition for equal horizon episodes
#856 opened 2 months ago by abastola0
0
The observation_space and action_space were not given, can't verify new environments
#819 opened 9 months ago by kavinwkp
3
GAIL tensorboard logging does not appear to work
#853 opened 3 months ago by eufrizz
0
TypeError: argument of type 'PosixPath' is not iterable
#852 opened 4 months ago by den-schmidt
1
Problem in Passing Along Custom Gym Env Constructor Parameters in make_vec_env
#850 opened 4 months ago by christianjcc
1
Serialize Dataset Save Not Working
#851 opened 4 months ago by alexpalms
1
Preference based Reinforcement Learning applies a "recurrent reward network" for solving a POMDP problem
#848 opened 5 months ago by CAI23sbP
0
Adding More State Only Imitation Learning Algorithms
#849 opened 5 months ago by HridayM25
0
Got an unexpected keyword argument 'use_sde' when passing behavioural cloning policy to PPO from SB3
#781 opened a year ago by JkAcktuator
5
RewardNetwork predict_processed doesn't work without next_state and done
#836 opened 9 months ago by gustavodemari
1
gen_replay_buffer_capacity VS demo_batch_size in tutorials code
#847 opened 5 months ago by piteren
0
difference between Step and Sample in Dagger
#846 opened 5 months ago by lwizard1999
0
Device conflict when training BC
#843 opened 6 months ago by IsaacSheidlower
1
Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!
#820 opened 10 months ago by kavinwkp
2
segmentation fault while loading policy
#845 opened 6 months ago by chenyangkang
1
abort and kernel crash with OMP libomp.dylib
#844 opened 6 months ago by chenyangkang
1
Utilizing Expert Data (.npz) in format of SB3.
#777 opened a year ago by azafar1991
4
Integration of Rule-Based Bot Actions for Imitation Learning
#835 opened 9 months ago by vladyskai
2
Rewards visualization
#840 opened 7 months ago by Rajesh-Siraskar
0
[Question] Training in Custom Environment
#839 opened 7 months ago by KingsDevs
0
[Question] Reward net transfer
#838 opened 8 months ago by risufaj
0
prob_true_act > 1 with PPO and BC?
#762 opened 9 months ago by ThomasRochefortB
2
Remove FloatReward when next SB3 is released
#794 opened 8 months ago by ZiyueWang25
0
Hierarchical Behavior Cloning
#834 opened 9 months ago by Dhanushvarma
0
Example notebook "1_train_bc.ipynb" gives "Namespace not found error" for "seals" module
#816 opened 9 months ago by Rajesh-Siraskar
3
Ensure all tutorials work as expected
#763 opened 9 months ago by ernestum
8
ReadTimeout during hyperparameter tuning
#804 opened 9 months ago by ZiyueWang25
1
Observation space not supported when getting expert trajectories in tutorial 7
#788 opened 9 months ago by camberg23
3
Trained reward function outputs constant zeros using MCE algorithm
#808 opened 9 months ago by spearsheep
1
error with setting up reward net
#809 opened 9 months ago by spearsheep
1
DAgger Demo Code not working
#814 opened 9 months ago by nil123532
2
Tensorboard Logging
#818 opened 9 months ago by mertalbaba
1
IQ-Learn
#813 opened 9 months ago by azafar1991
1
SyntheticGatherer often gives nearly deterministic feedback
#821 opened 10 months ago by timokau
1
Run time Error when run quickstart.py
#825 opened 10 months ago by Charles-Lim93
1
Examples Don't work
#815 opened 10 months ago by AdvanceXplorer
4
Improve pipeline speed and abort early
#789 opened a year ago by ernestum
0
tests/algorithms/test_sqil.py::test_sqil_performance_continuous[DDPG] failure
#791 opened a year ago by ZiyueWang25
1
Fix flaky SQIL test
#807 opened a year ago by AdamGleave
0
Mismatch between Tutorial doc and the actual jupyter notebook
#790 opened a year ago by ZiyueWang25
1
Shorten timeout and ensure the notebook can still show desired improvement.
#793 opened a year ago by ZiyueWang25
0
Update 8a_train_sqil_sac.ipynb to remove MuJoCo dependency
#798 opened a year ago by AdamGleave
0
Notebooks failing in readthedocs build does not trigger red status
#799 opened a year ago by AdamGleave
2
Add rgb observation to obs for interactive policy prediction
#795 opened a year ago by ZiyueWang25
0
Add CLI for SQIL
#780 opened a year ago by AdamGleave
2
Consider allowing integer reward in trajectories
#783 opened a year ago by PavelCz
0
Generalize SQIL to work with other off-policy algos
#767 opened a year ago by jas-ho
2
Don't train experts in the tutorials, download from HF instead
#764 opened a year ago by ernestum
1