rail-berkeley/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Python
NOASSERTION
Issues
- 1
Error on installing with docker
#167 opened by Mahdi-HF - 1
The issue of softlearning implementation
#203 opened by Ziyu0118 - 0
why target entropy is -dim(A)?
#200 opened by mittu1008 - 0
- 7
Question on initialization of alpha and entropy
#149 opened by dbsxdbsx - 2
target_entropy discrete_space
#162 opened by Maggern3 - 6
- 3
- 0
Differences between softlearning implementation and formula 18 in paper of alpha loss
#177 opened by Maggern3 - 1
Incompatible with ray 1.2.0
#169 opened by xanderdunn - 1
Incompatible with tensorflow 2.4.0
#170 opened by xanderdunn - 6
MultiGoal Env not working, please give instruction.
#165 opened by qlinsey - 1
SQL algorithm is not working
#164 opened by ivan-ji-walmart - 2
No module named 'example.instruments'
#161 opened by ndormann - 1
No module named 'examples.instrument'
#117 opened by ling-pan - 0
No rendering/headless mode available?
#158 opened by khatch31 - 0
- 0
error when running example
#152 opened by eleyng - 4
- 0
Results folder should be configurable through cli
#128 opened by hartikainen - 0
Non-MuJoCo examples?
#115 opened by david-masters - 1
Question on the soft q learning implementation
#143 opened by YuxuanSong - 5
- 0
SAC gradients weighted inconsistently
#141 opened by hartikainen - 2
base_policy to_yaml fails
#135 opened by kapsl - 3
Using keras functional API in model
#138 opened by kapsl - 3
Question about multiple CPU usage.
#137 opened by charlesjsun - 0
Training step limit breaks logging
#139 opened by hartikainen - 1
--restore not working anymore?
#134 opened by kapsl - 1
Possibility to show structure of the model
#133 opened by kapsl - 2
A fatal error occurred
#129 opened by Max-918 - 3
Several issues
#130 opened by kapsl - 0
examples/development/simulate_policy.py is broken
#131 opened by hartikainen - 2
Training speed of new code is much slower than before
#124 opened by kapsl - 2
Are there plans to integrate DisCor
#125 opened by kapsl - 1
tune.sample_from parallelized
#126 opened by kapsl - 0
Simulating policy now broken
#113 opened by nflu - 6
Multiple conflicts in requirements.txt
#123 opened by johannespitz - 2
Nan error in Humanoid
#120 opened by varun-intel - 3
Conda installation problem Ubuntu 18.04
#111 opened by Jendker - 1
'FeedforwardGaussianPolicy' object has no attribute '_Serializable__initialize'
#107 opened by ZhanPython - 5
trials did not complete error
#98 opened by surbhi1944 - 3
Can not save checkpoint
#104 opened by mohsinlakhani - 0
- 1
Installing on Docker
#105 opened by kapsl - 1
ModulesNotFound - systems.path related issue
#103 opened by weijiafeng - 2
Stuck on KMP_AFFINITY thread allocation issue
#102 opened by weijiafeng - 1
Benchmark Result
#101 opened by lujiayou123 - 1
Mujoco license - invalid key in docker container
#99 opened by mbed92 - 1
LSTM Support
#97 opened by pethor