tensorflow/agents

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

PythonApache-2.0

Issues

Incompatibility between PPO collect policy and SquashToSpecNormal Distribution
#944 opened a month ago by CHEyy-85
0
ImportError: Attempted to load a reverb dynamic library, but it could not find the required symbols inside of TensorFlow.
#937 opened 4 months ago by ace510
2
No module named 'tensorflow.python.training.tracking
#916 opened 10 months ago by Afshin818
2
Unexpected output from `actor_network`. Expected `Distribution` objects, but saw output spec: TensorSpec(shape=(4,), dtype=tf.int32, name=None)
#942 opened 3 months ago by mhdtslm
0
GaussianPolicy applies the same noise to all actions in the batch
#940 opened 3 months ago by zhezherun
0
to_n_step_transition returns wrong results if episode was truncated by the time limit wrapper and N > 1
#939 opened 3 months ago by zhezherun
0
Updating outdated documentation
#938 opened 4 months ago by Nour-Aldein2
0
tf-agents ubuntu 24.04 venv pip3 install error on pygame 2.1.3
#933 opened 4 months ago by johnny-littlepunch
2
ValueError: Network only supports action_specs with shape in [(), (1,)])
#934 opened 4 months ago by Mr-Elysium
0
Module not found error when working through 1_dqn_tutorial.ipynb
#932 opened 4 months ago by ace510
2
Error when running DynamicStepDriver with tf_agents
#936 opened 4 months ago by Jack-Rais
1
Problem with AverageReturnMetric
#909 opened a year ago by billh0420
1
Type error in PolicySaver.save()
#929 opened 5 months ago by anmol438
1
Compatibility with gymnasium environments
#928 opened 5 months ago by anmol438
0
"Too many values to unpack error" when using DQN mnih example
#846 opened a year ago by hexonfox
1
PolicySavedModelTrigger has no sweeping ability.
#927 opened 7 months ago by brianorbrain
0
WARNING:tensorflow:Value in checkpoint could not be found in the restored object: (root).agent._optimizer._variables.1
#926 opened 7 months ago by huiyujie
0
Errors when saving ranking policies
#891 opened a year ago by tottenjordan
1
PPOAgent + MaskSplitterNetwork normalizes Mask when observation normalization is turned on.
#922 opened 8 months ago by BaLinuss
0
got an unexpected keyword argument 'kwargs' when create_variables and unable to copy
#919 opened 9 months ago by hotamago
0
Multi arm Multi play Setting
#917 opened 10 months ago by Akshaysharma29
0
Inference MAB
#914 opened 10 months ago by Akshaysharma29
1
[Fix this ASAP] TypeError : 'AtariEnv.render() takes 1 positional argument but 2 were given' and some environments can't be rendered
#911 opened 10 months ago by Skyisblue324
1
How to convert keras Model to DDQN QNetwork？
#910 opened a year ago by linfanzz
0
Feature Request: expose a target_update method for DQNAgent
#907 opened a year ago by billh0420
0
DQN Tutorial.ipynb not working
#906 opened a year ago by tarobins
2
Parallelization of this "CartPole v0" standard environment failed
#905 opened a year ago by fengyuhun
0
Per-Arm Features guide/tutorial not in site table of contents
#903 opened a year ago by tottenjordan
1
How to run Code on Windows
#904 opened a year ago by MarleneBs
0
Add support for gym_kwargs in suite_atari
#894 opened a year ago by msmith93
1
New to RL Agents, need help with policies
#892 opened a year ago by AhmadALBarqawi
1
remove tensorflow warning around tf.function on per_field_where
#902 opened a year ago by cmarlin
0
Q-Network wrong output spec
#896 opened a year ago by rissois
1
Actor network predicts actions over bounds using PPOClipAgent
#847 opened a year ago by b-fg
2
TypeError: configurable() got an unexpected keyword argument 'blacklist'
#889 opened a year ago by ParamB11
1
Understanding parallel environments
#888 opened a year ago by b-fg
0
Best way to emit/output internal network state (for debugging)
#884 opened a year ago by nathanmartz
0
Learning rate metric inconsistent depending on optimizer
#883 opened a year ago by b-fg
0
`merge_call` called while defining a new graph or a tf.function.
#882 opened a year ago by Jark5455
0
Shape mismatch in DDPG agent's critic loss function? Shape mismatch between td_targets and q_values tensors
#854 opened a year ago by Pren7z
1
ReverbAddTrajectoryObserver gives "The number of pending items is alarmingly high" error
#853 opened a year ago by adiaconu11
0
Bug in tf_agents.bandits.policies.linalg.conjugate_gradient?
#852 opened a year ago by td20002
0
ModuleNotFoundError: No module named 'tf_agents.drivers'; 'tf_agents' is not a package
#849 opened a year ago by RayTsui
0
Possible Bugs in CQL_SAC Example
#848 opened a year ago by ChengDaHaI
0
Unable to load multiprocessing context for my validated custom gym environment.
#845 opened a year ago by Yash271100
1
Error in Parallel environment processing BrokenPipeError:[WinError 109]
#844 opened 2 years ago by roeslib
2
Actor-PPOLearner with discrete Action on Windows
#841 opened 2 years ago by bennyfri
0
tf-agents 0.16.0 doesn't install cleanly on macosx (13.2.1 (22D68))
#843 opened 2 years ago by apurva-sharma
0
Shape issue with add_batch() method while training a DQN Agent
#842 opened 2 years ago by PierreSmague
0
ValueError when validating a gym environment with MultiDiscrete action space
#838 opened 2 years ago by heydarshahi
0