When I run SAC/BipedalWalker-v3/main.py, the following error occurs.

$ python3 main.py 
WARNING:absl:`lr` is deprecated, please use `learning_rate` instead, or use the legacy optimizer, e.g.,tf.keras.optimizers.legacy.Adam.
Traceback (most recent call last):
  File "main.py", line 263, in <module>
    main(n_episodes=1000, n_testplay=4)
  File "main.py", line 232, in main
    agent = SAC(env_id="BipedalWalker-v3", action_space=4, action_bound=1)
  File "main.py", line 54, in __init__
  File "main.py", line 63, in _initialize_weights
    dummy_state = (dummy_state[np.newaxis, ...]).astype(np.float32)
TypeError: tuple indices must be integers or slices, not tuple

I have no idea what causes this error, so please let me know how to fix it.

Hi @Ishihara-Masabumi, thank you for reporting.
This error is due to a recent change in the gym API. In older versions, env.reset() returns only an np.ndarray, but in the latest version it returns a tuple of (np.ndarray, {}). As a consequence, dummy_state is a tuple, and dummy_state[np.newaxis, ...] raises the TypeError.

Therefore, there are two solutions. One is to simply downgrade gym; I confirmed that gym==0.25.2 works fine. The other is to change dummy_state = env.reset() to dummy_state, _ = env.reset().
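
If you would rather have code that tolerates both return shapes, a small helper along these lines might work. This is only a sketch: unpack_reset is a hypothetical name and is not part of this repository, and it assumes the newer API always returns a 2-tuple whose second element is a dict. The lists below are stand-ins for real observations, so the snippet runs without gym installed.

```python
def unpack_reset(result):
    """Return only the observation from an env.reset() result.

    Assumption: the newer gym API returns (observation, info) with info
    being a dict, while the older API returns the observation alone.
    """
    is_new_style = (
        isinstance(result, tuple)
        and len(result) == 2
        and isinstance(result[1], dict)
    )
    return result[0] if is_new_style else result


# Stand-ins for the two return shapes (plain lists instead of np.ndarray).
old_style = [0.0, 0.1, 0.2]
new_style = ([0.0, 0.1, 0.2], {})

print(unpack_reset(old_style))
print(unpack_reset(new_style))
```

In main.py this would presumably be used as dummy_state = unpack_reset(env.reset()), placed before the dummy_state[np.newaxis, ...] line.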

OK, thanks.