lucidrains/q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google DeepMind
Python · MIT
Issues
- 2
td loss: nan
#18 opened by Michelangelo-Y - 2
ValueError: could not broadcast input array from shape (8,) into shape (3,)
#17 opened by Michelangelo-Y - 1
how to get dataset?
#6 opened by etoilestar - 3
RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same or input should be a MKLDNN tensor and weight is a dense tensor
#16 opened by Michelangelo-Y - 2
Some personal questions
#15 opened by heyi2025 - 27
question about Q-head
#11 opened by 2M-kotb - 10
Question about num_timestep
#12 opened by carolineys - 0
provide a complete example
#13 opened by Johnly1986 - 1
- 2
A simple question about the code
#8 opened by KID0031 - 3
integrate into stable-baselines3
#5 opened by wenjun90 - 4
- 1
The rest part of the code?
#2 opened by wukui-muc