The update in DQN
YuxuanXie opened this issue · 2 comments
YuxuanXie commented
Hi,
I get a question about your implementation of DQN, which is supposed to have a C-interval-update between target q-network and current q-network. I see this update in your implementation of DDQN. Can you please tell me why it is this way?
In my point of view, your implementation of ddqn is actually dqn.
Best,
Yuxuan
douglasrizzo commented
I realized that too. In fact, there is no target network at all in 1.dqn.ipynb.
The following line in the notebook (cell 19):
next_q_values = model(next_state)
Should be:
next_q_values = target_model(next_state)
YuxuanXie commented
Yes, you are right. thanks!