The update in DQN

Question

YuxuanXie opened this issue 4 years ago · 2 comments

Hi,

I have a question about your implementation of DQN, which is supposed to copy the current Q-network into the target Q-network every C steps. I see this update in your implementation of DDQN. Can you please tell me why it is done this way?

In my view, your implementation of DDQN is actually DQN.

Best,
Yuxuan

Answer 1 · 2020-07-07T10:06:21.000Z

I realized that too. In fact, there is no target network at all in 1.dqn.ipynb.

The following line in the notebook (cell 19):

next_q_values = model(next_state)

Should be:

next_q_values = target_model(next_state)
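
To make the difference concrete, here is a minimal, framework-free sketch of the target computation with a periodic sync. It is not the notebook's code: `TinyQ`, `td_target`, `sync_target`, and the interval `C = 3` are hypothetical names and values chosen for illustration, and the "training step" is a stand-in that just shifts the weights. In PyTorch the sync would typically be done with `target_model.load_state_dict(model.state_dict())`.

```python
import copy


class TinyQ:
    """Toy linear Q-function: Q(s, a) = w[a] * s (hypothetical stand-in for a network)."""

    def __init__(self, weights):
        self.weights = list(weights)

    def __call__(self, state):
        return [w * state for w in self.weights]


def td_target(target_model, reward, next_state, done, gamma=0.99):
    """DQN target: r + gamma * max_a Q_target(s', a); no bootstrap on terminal states."""
    next_q_values = target_model(next_state)  # frozen copy, not the online model
    return reward + gamma * max(next_q_values) * (0.0 if done else 1.0)


def sync_target(model, target_model):
    """Copy the online weights into the target model."""
    target_model.weights = list(model.weights)


model = TinyQ([1.0, 2.0])
target_model = copy.deepcopy(model)
C = 3  # hypothetical sync interval

targets = []
for step in range(1, 7):
    # Stand-in for a gradient step: the online weights change on every step.
    model.weights = [w + 1.0 for w in model.weights]
    targets.append(
        td_target(target_model, reward=0.0, next_state=1.0, done=False, gamma=1.0)
    )
    if step % C == 0:
        sync_target(model, target_model)
```

In this sketch the targets stay constant between syncs even though the online weights change every step. Calling `model(next_state)` instead would make the target move with every update, which is the behavior described above.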

Answer 2 · 2020-07-08T04:14:11.000Z

Yes, you are right. Thanks!