Deep Q learning: Algorithm

Question

wailker3 opened this issue 5 years ago · 0 comments

The author uses only one CNN to compute both the evaluated (current) Q-value and the target Q-value. Shouldn't there be a second CNN to compute the target Q-value?
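
For reference, here is a minimal sketch of the two-network arrangement I have in mind. It is not the repository's code: a single linear layer stands in for the CNN, and the names `q_values`, `td_targets`, `train_step`, and `SYNC_EVERY` are hypothetical. The point is only that the bootstrap target is computed from a frozen copy of the weights, which is refreshed every few steps.

```python
import numpy as np

rng = np.random.default_rng(0)
N_STATE, N_ACTION, GAMMA, SYNC_EVERY = 4, 2, 0.99, 5

# Assumption: a linear layer replaces the CNN to keep the sketch short.
online_w = rng.normal(size=(N_STATE, N_ACTION))  # network being trained
target_w = online_w.copy()                       # frozen copy used for targets


def q_values(w, states):
    """Q(s, .) for a batch of states under weights w."""
    return states @ w


def td_targets(target_w, rewards, next_states, dones):
    """r + gamma * max_a' Q_target(s', a'), with no bootstrap on terminal steps."""
    next_q = q_values(target_w, next_states).max(axis=1)
    return rewards + GAMMA * (1.0 - dones) * next_q


def train_step(online_w, target_w, batch, lr=0.01):
    """One gradient step on the online weights; target_w is only read."""
    states, actions, rewards, next_states, dones = batch
    targets = td_targets(target_w, rewards, next_states, dones)
    onehot = np.eye(N_ACTION)[actions]
    q_taken = (q_values(online_w, states) * onehot).sum(axis=1)
    td_error = q_taken - targets
    # Gradient of 0.5 * mean(td_error ** 2) with respect to online_w.
    grad = states.T @ (onehot * td_error[:, None]) / len(actions)
    return online_w - lr * grad


# A fixed random batch stands in for samples from a replay buffer.
batch = (
    rng.normal(size=(8, N_STATE)),      # states
    rng.integers(0, N_ACTION, size=8),  # actions taken
    rng.normal(size=8),                 # rewards
    rng.normal(size=(8, N_STATE)),      # next states
    np.zeros(8),                        # done flags
)

for step in range(1, 11):
    online_w = train_step(online_w, target_w, batch)
    if step % SYNC_EVERY == 0:
        target_w = online_w.copy()  # periodic hard update of the target network
```

With a single network, `td_targets` would be called with `online_w`, so the target would shift on every gradient step. That is the behaviour I am asking about.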