wailker3 opened this issue 5 years ago · 0 comments
The author uses only one CNN to compute both the evaluated (online) Q-value and the target Q-value. Shouldn't there be a separate CNN for computing the target Q-value?
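To make the question concrete, here is a minimal sketch of the two-network setup I have in mind, where the target Q-value comes from a frozen copy of the network that is only refreshed every few steps. Everything in it is hypothetical and not taken from this repo: `TinyQNet` is a small linear stand-in for the CNN, and `train_step`, `sync_target`, the learning rate, and the sync interval of 10 are illustrative choices only.

```python
import copy


class TinyQNet:
    """Hypothetical stand-in for the CNN: a linear map from state features to Q-values."""

    def __init__(self, n_features, n_actions):
        # One weight row per action, initialised to zero for reproducibility.
        self.weights = [[0.0] * n_features for _ in range(n_actions)]

    def q_values(self, state):
        # Q(s, a) for every action a.
        return [sum(w * s for w, s in zip(row, state)) for row in self.weights]


def td_target(target_net, reward, next_state, done, gamma=0.99):
    # The bootstrapped target is computed with the frozen target network,
    # not with the network that is currently being trained.
    if done:
        return reward
    return reward + gamma * max(target_net.q_values(next_state))


def train_step(online_net, target_net, transition, lr=0.1, gamma=0.99):
    state, action, reward, next_state, done = transition
    y = td_target(target_net, reward, next_state, done, gamma)
    error = y - online_net.q_values(state)[action]
    # Gradient step on the squared TD error; only the online network changes.
    row = online_net.weights[action]
    for i, s in enumerate(state):
        row[i] += lr * error * s
    return error


def sync_target(online_net, target_net):
    # Hard update: copy the online weights into the target network.
    target_net.weights = copy.deepcopy(online_net.weights)


online = TinyQNet(n_features=2, n_actions=2)
target = TinyQNet(n_features=2, n_actions=2)
sync_target(online, target)

# A single made-up transition: (state, action, reward, next_state, done).
transition = ([1.0, 0.0], 0, 1.0, [0.0, 1.0], False)
for step in range(1, 21):
    train_step(online, target, transition)
    if step % 10 == 0:  # assumed sync interval
        sync_target(online, target)
```

With a single network, the target moves on every gradient step along with the estimate it is supposed to anchor. Keeping a second, periodically synced copy holds the target fixed between syncs, which is the arrangement described in the original DQN paper.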