simoninithomas/Deep_reinforcement_learning_Course

bug in space invaders

yonigottesman opened this issue · 0 comments

the line:
self.Q = tf.reduce_sum(tf.multiply(self.output, self.actions_))

should be:
self.Q = tf.reduce_sum(tf.multiply(self.output, self.actions_), axis=1)