Are bias terms also constrained when running the SFW algorithms?

Question

Are bias terms also constrained when running the SFW algorithms?

GeoffNN opened this issue 4 years ago · 1 comment

Hi! Really cool work, it's nice to see FW methods working on neural nets.

Quick question: from the code, I don't see any distinction when using bias terms; is everything constrained?

Answer 1 · 2021-08-18T07:20:13.000Z

Hi @GeoffNN, we probably discussed this already, but yes, you're right: we do not distinguish between weights and biases. Each layer has a separate bias tensor, which gets its own constraint. When using a global constraint (i.e. when concatenating all parameters into a single high-dimensional vector that is then constrained), biases are treated the same way as weights.
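To make the two modes concrete, here is a minimal sketch in plain Python. It is not the code from this repository: it assumes an L2-ball constraint, and the parameter names, gradient values, `radius`, and the helpers `lmo_l2`, `per_tensor_lmo`, and `global_lmo` are all hypothetical. It only illustrates that the bias is handled exactly like a weight tensor in both cases.

```python
import math

def lmo_l2(grad, radius):
    """Linear minimization oracle for an L2 ball (assumed constraint set):
    returns argmin over ||v||_2 <= radius of <v, grad>."""
    norm = math.sqrt(sum(g * g for g in grad))
    if norm == 0.0:
        return [0.0] * len(grad)
    return [-radius * g / norm for g in grad]

# Hypothetical flattened gradients for one layer (illustrative values only).
grads = {
    "layer1.weight": [3.0, 4.0],
    "layer1.bias": [0.0, 2.0],
}

def per_tensor_lmo(grads, radius):
    """Per-tensor mode: every tensor, bias included, gets its own ball."""
    return {name: lmo_l2(g, radius) for name, g in grads.items()}

def global_lmo(grads, radius):
    """Global mode: concatenate all tensors, apply one ball, split back."""
    names = list(grads)
    flat = [x for name in names for x in grads[name]]
    v = lmo_l2(flat, radius)
    out, start = {}, 0
    for name in names:
        size = len(grads[name])
        out[name] = v[start:start + size]
        start += size
    return out

per_tensor = per_tensor_lmo(grads, radius=1.0)
global_ = global_lmo(grads, radius=1.0)
```

In the per-tensor case the bias direction has norm `radius` on its own, while in the global case weights and biases share a single norm budget.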

I hope this answers your question!
Max