changlin31/BossNAS

About formulation (1) and (6)

Closed this issue · 2 comments

Hi, very thanks for sharing your nice work. In the paper's formulation (1) and (6), all has λ_k. But it seems to be no explaination about them. Could you please point it out here.

λ_k represent the weighting factor (a hyperparameter) to balance the loss of different blocks. These factors are set to 1 in this work and our previous work [36] and is learnable in DONNA [45].

Really thanks for your reply.