JaminFong/DenseNAS

Could you please tell me something about your strategy

HONGJUNRL opened this issue · 2 comments

In the traditional darts methods, weights and architecture are optimized step by step, but in your paper, weights and architecture are optimized by epoch, is it mean that in this epoch,the model optimizes weights and in the next epoch, the model optimizes the architecture. I am quite confused.

alternate the optimization process of weights and architecture by epoch

It means that we split one search epoch into two parts. We first optimize the operation weights on the training dataset and after that, we optimize the architecture on the validation dataset.