codeslake/RefVSR

Questions about L1-loss models.

sunlustar opened this issue · 3 comments

Thanks for your great work. From your results in Table 2, it seems that the model using the l1 loss (Ours-l1) outperforms the model trained with the proposed two-stage training strategy (Ours) by over 3 dB, and your training code appears to use a one-stage training process.

So,

  1. Why does the model “Ours-l1” perform better than the model “Ours”? It seems that you don't have the ground truth of real-world HR_UW.

  2. How does the one-stage training process work?

Hi, @sunlustar.

  1. Note that the compared models in Table 2 are the models trained with only the pre-training stage (Sec 4.1).
    As explained in the paper (the paragraph under Table 2), pixel-based losses are known to have an advantage over perceptual-based losses in terms of PSNR.
    In Table 2, the model Ours-l1 outperforms the model Ours in terms of PSNR, as the former is trained with a pixel-based loss (l1 loss only), while the latter is trained with the perceptual-based loss (the contextual loss terms in Eq. 10).

  2. Please refer to Sec. 4 of the main paper, especially Sec. 4.1 for the pre-training stage.
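As a minimal sketch of why a pixel-based loss favors PSNR: the l1 loss is the mean absolute error over pixels, so minimizing it directly reduces the per-pixel deviations that PSNR is computed from, whereas a perceptual-based loss optimizes feature-space similarity instead. The function below is illustrative, not taken from the RefVSR codebase.

```python
import numpy as np

def l1_loss(pred, target):
    """Pixel-based l1 loss: mean absolute error over all pixels."""
    return np.mean(np.abs(pred - target))

def psnr(pred, target, max_val=1.0):
    """PSNR is a monotone function of per-pixel MSE, so pixel-based
    losses (l1/l2) align closely with it; perceptual losses do not."""
    mse = np.mean((pred - target) ** 2)
    return 10.0 * np.log10(max_val ** 2 / mse)
```

Lowering the pixel-wise error directly raises PSNR, which is why Ours-l1 scores higher on that metric even though the perceptual-based model may look sharper.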

Thanks, and I have two more questions:

  1. According to the paper, in the Ours-l1 model, λ_rec = 0.01, λ_pre = 0.05, λ_l1 = 1, which means the l1 loss is still predominant. But the performance gap between Ours and Ours-l1 is over 3 dB, which is drastic.

  2. Is the model Ours-l1 in Table 3 trained with the proposed two-stage strategy, or with only the pre-training stage?

  1. According to the paper, in the Ours-l1 model, λ_rec = 0.01, λ_pre = 0.05, λ_l1 = 1, which means the l1 loss is still predominant. But the performance gap between Ours and Ours-l1 is over 3 dB, which is drastic.

The model Ours-l1 is trained with only the l1 loss. I guess you are talking about the model Ours, which uses λ_rec = 0.01, λ_pre = 0.05, λ_l1 = 1. As I mentioned before, the perceptual-based loss lowers PSNR, which is well established in previous studies.
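To illustrate the weighting point: even though λ_l1 = 1 dominates the other coefficients numerically, adding any weight on the perceptual terms shifts the optimum away from the purely MAE-optimal solution, which is enough to cost several dB of PSNR. A hypothetical sketch of the weighted sum (the real L_rec and L_pre in Eq. 10 are contextual losses; the names here are placeholders):

```python
# Coefficients quoted in this thread for the model Ours (Eq. 10).
LAMBDA_REC, LAMBDA_PRE, LAMBDA_L1 = 0.01, 0.05, 1.0

def total_loss(loss_rec, loss_pre, loss_l1):
    """Weighted combination of the loss terms. In Ours-l1 only the
    l1 term is used; in Ours all three terms contribute, so the
    gradient is no longer aligned with minimizing pixel error alone."""
    return LAMBDA_REC * loss_rec + LAMBDA_PRE * loss_pre + LAMBDA_L1 * loss_l1
```

Dropping the two perceptual terms (setting their coefficients to zero) recovers the Ours-l1 objective, whose minimizer is the PSNR-friendly one.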

  1. Is the model Ours-l1 in Table 3 trained with the proposed two-stage strategy, or with only the pre-training stage?

The models in Table 3 are trained with only the pre-training stage.