Pre-train model about efficientnet-l2-noisystudent

Question

Closed this issue 4 years ago · 10 comments

First, thanks for this amazing work!

I checked the repo you refer to in your paper for the pretrained model efficientnet-l2-noisystudent. However, that repo seems to contain only the model definition, not the pretrained weights. Did you therefore use efficientnet-l2-noisystudent from its official repo (that is, the TensorFlow version)?

Answer 1 · 2020-12-15T17:57:30.000Z

Hi @davidzhangyuanhan,

Sorry, I might be misunderstanding your question. We use the EfficientNet-L2-NoisyStudent from the pytorch-image-models repo, which itself has been ported from the official TensorFlow repo. Are you having any trouble running the model?

Answer 2 · 2020-12-16T02:53:08.000Z

Hi @rtaori,
What I mean is that the EfficientNet-L2-NoisyStudent model in the pytorch-image-models repo is written in PyTorch, but the weights released in the official repo are in TensorFlow format.
So I guess you either used the TensorFlow-based EfficientNet-L2-NoisyStudent model to conduct the experiments, or you have PyTorch-based EfficientNet-L2-NoisyStudent pretrained weights.

Answer 3 · 2020-12-16T02:57:29.000Z

Yes, we use the PyTorch version. To be clear, the weights are exactly the same: they are ported from the official TensorFlow repo into the PyTorch model. We use PyTorch so it integrates with the rest of our testbed, but if you run the TensorFlow model you should get the exact same results (up to some small differences in data preprocessing if you use TensorFlow's dataloaders).

Answer 4 · 2020-12-16T03:06:30.000Z

Thanks for your quick reply, and again, I really appreciate this amazing work. EfficientNet-L2-NoisyStudent is the only released pretrained model that was trained on JFT-300M :), so it is very important.

Answer 5 · 2020-12-16T03:08:31.000Z

Ah yes, no worries! Let us know if you have any more questions. Closing this issue as it seems resolved.

Answer 6 · 2020-12-17T03:41:50.000Z

Hi Rohan:
Do you have any tools for converting TensorFlow weights to PyTorch? I tried some tools from GitHub, but they do not seem to work.

Answer 7 · 2020-12-17T03:43:21.000Z

Hi,
Sorry, I don't have any custom tools I built myself. So far I've just used ports others in the community have created :)
-Rohan

Answer 8 · 2020-12-17T03:48:03.000Z

So how can I convert the Noisy Student pretrained weights to PyTorch? Could you please give some instructions? torch.load() cannot load a TensorFlow checkpoint.

Answer 9 · 2020-12-17T03:51:42.000Z

If you're looking to do the conversion yourself, you could ask https://github.com/rwightman for his script. If you're just looking to use the model in PyTorch, you can either use the model from our testbed or from the https://github.com/rwightman/pytorch-image-models repository.
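
For anyone attempting the conversion themselves, a large part of such a port is usually re-ordering each weight tensor, since TensorFlow and PyTorch store convolution and dense kernels with different axis orders. The snippet below is only a minimal sketch of that one step, assuming the TensorFlow variables have already been read into NumPy arrays. It is not rwightman's script; the function name `tf_to_torch_layout` and the `kind` labels are hypothetical, and mapping variable names between the two models is not covered.

```python
import numpy as np


def tf_to_torch_layout(array, kind):
    """Reorder one TensorFlow weight array into PyTorch's axis order.

    `kind` is a hypothetical label used only in this sketch:
      "conv"      TF (H, W, in, out)        -> PyTorch (out, in, H, W)
      "depthwise" TF (H, W, in, multiplier) -> PyTorch (in * multiplier, 1, H, W)
      "dense"     TF (in, out)              -> PyTorch (out, in)
    Any other kind (biases, batch-norm vectors) is copied unchanged.
    """
    array = np.asarray(array)
    if kind == "conv":
        return np.transpose(array, (3, 2, 0, 1))
    if kind == "depthwise":
        h, w, c_in, mult = array.shape
        # Move channels first, then merge (in, multiplier) into one axis so
        # that output channel c * mult + m matches TensorFlow's ordering.
        moved = np.transpose(array, (2, 3, 0, 1))
        return moved.reshape(c_in * mult, 1, h, w)
    if kind == "dense":
        return np.transpose(array, (1, 0))
    return array.copy()


# Example: a 3x3 convolution kernel with 4 input and 8 output channels.
tf_kernel = np.zeros((3, 3, 4, 8), dtype=np.float32)
torch_kernel = tf_to_torch_layout(tf_kernel, "conv")
print(torch_kernel.shape)
```

The resulting arrays could then be wrapped with `torch.from_numpy` and copied into the matching entries of the PyTorch model's state dict. If you only need the model rather than the conversion, the ported weights in the repositories mentioned above avoid this step entirely.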

Answer 10 · 2020-12-17T08:03:04.000Z

Thanks, I found that the PyTorch weights for Noisy Student are already in this repo.