Unexpected performance drop when not using provided test code.
deepfakeLearnerHi opened this issue · 5 comments
Reproducing with a bit more data (20 frames per video), I got a fairly good checkpoint. However, when I evaluated the checkpoint without the provided test code, the performance dropped unexpected. I wonder if I did something wrong.
Here's what I did:
- Load my test data
- Perform augmentation: images normalized with mean an std both [0.5, 0.5, 0.5]; images resized to (299, 299)
- Labels: Real=0, Fake=1
- Put all data to dataloader
- Feed all data from dataloader to the model and calculate the ACC and AUC.
Please let me know if I am missing any important step.
And it will be very helpful if a well-trained checkpoint could be provided.
Hi, can your good checkpoint get the same results as shown in the paper?
Hi, actually I cannot find any important missing steps based on the description. I suggest modifying the provided test code line by line to find the reason.
Found the problem although I don't know why:
I did augmentations using the torchvision.transforms, which led to the bad results.
However, when I used the augmentation which the source code did, it worked just fine.
I think your source code is good though, something must be wrong on my side.
Thanks for your time!
Hi, thanks for reporting the reason. I guess it may result from the different implementations of them.
Best wishes
Reproducing with a bit more data (20 frames per video), I got a fairly good checkpoint. However, when I evaluated the checkpoint without the provided test code, the performance dropped unexpected. I wonder if I did something wrong. Here's what I did:
- Load my test data
- Perform augmentation: images normalized with mean an std both [0.5, 0.5, 0.5]; images resized to (299, 299)
- Labels: Real=0, Fake=1
- Put all data to dataloader
- Feed all data from dataloader to the model and calculate the ACC and AUC.
Please let me know if I am missing any important step. And it will be very helpful if a well-trained checkpoint could be provided.
Hi, could u share the detection performance you get with more frames? Thanks!