Unexpected performance drop when not using provided test code.

Question

deepfakeLearnerHi opened this issue 2 years ago · 5 comments

Reproducing with a bit more data (20 frames per video), I got a fairly good checkpoint. However, when I evaluated the checkpoint without the provided test code, the performance dropped unexpectedly. I wonder if I did something wrong.
Here's what I did:

1. Load my test data.
2. Perform augmentation: images normalized with mean and std both [0.5, 0.5, 0.5]; images resized to (299, 299).
3. Labels: Real=0, Fake=1.
4. Put all data into a dataloader.
5. Feed all data from the dataloader to the model and calculate the ACC and AUC.
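For reference, the normalization and metric steps above can be sketched in plain Python. This is only an illustrative stand-in, not the repository's test code: the function names, the 0.5 decision threshold, and the sample scores are all assumptions made for the example.

```python
# Sketch only: pure-Python stand-ins for the normalization and metric steps.

def normalize(pixel, mean=0.5, std=0.5):
    """Map a pixel value in [0, 1] to [-1, 1] using mean = std = 0.5."""
    return (pixel - mean) / std


def accuracy(labels, scores, threshold=0.5):
    """Fraction of samples whose thresholded score matches the label."""
    preds = [1 if s >= threshold else 0 for s in scores]
    correct = sum(p == y for p, y in zip(preds, labels))
    return correct / len(labels)


def auc(labels, scores):
    """AUC as the chance that a fake sample outscores a real one (ties count 0.5)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one real and one fake sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


labels = [0, 0, 1, 1]          # Real=0, Fake=1
scores = [0.1, 0.6, 0.4, 0.9]  # hypothetical model outputs (probability of Fake)
acc_value = accuracy(labels, scores)
auc_value = auc(labels, scores)
```

Note that ACC depends on the chosen threshold while AUC does not, so a mismatch in only one of the two metrics can hint at whether the problem lies in thresholding or in the inputs themselves.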

Please let me know if I am missing any important step.
It would also be very helpful if a well-trained checkpoint could be provided.

Answer 1 · 2022-08-18T07:16:53.000Z

Hi, can your good checkpoint reproduce the results reported in the paper?

Answer 2 · 2022-09-03T07:48:16.000Z

Hi, actually I cannot find any important missing steps based on the description. I suggest modifying the provided test code line by line to find the reason.

Answer 3 · 2022-09-07T14:20:08.000Z

I found the problem, although I don't know why it happens:
I did the augmentations with torchvision.transforms, which led to the bad results.
However, when I used the same augmentation code as the source code, it worked just fine.
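The thread does not establish the actual cause. One possible explanation, stated here purely as an assumption, is that two pipelines that both "resize and normalize" can still hand the model different pixel values. The sketch below uses two hypothetical 1-D downsampling functions (neither is torchvision's nor the repository's implementation) to show how such a gap can be measured on the same input.

```python
# Sketch only: two hypothetical resize strategies applied to the same pixel row.

def resize_nearest(row, out_len):
    """Downsample by picking one source pixel per output pixel."""
    scale = len(row) / out_len
    return [row[int(i * scale)] for i in range(out_len)]


def resize_area(row, out_len):
    """Downsample by averaging each block of source pixels.

    Assumes len(row) is an exact multiple of out_len.
    """
    block = len(row) // out_len
    return [sum(row[i * block:(i + 1) * block]) / block
            for i in range(out_len)]


def max_abs_diff(a, b):
    """Largest element-wise gap between two preprocessed inputs."""
    return max(abs(x - y) for x, y in zip(a, b))


row = [0.0, 1.0, 0.0, 1.0, 0.2, 0.8, 0.4, 0.6]  # hypothetical pixels in [0, 1]
a = resize_nearest(row, 4)
b = resize_area(row, 4)
gap = max_abs_diff(a, b)
```

Running the same comparison on the real tensors produced by each pipeline for a single image is a quick way to confirm or rule out a preprocessing mismatch.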

I think your source code is fine, though; something must be wrong on my side.
Thanks for your time!

Answer 4 · 2022-09-28T12:03:36.000Z

Hi, thanks for reporting the reason. My guess is that it results from differences between the two augmentation implementations.
Best wishes

Answer 5 · 2023-09-06T03:10:32.000Z

Hi, could you share the detection performance you got with more frames? Thanks!