my training result is not as good as paper

Question

my training result is not as good as paper

w19787 opened this issue 4 years ago · 8 comments

w19787 commented 4 years ago

thanks for excellent paper and work. i try to re-train the project after migrating the project to tensorflow 2.0. Only api is migrated, no network, loss, structure, data preprocessing ,etc changed.

The following is the screen shot of training tensorboard on liver dataset:

and the evalation result:

any idea what might be go wrong? thanks.

Answer 1 · 2020-06-12T16:15:04.000Z

What setting are you running? I guess you are running the 1-cascade VTN version that should correspond to these numbers. Specify -n to train more cascades.

Answer 2 · 2020-06-13T01:58:27.000Z

What setting are you running? I guess you are running the 1-cascade VTN version that should correspond to these numbers. Specify -n to train more cascades.

thanks for reply. it is my bad not to notice the n should be changed for better performance according paper.

Answer 3 · 2020-06-13T02:47:17.000Z

What setting are you running? I guess you are running the 1-cascade VTN version that should correspond to these numbers. Specify -n to train more cascades.

in order to process the 10-cascade VTN training, what kinds of GPU is required? it is OOM on 16G v100 when train on 5 or 10 cascades.

Answer 4 · 2020-06-13T16:32:03.000Z

It would be fine if using 4 GPUs.

Answer 5 · 2020-06-14T01:39:44.000Z

It would be fine if using 4 GPUs.

after migrating to tf2.0 (since my server is installed new version's cuda which cannot run on tf1.4), the multi-gpu cannot work properly. the compute_gradients seems only work on gpu0. Currently, no idea how to fix it.

Answer 6 · 2020-06-14T14:48:57.000Z

It would be fine if using 4 GPUs.

@zsyzzsoft i have upload https://github.com/w19787/Recursive-Cascaded-Networks-TF2.0, if you can advice how to fix the issue of multi-gpu support on this version will be appreciated!!!

Answer 7 · 2020-06-14T16:52:57.000Z

I haven't met this issue before and I'm not familiar with TF 2.0. Maybe the GPU specification doesn't work properly for TF 2, but I'm not sure.

Answer 8 · 2020-06-15T01:34:19.000Z

I haven't met this issue before and I'm not familiar with TF 2.0. Maybe the GPU specification doesn't work properly for TF 2, but I'm not sure.

got it. thanks!