lightaime/deep_gcns_torch

My training is very slow

linhaojia13 opened this issue · 6 comments

Hi, I trained the sem_seg_sparse model using 2 Tesla V100 GPUs, however, I found the training is very very slow and it takes several minutes to forward a batch! Just a batch! The training is under the default configuration. I had checked the GPU situation and found it is working. Do you have any idea about this?

Thank you for you advice, now I realize that deep_gcns_torch may be not a friendly repo to me and my GPUs ...hhhhh
By the way, is there a gap on time efficiency between the tensorflow version and pytorch version of DeepGCNs ?

Thank you very much.

Hi @lightaime

I've found there are 2 ways for sparse KNN neighborhood calculation when running seg_sem_sparse: knn_graph_matrix and knn_graph.

The knn_graph_matrix is based on pairwise distance matrix but it assumes the input have same number of nodes in each graph (batch mode). However, the knn_graph is coming from torch_cluster and can support different number of nodes in each graph (batch mode).

Please correct me if there is anything wrong or missed.