About the format of the data set?
yangxia605 opened this issue · 2 comments
yangxia605 commented
hi @acbull
my case is that i should use a sample of the training data sets to build a graph but not all of the training data sets.
that is to say: Use the internal elements of the sample to construct a heterogeneous graph.
I was wonder if i can use GPT-GNN to fine-tuning it and then Do a classification task in the downstream ?
thanks!!!
acbull commented
Sure, pre-training is very suitable for your case.
About data format, you can refer to
Line 20 in 2409049
To construct such a graph, you can refer to prepreocess_*.py as a reference.
yangxia605 commented
thank you!!!