gao-lab/GLUE

About node feature in the guidance graph

Closed this issue · 4 comments

yuxu-1 commented

Hi,

I read your paper and did not find the text about initialized node features in the guidance graph (maybe I missed the relevant text..).

In my understanding, each node in that graph is a gene (atac-seq peak region, or methylation region), which is irrelevant to cells. I mean, these nodes can be defined without single-cell data. Is that right? I am not sure whether my understanding is right or not.

Thanks
Yu

Hi Yu! Thanks for your interest in GLUE. You are right about the identity of the nodes, but the graph encoder in GLUE does not use node features, the input of graph convolution is learnable node embeddings (vrepr).

You may also interpret it as one-hot node features times a linear transformation $I_{\mathrm{vnum} \times \mathrm{vnum}} \cdot V_{\mathrm{vnum} \times \mathrm{out \textunderscore features}}$, where the linear transformation is exactly the node embeddings.

Hope that clarifies. Please let me know if there were further questions!

yuxu-1 commented

Hi Jeff,

Thanks for your reply. GLUE is a great research work!
I still have a question: the learnable node embeddings (vrepr, $V_{vnum\times out_features}$) were initialized with the scalar value 0, which would make the message in the process of GCN to be 0. How do you transmit information in the graph?

BR
Yu

While the embeddings are initialized as zeros, they will be updated during training as guided by the graph and data decoders. They will definitely be non-zero after the model has been trained.

yuxu-1 commented

Hi Jeff,

I see. Thanks!
The learnable node embeddings (torch parameters) have gradients after backforward.

BR
Yu