X matrix

Question

X matrix

iyedbennour opened this issue 3 years ago · 1 comments

Hi ! can you describe the process to create the X matrix that is contained in the different npz files please ? How do you transform the cora papers into features.
Thank you !

Answer 1 · 2021-09-24T17:23:13.000Z

Sorry for the late reply. The X matrix contains the TF-IDF representation of the text in the paper abstracts. Specifically, I used the https://scikit-learn.org/stable/modules/generated/sklearn.feature_extraction.text.TfidfVectorizer.html on the raw text data.

Hope this helps.