PPRGo (PyTorch)

This repository provides a PyTorch implementation of PPRGo for a single machine. You can find the original TensorFlow 1 implementation in another repository. PPRGo is a fast GNN able to scale to massive graphs in both single-machine and distributed setups. It was proposed in our paper

Scaling Graph Neural Networks with Approximate PageRank
by Aleksandar Bojchevski*, Johannes Klicpera*, Bryan Perozzi, Amol Kapoor, Martin Blais, Benedek Rózemberczki, Michal Lukasik, Stephan Günnemann
Published at ACM SIGKDD 2020.

Demonstration

To see for yourself how fast PPRGo runs even on a large dataset we've set up a Google Colab notebook, which trains and generates predictions for the Reddit dataset, as described in the paper.

Installation

You can install the repository using pip install -e .. Since CUDA 10.0 includes a bug that affects PPRGo we strongly recommend using e.g. 10.1.

Run the code

This repository contains a demo notebook for running training and inference (demo.ipynb) and a script for running the model on a cluster with SEML (run_seml.py).

Contact

Please contact a.bojchevski@in.tum.de or klicpera@in.tum.de if you have any questions.

Cite

Please cite our paper if you use the model or this code in your own work: