/Graph-Mining-Toolchain

A curated list of tools for wrangling, analyzing and reasoning on graph-structured multi-relational data.

Mining from Graph-Structured Data

With the advance of latent feature, observable and hybird models for mining from multi-relational datasets, the lack of an open toolchain or framework to graph-mining like TensorFlow/Pytorch to Deep Learning imposes unnecessary tool-coding tasks on researchers in the field, and due to the data structural inconsistency presented in graph-mining algorithms proposed by different research groups, the experiment reproducibility is often compromised and make it much harder to produce baseline models than our colleagues in Deep Learning.

This repository serves as a hub to curate tools/algorithms/datasets in Graph-Mining research, especially for reasoning on large-scale knowledge graph, in the hope to ease the difficulty of entering in this interesting research area.

Frameworks

Tools and Algorithms

Raw Benchmark Datasets

Data Modelling/Graph Visualization

Data Representation/Representation Learning

  • SNAP - Node2Vec:

    Node2Vec is an algorithmic framework for representational learning on graphs. Given any graph, it can learn continuous feature representations for the nodes, which can then be used for various downstream machine learning tasks.

Latent Feature Models

Observable Models

Hybird Models

Feature Selection

Research Projects/Groups

  • UCSE - LINQS

    Led by Prof. Lise Getoor, on Statsitical Relational Learning.

  • Stanford - SNAP

    Led by Prof. Jure Leskovec, on general purpose network analysis and graph mining.

  • CMU - NELL

    CMU Read the web project, utilizing link-prediction for completing the never-ending language learning knowledge base.

  • UT Dallas - StarLing

    Relational Functional Gradient Boosting based algorithms for statistical relational learning.

Researchers