Welcome to the home repository of Graspan C++.
Graspan is a disk-based parallel graph system that uses an edge-pair centric computation model to compute dynamic transitive closures on large program graphs. Graspan has been implemented in two languages: Java and C++. This repository provides the C++ implementation of Graspan.
This Readme (under revision) provides a how-to-use guide for Graspan C++. To see how to use the Java version of Graspan, look here.
See here to understand how Graspan works and how it performs in comparison to other systems (under revision).
For a detailed description of our system, please see the preliminary version of our paper here, which has been accepted in ASPLOS 2017. In addition, a tutorial of Graspan is scheduled to be presented in ASPLOS 2017. If you are interested, please visit our ASPLOS 2017 tutorial page.
Graspan C++ is simple to use, with a very straight-forward compilation procedure.
Ensure that you have a recent version of the Boost library installed in your system. You can obtain the library from here.
First, download the entire graspan-cpp source code into your machine. Next, edit the src/makefile to set the paths to the Boost library include files and lib files in your machine. Do the same for the src/run file. Finally, run the makefile in the src folder using make. GraspanCPP should now be compiled and ready to run.
Graspan needs two input files: (1) a graph on which Graspan can perform computations and (2) a grammar file which describes how the computations (edge additions) are to be performed.
You may copy any graph and grammar file from our sample datasets here inside the graspan-cpp/src folder in your machine.
Note that Graspan supports graph files in edge-list format as shown below,
[EDGE SOURCE] [EDGE DESTINATION] [EDGE VALUE]
The grammar file consists of production rules in CNF form, where a line like the following,
A B C
represents a rule such that A is the left-side of the rule and BC is the right-side of the rule.
After getting the graph and grammar file into the graspan-cpp/src folder, run the run.sh script in your command line specifying,
- the graph file,
- the grammar file
- the number of partitions user would like to generate from the graph during preprocessing, prior to computation,
- the memory budget: the user’s preferred memory budget (in GB). It represents an upper bound of memory for Graspan to use for the entire computation
- the number of threads: the number of threads the user would like the program to use. The recommended value is twice the number of available cores.
as shown below,
./run <graph_file> <grammar_file> <number_partitions> <memory_budget> <num_threads>
After running the above command, you can monitor the progress of the computation by viewing the generated cpp.x.output file in the graspan-cpp/src/log folder. After computation ends, this file will show the number of edges generated and the total computation time. The .partition. output files will contain the partitioned graph with new edges.
Participate in our discussion group, GraspanMeet.
- Kai Wang - PhD Student, UCI
- Aftab Hussain - PhD Student, UCI
- Zhiqiang Zuo - Postdoc Scholar, UCI
- Harry Xu - Assistant Professor, UCI
- Ardalan Sani - Assistant Professor, UCI
- John Thorpe - Undergraduate Student, UCI
- Sung-Soo Son - Visiting Undergraduate Student, UCI
- Khanh Ngyuen - PhD Student, UCI