cuda-fixnum for snark challenge

Use this code to get started on the snark challenge. It implements the logic needed to do field arithmetic. In particular, for the fields used by mnt4-753 and mnt6-753, this takes the pairwise product of two arrays of field elements. That is, it maps over two arrays.

See main.cu for the implementation

Suggested steps to make a submission for the snark challenge

Here are some suggested steps to solve the tutorial stage and work towards a faster, GPU powered snark prover:

For each of these, the cuda kernel code will need to be changed (see main.cu:21-35)

And here is our best guess on how to effectively make a submission for the full prover (up to $70,000, and $7,000 immediately for the first submission to 2x the speed)

The SNARK prover is composed of several FFTs and multiexponentiations. In the C++ reference implementation, the FFTs are here and the multiexponentiations are here.

Once you've finished the tutorial, try improving the multi-exponentiations with on-GPU versions using the curve operations from the tutorial. Each multi-exponentiation can be seen as a map-reduce, as explained here. The reduce part may be complicated to implement for GPU, so it may be a good idea to start by implementing the "map" part on GPU and the "reduce" part on CPU.
Do the multi-exponentiations entirely on-GPU using an on-GPU reduce
Use an on-GPU FFT (see for example cuFFT), adapted to finite fields. You can find a C++ implementation of a finite-field FFT here.

To build and run

To build and run:

./build.sh
./main compute inputs outputs
shasum outputs should be b0f4a59a4be1c878dd9698fae7f1be86d8261025

you will need to edit /Makefile:GENCODES to match your GPU see here

wkarshat/cuda-fixnum

cuda-fixnum for snark challenge

Suggested steps to make a submission for the snark challenge

To build and run