Issues
- 2
Benchmark against nVidia's XMP library
#46 opened by unzvfu - 0
Cases not found in tests
#56 opened by duanbing - 0
- 0
Start wiki page for bug post-mortem analyses
#54 opened by unzvfu - 0
Automate memcheck/address sanitisation checks
#53 opened by unzvfu - 0
Implement systematic profiling
#52 opened by unzvfu - 0
- 1
- 0
Implement the NTT
#40 opened by unzvfu - 0
- 0
- 0
Use AND for multiplying a carry flag
#48 opened by unzvfu - 0
Remove final subtraction from Monty multiplication
#47 opened by unzvfu - 0
Implement Wallace tree multiplier
#45 opened by unzvfu - 0
Support 'signed-digit' representations
#44 opened by unzvfu - 0
Load modulus into shared memory
#43 opened by unzvfu - 0
- 1
Overhaul benchmarking system
#30 opened by unzvfu - 0
Implement a register cache
#41 opened by unzvfu - 1
Review consequences of Cuda 7 Independent Thread Scheduling on warp-synchronicity assumptions
#27 opened by unzvfu - 0
Incorporate test vectors from Wycheproof
#39 opened by unzvfu - 0
- 0
- 0
Implement faster Newton-Raphson
#36 opened by unzvfu - 0
- 0
Rewrite test suite to run via Python interface
#34 opened by unzvfu - 0
Re-jigger the API to ease writing HLL interfaces
#33 opened by unzvfu - 0
Implement Python interface
#32 opened by unzvfu - 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
- 0
Clean up warp_fixnum division code
#19 opened by unzvfu - 0
- 0
- 0
- 0
Store test cases in compressed format
#15 opened by unzvfu - 0
- 0
- 0
- 0
- 0
Implement support for even moduli
#9 opened by unzvfu - 0
Correctly handle errors from device code
#8 opened by unzvfu - 0
Specialise modexp for word-sized base
#7 opened by unzvfu