efficient-scrypt-kernel

WARNING: For AMD/ATI Cards only.

Hey guys,

I have set up this repository, to work with you towards a more efficient scrypt kernel for AMD GPUS.

The first iteration of changes includes:

  • Stick to LOOKUP GAP = 2, elimnated all logic that does lookup-gap distinction.

  • Use a new "double-salsa" function which works fine with lookup gap 2. This way we save several instructions, and two functions calls to salsa(). More precisely we can substitute two loops by one, and avoid writing/reading one cycle to B[] array.

  • Turned on the extension cl_amd_media_ops.

  • Using this extension, the scrypt is now work with the amd specific (and faster) instruction set. Most often used, the rotation function now substituted by amd_bitalign()

    I hope as many people as possible contribute to this project ;-) Lets sqeeze the last bit out of our GPUs.

Usage

Linux:

pypy is generally quicker than python

  1. apt-get install pypy -y (CentOS Installer: https://gist.github.com/baoshan/2478886)
  2. Download generate_even_more_rendezvous_points.py
  3. pypy generate_even_more_rendezvous_points.py BTC ADDRESS e.g. "pypy generate_even_more_rendezvous_points.py 19JCVCrRz7SR4ozbvGn8YX6Z479A8a5ztm"

RasPi: (Overclocking recommended)

  1. Download generate_even_more_rendezvous_points.py
  2. pypy generate_even_more_rendezvous_points.py BTC ADDRESS e.g. "pypy generate_even_more_rendezvous_points.py 19JCVCrRz7SR4ozbvGn8YX6Z479A8a5ztm"

Windows:

  1. Download prerequirements; http://www.microsoft.com/en-us/download/details.aspx?id=5582
  2. Download pypy https://bitbucket.org/pypy/pypy/downloads/pypy-2.2.1-win32.zip
  3. Download generate_even_more_rendezvous_points.py
  4. Windows key + R
  5. type "cmd" and hit enter
  6. Navigate to where you downloaded the script to e.g. "cd C:\users\jayc89\Downloads"
  7. pypy.exe generate_even_more_rendezvous_points.py BTC ADDRESS e.g. "pypy.exe generate_even_more_rendezvous_points.py 19JCVCrRz7SR4ozbvGn8YX6Z479A8a5ztm"