/coupling-from-the-past

Generates uniformly random alternating sign matrices using coupling from the past

Primary LanguageC++MIT LicenseMIT

Random alternating sign matrices (and other) via monotone coupling from the past

Idea

The ideas behind this repo are:

  • to have a basic fast modern implementation of the coupling from the past exact random sampling algorithm of Propp and Wilson;
  • to have (so far an ugly but) a pedagogical example of said algorithm, the reason being that the algorithm and its implementations can become rather subtle rather quickly (see the book by Häggström in the references), and that very little in terms of implementation is available in the public domain.

Introduction

This repo generates (for now) uniformly random alternating sign matrices using the Markov Chain Monte Carlo method of coupling from the past (the Propp--Wilson algorithm).

The main files so far in the src directory are and do as follows:

  • rasm.cpp and rasm.h generate a uniformly random alternating sign matrix (ASM), while parsing input from the command line;
  • rasm_lib.cpp and rasm_lib.h contain the main routines used in monotone coupling from the past; rasm_lib.cpp is also bound into Cython, see below;
  • rasm_sage.pyx is a Cython binding doing the random sampling; it can be used from within sage or Python;
  • rasm_basic.cpp is an all-in-one file containing a simplified (less optimized) implementation of the above; it is meant for pedagogical purposes and while very fast, it is less so than the above, for improved code readability;
  • rasm_basic.py is a thin wrapper of rasm_basic.cpp written in Python, for people who don't want to compile C++ code themselves or want to interface with an 'easier' language;

For more on coupling from the past and alternating sign matrices, see the references below.

Usage

Basic usage

For basic usage:

  • compile with (default compiler is g++, optimized with -O3):

    make

  • use as:

    ./rasm -help

    ./rasm 10 -asm

    ./rasm 100 -asm_file

    ./rasm 1000 -asm_file -initial 4194304 (optimized for sampling a size 1000 ASM, takes about 8-10 hours)

Learning curve usage

If you want to learn a bit about the algorithm, please read the file rasm_basic.cpp. It's all-in-one, everything is in there:

  • compile with g++ -O3 rasm_basic.cpp -o rasm_basic

  • use as above, i.e.

    ./rasm_basic 100 -asm_file -initial 16384

  • call from Python

    python3 rasm_basic.py 15

Usage with SageMath or Cython

If you have SageMath installed, or just Cython, you can try the following examples from within the REPL to generate a 10 x 10 random ASM once you've started the REPL from within the src directory (example below is for sage):

  • sage: %runfile rasm_sage.pyx

  • sage: a = rasm(10, initial=int(2 ** 7), verbose=True)

  • sage: pprint_asm(a)

  • sage: pprint_asm(a, symbols="+-")

  • for example:

    sage: %runfile rasm_sage.pyx
    Compiling ./rasm_sage.pyx...
    sage: a = rasm(10, initial=int(2 ** 7), verbose=True)
    Random ASM of order 10 x 10 generated after 128 steps.
    Elapsed time: 0.0002 seconds.
    sage: pprint_asm(a, symbols="+x")
                     +   
             +           
         +               
           + x +         
       + x   + x +       
     + x           +     
         +   x +     x + 
       +                 
             +   x   +   
                 +  
    sage: pprint_asm(a)
    0  0  0  0  0  0  0  0  1  0
    0  0  0  0  1  0  0  0  0  0
    0  0  1  0  0  0  0  0  0  0
    0  0  0  1 -1  1  0  0  0  0
    0  1 -1  0  1 -1  1  0  0  0
    1 -1  0  0  0  0  0  1  0  0
    0  0  1  0 -1  1  0  0 -1  1
    0  1  0  0  0  0  0  0  0  0
    0  0  0  0  1  0 -1  0  1  0
    0  0  0  0  0  0  1  0  0  0

Timing

Below are some basic running times (for the main algorithm only, not the wall time) on a 2015 Macbook Pro Retina 13" with 8GB of RAM. Note the optimizations using the -initial flag.

 % ./rasm_basic 100 -asm_file
Using random seed 1050614381.
Random ASM of order 100 x 100 generated after 16384 steps.
Elapsed time: 3.7235 seconds.

 % ./rasm_basic 100 -asm_file -initial 16384
Random ASM of order 100 x 100 generated after 16384 steps.
Elapsed time: 1.8513 seconds.

 % ./rasm_basic 100 -asm_file -initial 16384
Using random seed -13878880.
Random ASM of order 100 x 100 generated after 32768 steps.
Elapsed time: 5.3178 seconds.

 % ./rasm_basic 100 -asm_file -initial 32768
Using random seed 462488753.
Random ASM of order 100 x 100 generated after 32768 steps.
Elapsed time: 3.4711 seconds.

Finally, here is a comparison with the Python implementation in sage. The speed-up seems on the order of 200-300x. Again note the optimizations using the initial argument.

sage: %timeit AlternatingSignMatrices(30).random_element()
7.35 s ± 1.22 s per loop (mean ± std. dev. of 7 runs, 1 loop each)
sage: %runfile rasm_sage.pyx
Compiling ./rasm_sage.pyx...
sage: %timeit rasm(30, initial=int(2 ** 9))
28.1 ms ± 3.63 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
sage: %timeit rasm(30, initial=int(2 ** 10))
21.8 ms ± 1.6 ms per loop (mean ± std. dev. of 7 runs, 100 loops each)
sage: %timeit rasm(30, initial=int(2 ** 11))
22.6 ms ± 2.9 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)
sage: %timeit rasm(30, initial=int(2 ** 12))
39.9 ms ± 207 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)
sage: 7350 / 21.8
337.155963302752

Note

From a software engineering point of view, the file rasm_basic.cpp is rather ugly, containing everything in one file. While I do plan some improvements to it, it will always contain everything inside. The reason is very basic: coupling from the past is a rather tricky algorithm, and so having everything in one file is there for pedagogical reasons. If C++ had a decent notebook interface, I would use a notebook instead (and perhaps I will in the future).

References

Below are a few references, at various levels of technicality, for the description above.