/ShuffleProofs.jl

Verificatum compatible verifier and prover for NIZK proofs of shuffle

Primary LanguageJuliaMIT LicenseMIT

ShuffleProofs.jl

codecov

Cryptography is often looked at as a tool to secure and make communications confidential. It only requires to have a digital signature algorithm (DSA), Diffie Hellman key exchange (computation) and a good block cypher to satisfy the security requirements of 99% (metaphorically) of current online marketplaces and internet banking. However, security with those tools comes with a price of privacy and the necessity to trust the other end to keep your transactions private.

A much more difficult case is present for systems that do require a high order of privacy and transparency, such as voting and auctions where an authority can not be blindly trusted, either with security or privacy. Complex multiparty protocols can be structured so that security and privacy do not lie in the hands of few. However, eliminating the effects of dishonest participants does require synchronous interactions between participants and thus is not scalable.

Cryptography does offer an alternative known as zero-knowledge proofs (ZKP), which can often eliminate the need for synchronous interactions between participants to obtain an honest result. An excellent example is the zero-knowledge proof of shuffle, which we shall illustrate with a simple voting system.

(Actually, arguments, but a common reference string can be constructed in a verifiably random way)

It's best to illustrate it with the ElGamal cryptosystem. Let's say that officials have set up keypair sk, pk = g^sk and let everyone know g, pk.

  1. A voter selects an option corresponding to a message m, encrypts it with a freely secretly chosen randomisation factor r and obtains a tuple c <- (g^r, m*pk^r), which they submit to a bulletin board.
  2. The votes are re-encrypted and shuffled
    • each received ciphertext is re-encrypted as c′ <- (c[1]*g^p, c[2] * pk^p)
    • all re-encrypted ciphertexts are sorted, constituting a shuffle, and published to a bulletin board
  3. The authority who knows the secret key sk (pk = g^sk) decrypts each ciphertext c′ as m <- c′[2] * c′[1]^(-sk) and publishes that to a bulletin board.

This scheme, however, does pose serious drawbacks to election integrity. If the election authority is malicious, it can deviate from the re-encryption procedure in step 2. omit and substitute its own desired ciphertexts. Similarly, in the third step, nothing prevents the adversary from publishing their own desired decryption at step 3.

Fortunately, zero-knowledge proofs are here to make such a system verifiable. Even if the adversary controlled, all election authorities would not be able to add, remove or modify votes.

An example situation

Let's consider the voting system above. To simplify the illustration, we shall use let ... end block as a representation of the privacy barrier between different computers.; bbord for messages published to a public bulletin board apart from protocol setup parameters g, pk; options shall represent a valid set of election candidates which voters can choose from, and rand functions their free will. Then the voting system can be represented as a code as follows:

# This code yet to be tested
using ShuffleProofs

sk = 123 # Only authorithy who do decryption would know

g = PrimeGenerator(23)
options = [g, g^2, g^3] 
pk = g^sk

bbord = (;) # A representation of a buletin board

######## Step 1. ########

bbord = let
    enc = Enc(pk, g)
    ciphertexts_in = [enc(options[rand(1:3)], rand(1:10)) for i in 1:3]
    (; bbord..., ciphertexts_in)
end

######## Step 2. ########

bbord = let
    enc = Enc(pk, g)
    proposition, secret = shuffle(bbord.ciphertexts_in, enc) # reencrypts and sorts output
    ciphertexts_out = proposition.𝐞′
    (; bbord..., ciphertexts_out)
end

####### Step 3. ########

bbord = let
    dec = Dec(sk)
    votes = dec(bbord.ciphertexts_out)
    (; bbord..., votes)
end

where ellection outcome is afterwards obtainned by tallying like count(x-> x == options[1], bbord.votes). Alternatively, homomorphic tallying could also be used, making it easier to airgap the system with a secret key.

(A confused reader may assert that authority knowing the secret key, knows how every participant had voted. However, this is not so bad in practice. The last step of decryption can be done between multiple parties so no single party knows the full secret key. Alternatively, the decryption can happen in a gapless computer put in a Faraday cage only for ciphertexts which come out of reencryption step 2. A curious reader may delve into details Helios voting system, after which the example is modelled)

Although all information is accessible to a bulletin board, we can not convince the voters that their votes have been properly counted and that adversary as corrupt authority or malware have not deviated from honest shuffling or decryption (steps 2 or 3) where it can add, modify or remove the votes.

Proving a correct shuffle

When we did a shuffle at step 2, the shuffle function returned two arguments: proposition and secret. The proposition contains all information about the statement whose truth should be asserted by a piece of evidence. The secret contains re-encryption factors and permutations. This can be used as evidence to validate the truth of the statement:

verify(proposition, secret)

However, if secret is used as evidence, it can't be made public; otherwise, it would violate the anonymity of the voters. Instead, we really want to prove that we know a secret such that verify(proposition, secret) == true, which in the literature is known as zero-knowledge proofs and, in particular, proofs of shuffle.

Currently, WikstromTerelius proof of shuffle is the most widely used protocol implemented in Verificatum, CHVote and has been used to add verifiability for remote electronic voting systems in Estonia, Norway, Switzerland and others1. Verification, in particular, has a diligently written specification encouraging anyone willing to implement an independent verifier2 so that proofs would not need to be trusted at face value. On the other hand, Haines3 provides the most concise pseudocode for anyone willing to implement the prover but does deviate with verifier implementation from the Verifiactum protocol. The ShuffleProofs.jl implements a Verificatum verifier and compatible prover (so that the original Java code for Verificatum, in principle, could validate the integrity of a proof generated with ShuffleProofs.jl).

To prove a statement for a corresponding proposition, it is necessary to instantiate a verifier which challenges the prover. A non-interactive zero-knowledge verifier compatible with Verifiactum can be initiated as:

verifier = ProtocolSpec(; g) 

It contains details on how independent basis vectors are chosen in a verifiable way, how challenges are computed from provers outputs like hash function to be used for random oracles and pseudo-random number generators, and how many bits are needed for pseudorandom numbers.

With such a verifier, a proof can then be constructed easily as:

proof = prove(proposition, secret, verifier)

Now finally, the proposition that no votes have been added, modified (spoiled) or removed at step 2. can be verified as:

verify(proposition, proof, verifier)

where proof and verifier can be published on the bulletin board along with the proposition so that everyone can verify the integrity of elections.

Lastly, the SuffleProofs.jl implements a shuffle(e::ElGamal, g, pk, verifier::Verifier)::Simulator, which does the proof generation at the time of shuffling for convenience. The returned type then is Simulator which contains three fields simulator.proposition (discussed earlier), simulator.verifier and lastly, the proof as simulator.proof. The simulator contains all necessary information and can be published on the bulletin board (The exact way duplication of data is avoided up to the user of the library to decide).

Verifying Verificatum generated proof of shuffle

Verification of Verifiactum generated proof is quite simple. First, a simulator needs to be generated, which can be done as follows:

simulator = load_verificatum_simulator(DEMO_DIR)

which parses XML, protocolSpec.xml and other relevant files encoded in binary tree notation as specified in the Verificatum verifier document. When loaded, the simulator can be verified:

verify(simulator)

CUSTOMISING verifier

Initially, I implemented described prover and verifier, directly resembling the pseudocode Haines3 paper and then later implemented the Verificatum verifier2. Unfortunately, implementing the Verificatum prover from the research papers was not within my capabilities, so I focused on merging both implementations. It became clear quickly that specifying challenges as multiple-function arguments would be highly cumbersome. To alleviate that, I made a verifier as an entry point of the finite state machine (FSM).

Every verifier state does have two methods step(verifier::Verifier, msg)::Verifier where msg is new commitments from prover at a particular stage of execution. The other method is challenge(verifier::Verifier), which returns a challenge to the prover upon which the protocol's security lies.

For example, let's consider a verifier, HonestVerifier, which has chosen all challenges in advance. Although it is insecure, it will help to illustrate the point. We shall use the PoSChallenge type, which runs the last verification step, after which challenges from proof have been computed. The resulting implementation can be represented as:

import ShuffleProofs: step, challenge, PoSChallenge

@enum VState Config Init PermCommit PoSCommit

struct HonestVerifier{T} <: Verifier
    challenge::PoSChallenge
end

HonestVerifier(challenge::PoSChallenge) = HonestVerifier{Config}(challenge)
HonestVerifier{T}(verifier::HonestVerifier) where T = HonestVerifier{T}(verifier.challenge)

PoSChallenge(verifier::HonestVerifier{PoSCommit}) = verifier.challenge

step(verifier::HonestVerifier{Config}, proposition::Shuffle) = HonestVerifier{Init}(verifier)
step(verifier::HonestVerifier{Init}, 𝐜) = HonestVerifier{PermCommit}(verifier)
step(verifier::HonestVerifier{PermCommit}, 𝐜̂, t) = HonestVerifier{PoSCommit}(verifier)

challenge(verifier::HonestVerifier{Init}) = (verifier.challenge.𝐡, verifier.challenge.𝐡[1])
challenge(verifier::HonestVerifier{PermCommit}) = verifier.challenge.𝐮
challenge(verifier::HonestVerifier{PoSCommit}) = verifier.challenge.c

HonestVerifier here is a parametrised struct with enums, but in practice, each state does have its own relevant variables, and it makes sense to represent them with their own types. For example, see Verificatum verifier implementation in the src/vverifier.jl.

The generality of the verifier can easily be extended to satisfy all kinds of variations available of WikstromTerelius proof of shuffle. For instance, an interactive protocol can be implemented in each challenge method to read from IO. Or verifier for CHVote, which computes the generator basis differently, can be supported by a subtyping verifier.

Limitations

  • Only prime groups are currently supported (NIST implementations of elliptic groups are coming)
  • Currently, only proof of shuffle can be verified for the Verificatum protocol. Implementing verification of correct threshold decryption has a low priority.
  • The current Julia implementation for the proof of shuffle is likely around 10...100 slower than the Verificatum Java library, as little care has been taken to make code type stable and minimise the number of allocations.

Braiding

An alternative use of proof of shuffle is to form knot-like structures or braids where inputs are related to outputs only through privately known exponents. As an example, consider private exponents y = [2, 3, 4] and some generator g. A list of public keys is Y = [g^2, g^3, g^4].

We can consider a mix which raises every public key in Y by a secret factor x and shuffles the list producing Y' = [Y_3^x, Y_1^x, Y_2^x] and publishes that together with a new relative generator h = g^x.

In such a situation, owners of the private exponents can compute public keys with a relative generator h as [h^2, h^3, h^4], which happens to be also available in Y'. This is useful as the owners of private exponents can issue anonymous signatures on messages with respect to relative generator h while asserting their membership to the group. This is useful in the context of voting system design or the protection of whistleblowers.

The mix, however, in the above systems has the freedom to deviate from intended behaviour and replace undesirable public keys with his own. To avoid that, a zero-knowledge proof of shuffle can be used together with zero-knowledge proof of decryption to form a proof of correct braiding.

To do such braiding, which produces proof of correctness ShuffleProofs.jl provides a braid(generator::G, members::Vector{G}, verifier::Verifier)::Simulator. A self-explanatory example is provided below:

import CryptoGroups: curve, ECGroup, generator, specialize
import ShuffleProofs: ProtocolSpec, braid, verify, output_generator, output_members

_curve = curve("P-256")
G = specialize(ECGroup, _curve, name = :P_256)
g = G(generator(_curve))

y = [4, 2, 3]
Y = g .^ y

verifier = ProtocolSpec(;g)
simulator = braid(g, Y, verifier)

@assert verify(simulator)

h = output_generator(simulator.proposition)
Y′ = output_members(simulator.proposition)

@assert sort(h .^ y) == sort(Y′)  

Current progress

  • Binary tree parser
  • Random oracles according to specification2 (according to verifier document)
  • Independent generators (as generated from output vmnv -t bas.h)4
  • Verifying ciphertexts shuffling by decrypting with a secret key. (Tests correctness of ElGamal and correct input of the key)
  • Verifying NIZK proof
    • Generation of a proof to be parsed
    • Parsing of proof ouputs into relevant variables.
    • NIZK verifier. (Partially done using other reference)53
  • Feed in the verifier in WikstromTerelius.jl to obtain a proof
  • Cleanup
    • Implement checking proof from multiple parties
    • Adding code for F in the Verificatum verifier
    • Upstream to CryptoGroups.jl
    • Remove g and pk from ProtocolSpec
    • Consistent notation between g, G, 𝓰, 𝓖
    • Adding verbose::Bool option to verify and evalueate the return value
    • Abstract cryptographic operations in Haines proof and find a way to remove _a, _b
    • Strong random numbers in the proofs (pass as function argument)
    • Make releavnt types concrete
    • t₃ sensitive to randomization factors (to investigate).
  • Consider moving out Verificatum tests into a seperate repository and automate their generation.
  • Elliptic groups
    • Implementation of fields Fp and F2 (done internally)
    • Elliptic curve point multiplication by an integer (done internally; tested on P-192 curve)
    • Upstream and expose curve implementations in CryptoGroups.jl
    • Elliptic curve basis generation (done in CryptoGroups.jl)
    • Test that prover and verifier works also with elliptic groups
    • Parser for proposition (group, public_key, cyphertexts)
    • Parser for proofs
  • Benchmarks
  • Storing the simulator in convinient directory structure
  • Storing the simulator in Verificatum understandable way
  • Decryption proofs
  • Documentation
    • Put a first draft of README.md
    • Add some docstrings in the code
    • Set up docummenter
    • Add examples and make them run

Footnotes

  1. verificatum.org

  2. Wikstrom, “How To Implement A Stand-Alone Verifier for the Verificatum Mix-Net.” 2 3

  3. Haenni et al., “Pseudo-Code Algorithms for Verifiable Re-Encryption Mix-Nets.” 2 3

  4. Wikström, “User Manual for the Verificatum Mix-Net.”

  5. Wikström, “A Commitment-Consistent Proof of a Shuffle.”