Enhance Decoding Performance
I am having performance issues decoding large inputs (>10KB). The issue may be with bs58-rs, not this project.
| function | decode 1 MB (seconds) | decode 256 B, timeit 5k (seconds) |
|---|---|---|
| `gmpy2.mpz(data, base=58)` | 0.0387 | 0.0472 |
| `recursive_base58_decode(data)` | 0.256 | 0.508 |
| `based58.b58decode(data)` | 21.6 | 0.251 |
| `base58.b58decode(data)` | 86.4 | 1.13 |
I don't know if there is a base58 project that uses rug or similar for better large integer performance.
Thanks for raising this. This library is a very thin wrapper around bs58-rs, so I'm not sure there's anything that can be done on our side, but I'll leave this open in case anyone has any ideas.
This nested loop is the upstream issue: for each input character it rescans the entire accumulator built so far, so decoding is quadratic in the input length. https://github.com/Nullus157/bs58-rs/blob/pr%C4%ABmum/src/decode.rs#L307
```rust
for byte in &mut output[..index] {
    val += (*byte as usize) * 58;
    *byte = (val & 0xFF) as u8;
    val >>= 8;
}
```
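For context, here is a simplified, self-contained sketch of the same byte-wise accumulation (illustrative only, not the upstream code; `decode_digits_quadratic` is a made-up name). The inner loop walks the whole accumulator for every input digit, which is where the quadratic cost comes from.

```rust
/// Illustrative sketch of the byte-wise accumulation quoted above (not the
/// upstream implementation). `digits` are big-endian base-58 digit values.
fn decode_digits_quadratic(digits: &[u8]) -> Vec<u8> {
    // Little-endian byte accumulator of the decoded integer.
    let mut output: Vec<u8> = Vec::new();
    for &digit in digits {
        let mut val = digit as usize;
        // Multiply the whole accumulator by 58 and add the new digit,
        // propagating the carry byte by byte; this rescan is the O(n^2) part.
        for byte in output.iter_mut() {
            val += (*byte as usize) * 58;
            *byte = (val & 0xFF) as u8;
            val >>= 8;
        }
        // Append any remaining carry bytes.
        while val > 0 {
            output.push((val & 0xFF) as u8);
            val >>= 8;
        }
    }
    output.reverse(); // return big-endian bytes
    output
}
```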
Using rug or num_bigint would almost certainly be better. rug has an unsafe method, `assign_bytes_radix_unchecked()`, and num_bigint has `from_radix_be()`. I expect `assign_bytes_radix_unchecked()` would be faster for very large inputs, but I don't know what additional cleaning of the input would be required.
```rust
use num_bigint::BigUint;

// Input already translated from alphabet characters to base-58 digit values.
let translated_input = vec![1u8, 1, 1];
// 1*58^2 + 1*58 + 1 = 3423 = 0x0D5F
let b58_biguint = BigUint::from_radix_be(&translated_input, 58).unwrap();
let decoded_vec = b58_biguint.to_bytes_be();
assert_eq!(decoded_vec, "\r_".as_bytes());
```
Cleaning the input once, using BigUint, and moving the data back into the output appears to be significantly faster for the 256_bytes/decode_bs58 benchmark.
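For illustration, here is a minimal end-to-end sketch of that approach (not the library's actual code; the `b58decode_biguint` name, the hard-coded Bitcoin alphabet, and the error handling are my own assumptions): translate the input to digit values once, do a single BigUint conversion, and re-attach the leading zero bytes that leading '1' characters encode, since BigUint drops them.

```rust
use num_bigint::BigUint;

// Bitcoin base58 alphabet (assumed here for the sketch).
const ALPHABET: &[u8; 58] = b"123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz";

fn b58decode_biguint(input: &[u8]) -> Option<Vec<u8>> {
    // Reverse lookup table: ASCII byte -> base-58 digit value, 0xFF = invalid.
    let mut lookup = [0xFFu8; 256];
    for (i, &c) in ALPHABET.iter().enumerate() {
        lookup[c as usize] = i as u8;
    }

    // Clean the input once: map every character to its digit value.
    let mut digits = Vec::with_capacity(input.len());
    for &c in input {
        match lookup[c as usize] {
            0xFF => return None, // character outside the alphabet
            d => digits.push(d),
        }
    }

    // Leading '1' characters encode leading zero bytes, which BigUint drops.
    let leading_zeros = digits.iter().take_while(|&&d| d == 0).count();
    let mut out = vec![0u8; leading_zeros];

    if leading_zeros < digits.len() {
        // One big-integer conversion instead of the per-byte quadratic loop.
        let big = BigUint::from_radix_be(&digits[leading_zeros..], 58)?;
        out.extend_from_slice(&big.to_bytes_be());
    }
    Some(out)
}
```

As a quick check against the earlier snippet, `b58decode_biguint(b"222")` (where each '2' translates to digit value 1) should return the bytes `b"\r_"`.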