The memchr crate provides heavily optimized routines for searching bytes.
Dual-licensed under MIT or the UNLICENSE.
The memchr function is traditionally provided by libc, however, the
performance of memchr can vary significantly depending on the specific
implementation of libc that is used. They can range from manually tuned
Assembly implementations (like that found in GNU's libc) all the way to
non-vectorized C implementations (like that found in MUSL).
To smooth out the differences between implementations of libc, at least
on x86_64 for Rust 1.27+, this crate provides its own implementation of
memchr that should perform competitively with the one found in GNU's libc.
The implementation is in pure Rust and has no dependency on a C compiler or an
Assembler.
Additionally, GNU libc also provides an extension, memrchr. This crate
provides its own implementation of memrchr as well, on top of memchr2,
memchr3, memrchr2 and memrchr3. The difference between memchr and
memchr2 is that that memchr2 permits finding all occurrences of two bytes
instead of one. Similarly for memchr3.
memchr links to the standard library by default, but you can disable the
use_std feature if you want to use it in a #![no_std] crate:
[dependencies]
memchr = { version = "2", default-features = false }On x86 platforms, when the use_std feature is disabled, the SSE2
implementation of memchr will be used in compilers that support it. When
use_std is enabled, the AVX implementation of memchr will be used if the CPU
is determined to support it at runtime.
memchr is a routine that is part of libc, although this crate does not use
libc by default. Instead, it uses its own routines, which are either vectorized
or generic fallback routines. In general, these should be competitive with
what's in libc, although this has not been tested for all architectures. If
using memchr from libc is desirable and a vectorized routine is not otherwise
available in this crate, then enabling the libc feature will use libc's
version of memchr.
The rest of the functions in this crate, e.g., memchr2 or memrchr3, are not
a standard part of libc, so they will always use the implementations in this
crate. One exception to this is memrchr, which is an extension commonly found
on Linux. On Linux, memrchr is used in precisely the same scenario as
memchr, as described above.
