Build Status | |
---|---|
Travis |
This repository demonstrates how to minimize the size of a Rust binary.
By default, Rust optimizes for execution speed rather than binary size, since for the vast majority of applications this is ideal. But for situations where a developer wants to optimize for binary size instead, Rust provides mechanisms to accomplish this.
By default, cargo build
builds the Rust binary in debug mode. Debug mode disables many
optimizations, which helps debuggers (and IDEs that run them) provide a better debugging
experience. Debug binaries can be 30% or more larger than release binaries.
To minimize binary size, build in release mode:
$ cargo build --release
Note: Looking for a tool to help automated
strip
ing? Check outcargo-strip
or follow Cargo #3483.
See also:
sstrip
a small utility that removes a few bytes from an executable that strip leaves behind.sstrip
should be run afterstrip
.
By default on Linux and macOS, symbol information is included in the compiled .elf
file. This
information is not needed to properly execute the binary.
To remove this, run strip
on the .elf
file:
$ strip target/release/min-sized-rust
Available starting 1.45.0-nightly (2020-05-28)
,
Cargo has strip
functionality built in:
$ cargo +nightly build -Z strip=symbols
Cargo defaults its optimization level to 3
for release builds,
which optimizes the binary for speed. To instruct Cargo to optimize for minimal binary
size, use the z
optimization level in
Cargo.toml
:
[profile.release]
opt-level = 'z' # Optimize for size.
By default, Cargo instructs compilation units to be compiled and optimized in isolation. LTO instructs the linker to optimize at the link stage. This can, for example, remove dead code and often times reduces binary size.
Enable LTO in Cargo.toml
:
[profile.release]
lto = true
As of Rust 1.32,
jemalloc
is removed by default.
If using Rust 1.32 or newer, no action is needed to reduce binary size regarding this
feature.
Prior to Rust 1.32, to improve performance on some platforms Rust bundled jemalloc, an allocator that often outperforms the default system allocator. Bundling jemalloc added around 200KB to the resulting binary, however.
To remove jemalloc
on Rust 1.28 - Rust 1.31, add this code to the top of main.rs
:
use std::alloc::System;
#[global_allocator]
static A: System = System;
By default, Cargo specifies 16 parallel codegen units for release builds. This improves compile times, but prevents some optimizations.
Set this to 1
in Cargo.toml
to allow for maximum size reduction optimizations:
[profile.release]
codegen-units = 1
Note: Up to this point, the features discussed to reduce binary size did not have an impact on the behaviour of the program (only its execution speed). This feature does have an impact on behavior.
By default, when Rust code encounters a situation when it must call panic!()
,
it unwinds the stack and produces a helpful backtrace. The unwinding code, however, does require
extra binary size. rustc
can be instructed to abort immediately rather than unwind, which
removes the need for this extra unwinding code.
Enable this in Cargo.toml
:
[profile.release]
panic = 'abort'
Note: See also the nightly Cargo
-Z build-std
feature , which will likely evolve into a replacement for much of what Xargo currently does.
Note: Xargo is currently in maintenance status, but eventually the features used below should make their way into Cargo.
Example project is located in the
xargo
folder.
Rust ships pre-built copies of the standard library (libstd
) with its toolchains. This means
that developers don't need to build libstd
every time they build their applications. libstd
is statically linked into the binary instead.
While this is very convenient there are several drawbacks if a developer is trying to aggressively optimize for size.
-
The prebuilt
libstd
is optimized for speed, not size. -
It's not possible to remove portions of
libstd
that are not used in a particular application (e.g. LTO and panic behaviour).
This is where Xargo comes in. Xargo is able to compile
libstd
with your application from the source. It does this with the rust-src
component that
rustup
conveniently provides.
Add a Xargo.toml
file to the root of your project
(this doesn't replace Cargo.toml
, just is in addition):
[dependencies]
std = {default-features=false}
Install the appropriate toolchain and Xargo:
$ rustup toolchain install nightly
$ rustup override set nightly
$ rustup component add rust-src
$ cargo install xargo
Build using Xargo:
# Find your host's target triple.
$ rustc -vV
...
host: x86_64-apple-darwin
# Use that target triple when building with Xargo.
$ xargo build --target x86_64-apple-darwin --release
Remember to strip
the resulting executable. On macOS, the final binary size is reduced to 51KB.
Example project is located in the
panic_immediate_abort
folder.
Even if panic = abort
is specified in Cargo.toml
, rustc
will still include panic strings
and formatting code in final binary by default.
An unstable panic_immediate_abort
feature
has been merged into the nightly
rustc
compiler to address this.
To use this, repeat the instructions above to use Xargo, but instead use the following
Xargo.toml
:
[dependencies]
std = {default-features=false, features=["panic_immediate_abort"]}
Remember to strip
the resulting executable. On macOS, the final binary size is reduced to 30KB.
Example project is located in the
no_main
folder.
This section was contributed in part by @vi
Up until this point, we haven't restricted what utilities we used from libstd
. In this section
we will restrict our usage of libstd
in order to reduce binary size further.
If you want an executable smaller than 20 kilobytes, Rust's string formatting code,
core::fmt
must
be removed. panic_immediate_abort
only removes some usages of this code. There is a lot of other
code that uses formatting in some of cases. That includes Rust's "pre-main" code in libstd
.
By using a C entry point (by added the #![no_main]
attribute) , managing stdio manually, and
carefully analyzing which chunks of code you or your dependencies include, you can sometimes
make use of libstd
while avoiding bloated core::fmt
.
Expect the code to be hacky and unportable, with more unsafe{}
s than usual. It feels like
no_std
, but with libstd
.
Start with an empty executable, ensure
xargo bloat --release --target=...
contains no
core::fmt
or something about padding. Add (uncomment) a little bit. See that xargo bloat
now
reports drastically more. Review source code that you've just added. Probably some external crate or
a new libstd
function is used. Recurse into that with your review process
(it requires [replace]
Cargo dependencies and maybe digging in libstd
), find out why it
weighs more than it should. Choose alternative way or patch dependencies to avoid unnecessary
features. Uncomment a bit more of your code, debug exploded size with xargo bloat
and so on.
On macOS, the final stripped binary is reduced to 8KB.
Example project is located in the
no_std
folder.
Up until this point, our application was using the Rust standard library, libstd
. libstd
provides many convenient, well tested cross platform APIs and data types. But if a user wants
to reduce binary size to an equivalent C program size, it is possible to depend only on libc
.
It's important to understand that there are many drawbacks to this approach. For one, you'll
likely need to write a lot of unsafe
code and lose access to a majority of Rust crates
that depend on libstd
. Nevertheless, it is one (albeit extreme) option to reducing binary size.
A strip
ed binary built this way is around 8KB.
#![no_std]
#![no_main]
extern crate libc;
#[no_mangle]
pub extern "C" fn main(_argc: isize, _argv: *const *const u8) -> isize {
// Since we are passing a C string the final null character is mandatory.
const HELLO: &'static str = "Hello, world!\n\0";
unsafe {
libc::printf(HELLO.as_ptr() as *const _);
}
0
}
#[panic_handler]
fn my_panic(_info: &core::panic::PanicInfo) -> ! {
loop {}
}
Up until this point, all size-reducing techniques were Rust-specific. This section describes a language-agnostic binary packing tool that is an option to reduce binary size further.
UPX is a powerful tool for creating a self contained, compressed binary with no addition runtime requirements. It claims to typically reduce binary size by 50-70%, but the actual result depends on your executable.
$ upx --best --lzma target/release/min-sized-rust
It should be noted that there have been times that UPX-packed binaries have flagged heuristic-based anti-virus software because malware often uses UPX.
cargo-bloat
- Find out what takes most of the space in your executable.
- Why is a Rust executable large? - 2016
- Freestanding Rust Binary - 2018
- Tiny Rocket - 2018
- Formatting is Unreasonably Expensive for Embedded Rust - 2019
- Tiny Windows executable in Rust - 2019
- Reducing the size of the Rust GStreamer plugin - 2020
min-sized-rust-windows
- Windows-specific tricks to reduce binary size