Prototype for running predictable, IO-bound benchmarks in an ocurrent pipeline. This is work in progress.
See general instructions in ocaml-bench-scripts for configuring the benchmarking hardware. In particular, you need an isolated CPU to run the benchmarks on.
Use the —docker-cpu
parameter to pin the benchmark to a single CPU. This will pass the —cpuset-cpus
parameter to Docker behind the scenes to run the container on a single core.
The main difference from the scripts hosted in ocaml-bench-scripts and this ocurrent pipeline is that the tasks will be executed inside docker containers. This requires a few more adjustments to how the containers are launched. Most of this is handled automatically by the pipeline by passing parameters to Docker. Some additional details are documented below.
Before you can start the pipeline, you need to setup the Postgres instance.
The docker-compose
file will both
- setup the postgres instance
- start the graphql engine
First you will need to build the docker images, and pass the build argument to the docker-compose build
command.
docker-compose build --build-arg FRONTEND_DIRECTORY="<location of frontend directory>"
You can clone the frontend from here.
You can then run,
$ docker-compose up -d
to start the postgres database, grahpql engine and the frontend.
The postgres instance will be available on port 5432
and if youwant to access it locally you can run:
$ psql -h localhost:5432 -U docker
Build the pipeline:
$ opam install --deps-only .
# build
$ dune build pipeline.exe
Run the pipeline:
./_build/default/pipeline.exe mirage/index --github-token-file <your_github_token> --docker-cpu 3 -v --oauth-user <user_github_user_name> --conn-info "host=/var/run/postgresql"
You can find more options for different configurations, posting message to slack, etc by running --help
to the pipeline executable.
If you want to enroll your repository or setup this benchmark repository for your repository, we make the following assumptions.
- There is a dune bench target which can run the benchmarks.
- The benchmarks results are json with of the following format:
{
"results" : [
{
"name": <name-of-the-benchmarks>,
"metrics": {
"<metric-1>": "",
"<metric2>": "",
...
},
...
}
]
}
Here's an example from index with regards to what the format looks like.
The metadata about repo
, branch
and commit
is added by the pipeline. Your repo only needs to output the results as a json.
Follow instructions from instructions here: https://hasura.io/docs/1.0/graphql/manual/deployment/docker/index.html
The results of IO bound benchmarks can vary greatly between different device/storage types and how they are configured. For this prototype we’re aiming for predictable results so we are using an in-memory tmpfs
partition in /dev/shm
for all storage.
The —docker-shm-size
parameter can be passed to the pipeline to adjust the size of the tmpfs
partition. The default is 4G.
tmpfs
partitions are similar to ramfs
partitions in that the content will be stored entirely in internal kernel cache, but they have a size limitation and may trigger swapping. It is therefore important to make sure that the system is configured in such a way that swapping doesn’t occur while the benchmark is running. For more details about tmpfs
/ramfs
see https://www.kernel.org/doc/Documentation/filesystems/tmpfs.txt.
If running on a system with NUMA enabled the tmpfs
file system should be allocated in a memory area that is local to the core running the benchmark. Otherwise the kernel could allocate this in different areas over time and affect the IO performance results. To avoid this issue, the tmpfs
volume can be created with a specific memory allocation policy.
The pipeline provides a —docker-numa-node
command line parameter that forces the tmpfs
volume in /dev/shm
to be allocated from a specific NUMA node. lscpu
shows which NUMA nodes are local to each core.
NOTE: Although it should be possible to get good results on a NUMA enabled system, we do not plan to use this in production and have limited experience with it. The main reason is that the system wide optimisations required would likely reduce performance for general tasks, while the benchmark itself only runs on a single core. This makes it more suitable to run on a dedicated, smaller server, which typically has less memory and doesn’t require NUMA.
ASLR affects performance as the memory layout is changed each time the benchmark is loaded. The ocurrent pipeline disables ASLR inside the container automatically by wrapping the benchmark command in a call to setarch [...] --addr-no-randomize
. This is normally blocked by the default Docker seccomp profile, so we have modified the profile to allow personality(2)
to be invoked with the ADDR_NO_RANDOMIZE
flag.