Here is the result of simple disk I/O benchmark with Rust with io_uring, Tokio-uring. One iteration of the benchmark spawns and awaits 2,000 async tasks. Each task reads 10 bytes from /dev/urandom
and then writes them to /dev/null
. The benchmark performs 1,000 iterations. The original Tokio version is taken from reddit and I added the Tokio-uring version.
The existing application using Tokio can be easily modified to use io_uring with Tokio-uring. However, you could see a performance degradation. For example, Tokio-uring version is about 8 times slower on EC2 c6gn.8xlarge (32 vCPUs).
A major difference between Tokio and Tokio-uring is that Tokio runs async tasks on multiple CPU cores; Tokio-uring runs tasks on only one CPU core. With Tokio-uring, your application explicitly needs to use multiple CPU cores. The careful design is necessary to pull out the Tokio-uring potential.