Strata is a research prototype file system, presented in SOSP 2017 (Strata).
Strata is developed and tested on Ubuntu 16.04 LTS, Linux kernel 4.8.12 and gcc version 5.4.0.
This repository contains initial source code and tests. Benchmarks will be released soon. As a research prototype, Strata has several limitations, described in the limitations section.
To run NVM emulation, your machine should have enough DRAM for testing. Kernel will reserve the DRAM for NVM emulation. Strata requires at least two partitions of NVM: operation log (1 - 2 GB) and NVM shared area (It depends on your test. I recommend to use more than 8 GB at least).
Assume current directory is a project root directory.
Make sure to initialize the repository and sub-repositories first:
git clone https://github.com/ut-osa/strata.git
git submodule init
git submodule update
./utils/change_dev_size.py [dax0.0] [SSD] [HDD] [dax1.0]
This script does the following:
- Opens
libfs/src/storage/storage.h
- Modifies
dev_size
array values with each storage size (the same as in your grub conf, see the running Strata section) in bytes.dev_size[0]
: could be always 0 (not used)dev_size[1]
: dax0.0 sizedev_size[2]
: SSD size : just put 0 for nowdev_size[3]
: HDD size : put 0 for nowdev_size[4]
: dax1.0 size
sudo apt install libssl-dev || sudo yum install openssl-devel
cd kernel/kbuild
make -f Makefile.setup .config
make -f Makefile.setup
make -j$(nproc)
sudo make modules_install ; sudo make install
sudo update-grub2 || sudo grub2-mkconfig -o /boot/grub2/grub.cfg
This step requires reboot your machine after installing the new kernel.
Building glibc might not be an easy task in some machines. We provide pre-built libc binaries in "shim/glibc-build". If you keep failing to build glibc, I recommend to use the pre-built glibc for your testing.
make -C shim
cd libfs/lib
git clone https://github.com/pmem/nvml
make -C nvml
tar xvjf jemalloc-4.5.0.tar.bz2
cd jemalloc-4.5.0
./autogen.sh
./configure
make
cd ..
make
For SPDK build errors, please check a SPDK website (http://www.spdk.io/doc/getting_started.html)
For NVML build errors, please check a NVML repository (https://github.com/pmem/nvml/)
make -C libfs
make -C kernfs
make -C kernfs/tests
make -C shim/libshim
Strata emulates NVM using a physically contiguous memory region, and relies on the kernel NVDIMM support.
You need to make sure that your kernel is built with NVDIMM support enabled (CONFIG_BLK_DEV_PMEM), and then you can reserve the memory space by booting the kernel with memmap command line option.
For instance, adding memmap=16G!8G to the kernel boot parameters will reserve
16GB memory starting from 8GB address, and the kernel will create a pmem0 block
device under the /dev directory. Adding GRUB_CMDLINE_LINUX="memmap=16G!4G, 4G!20G"
will add a pmem0 and pmem1.
Details are available at: http://pmem.io/2016/02/22/pm-emulation.html
This step requires rebooting your machine.
cd utils
sudo ./use_dax.sh bind
This instruction will change pmem emulation to use dev-dax mode.
e.g., /dev/pmem0
-> /dev/dax0
To rollback to previous setting,
sudo ./use_dax.sh unbind
This step requires rebuilding of Libfs and KernFS.
TODO: Some instructions to setup storage size (by a script or manually)
cd utils
sudo ./uio_setup.sh linux config
To rollback to previous setting,
sudo ./uio_setup.sh linux reset
cd libfs
sudo ./bin/mkfs.mlfs <dev id>
dev id is a device identifier used in Strata (hardcoded).
1 : NVM shared area (dax0.0)
2 : SSD shared area
3 : HDD shared area
4 : Operation log of processes (dax1.0)
If you encounter an error message, "mmap invalid argument", it means kernel does not allow mmap for NVM emulation. Usually, incorrect (or unaligned) setting of storage sizes (at step 3) causes the problem. Please make sure that your storage size is correct in "libfs/src/storage/storage.h"
cd kernfs/tests
make
sudo ./run.sh kernfs
cd libfs/tests
make
sudo ./run.sh iotest sw 2G 4K 1 #sequential write, 2GB file with 4K IO and 1 thread
In libfs/Makefile
, search MLFS_FLAGS
as keyword
MLFS_FLAGS = -DLIBFS -DMLFS_INFO
#MLFS_FLAGS += -DCONCURRENT
MLFS_FLAGS += -DINVALIDATION
#MLFS_FLAGS += -DKLIB_HASH
MLFS_FLAGS += -DUSE_SSD
#MLFS_FLAGS += -DUSE_HDD
#MLFS_FLAGS += -DMLFS_LOG
DCONCURRENT
- allow parallelism in libfs
DKLIB_HASH
- use klib hashing for log hash table
DUSE_SSD
, DUSE_HDD
- make LibFS to use SSD and HDD
#MLFS_FLAGS = -DKERNFS
MLFS_FLAGS += -DBALLOC
#MLFS_FLAGS += -DDIGEST_OPT
#MLFS_FLAGS += -DIOMERGE
#MLFS_FLAGS += -DCONCURRENT
#MLFS_FLAGS += -DFCONCURRENT
#MLFS_FLAGS += -DUSE_SSD
#MLFS_FLAGS += -DUSE_HDD
#MLFS_FLAGS += -DMIGRATION
#MLFS_FLAGS += -DEXPERIMENTAL
DBALLOC
- use new block allocator (use it always)
DIGEST_OPT
- use log coalescing
DIOMERGE
- use io merging
DCONCURRENT
- allow concurrent digest
DMIGRATION
- allow data migration. It requires turning on DUSE_SSD
For debugging, DIGEST_OPT, DIOMERGE, DCONCURRENT is disabled for now
Here are some common issues and how we were able to resolve them.
sudo ./run.sh kernfs
orsudo ./run.sh iotest ...
segfaults.
- Make sure to run
sudo ./bin/mkfs.mlfs
on all devices used for testing.sudo ./bin/mkfs.mlfs 1
fordax0.0
(required)sudo ./bin/mkfs.mlfs 4
fordax1.0
(required)sudo ./bin/mkfs.mlfs 2
for SSD area (only ifDUSE_SSD
is defined)sudo ./bin/mkfs.mlfs 3
for HDD area (only ifDUSE_HDD
is defined)
- KernelFS is currently implemented in user-level.
- Leases are not fully implemented.
- A directory could contain up to 1000 files.
- mmap is not supported yet.
- Benchmarks are not fully tested in all configurations. Working configurations are described in our paper.
- Can only rename inside the same directory.
- There are known bugs in fork.
For documentation on current work or for more detailed documentation about a particular feature, please check the docs directory.
Available topics: