An implementation of a parallel word indexer program in Rust.
$ cargo build --release
$ cargo run --release <path_to_config_file>
Or, if there is an index.cfg file in the root directory of the project, you can use:
$ cargo run --release
To run the program multiple times and get the minimum total time, you can use the run.py script:
$ ./run.py <number_of_runs> <path_to_config_file>
Here is an example index.cfg:
indir="data" # Directory to index
out_by_a="res_a.txt" # Results sorted alphabetically
out_by_n="res_n.txt" # Results sorted by count
indexing_threads=4
merging_threads=2
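
To make the config format concrete, here is a minimal sketch of how a file like this could be read. It is not the project's actual parser: the function name `parse_config` is hypothetical, and the sketch assumes that `#` always starts a comment (so values never contain `#`) and that values are optionally wrapped in double quotes.

```rust
use std::collections::HashMap;

/// Hypothetical helper: parses `key=value` lines into a map.
/// Assumptions: `#` starts a comment, values may be double-quoted,
/// and lines without `=` are silently skipped.
fn parse_config(text: &str) -> HashMap<String, String> {
    let mut map = HashMap::new();
    for line in text.lines() {
        // Keep only the part before the first '#', then trim whitespace.
        let line = line.split('#').next().unwrap_or("").trim();
        if line.is_empty() {
            continue;
        }
        if let Some((key, value)) = line.split_once('=') {
            // Strip surrounding whitespace and optional double quotes.
            let value = value.trim().trim_matches('"');
            map.insert(key.trim().to_string(), value.to_string());
        }
    }
    map
}
```

With the example above, `parse_config` would map `indir` to `data`, and numeric fields such as `indexing_threads` could then be converted with `parse::<usize>()`.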
Result files ("guttenberg_2020_03_06.iso", dir 3/):
res_n.txt: https://drive.google.com/file/d/19Pouub10YKdXOzlSXAam0l4D3MTKNP3i/view?usp=sharing
res_a.txt: https://drive.google.com/file/d/1xrSIDJi9Azt1Amf8W3XDL0mEUsZicBQH/view?usp=sharing
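
The two result files hold the same word counts in different orders. The sketch below shows one way the two orderings could be produced from a map of counts. The function names are hypothetical, and the details are assumptions rather than the project's confirmed behavior: in particular, the count ordering is assumed to be descending, with ties broken alphabetically.

```rust
use std::collections::HashMap;

/// Hypothetical helper: entries ordered alphabetically by word (as in res_a.txt).
fn sorted_by_word(counts: &HashMap<String, usize>) -> Vec<(&str, usize)> {
    let mut entries: Vec<(&str, usize)> =
        counts.iter().map(|(w, &n)| (w.as_str(), n)).collect();
    entries.sort_by(|a, b| a.0.cmp(b.0));
    entries
}

/// Hypothetical helper: entries ordered by count (as in res_n.txt).
/// Assumption: highest count first, ties broken alphabetically.
fn sorted_by_count(counts: &HashMap<String, usize>) -> Vec<(&str, usize)> {
    let mut entries: Vec<(&str, usize)> =
        counts.iter().map(|(w, &n)| (w.as_str(), n)).collect();
    entries.sort_by(|a, b| b.1.cmp(&a.1).then_with(|| a.0.cmp(b.0)));
    entries
}
```

Breaking ties explicitly keeps the output deterministic, since `HashMap` iteration order is not stable between runs.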