/PEFIM

implementation of EFIM algorithm in pyspark

Primary LanguagePythonMIT LicenseMIT

PEFIM

implementation of EFIM algorithm in pyspark

How to run

python3 main.py input.txt output.txt minUtil numPartitions

input.txt - huim dataset
output.txt - output file containing the generated itemsets
minUtil - user specified minimum utility value
numPartitions - number of partitions (parallel)