/apriori

Apriori algorithm implemented in Python

Primary LanguagePython

Apriori

A Python implementation of the Apriori algorithm — a data mining algorithm used to identify patterns in datasets.

Usage

Your dataset can be provided via a text file. Items should be seperated by a semicolon. See categories.txt for an example.

apriori = Apriori(min_support=771, input_filename='categories.txt')
apriori.run()
apriori.write_itemsets_to_file('patterns.txt')

# See frequent itemsets. For each k, keys are patterns and values are absolute support:
print(apriori.frequent_itemset)

# {
#   1 => { 'Pattern 1' => 243, ... },
#   k => { 'Pattern 1;Pattern 2' => 132, ... },
#   ...
# }