This is our implementation of the paper "Interaction-aware Factorization Machines for Recommender Systems", built on Attentional Factorization Machines.
Please cite our AAAI'19 paper if you use our code. Thanks!
The code requires the following packages:
- TensorFlow
- numpy
- scikit-learn
We use the same input format as the LibFM toolkit (http://www.libfm.org/). This instruction uses MovieLens. The MovieLens data has been used for personalized tag recommendation; it contains 668,953 tag applications of users on movies. We convert each tag application (user ID, movie ID, and tag) into a feature vector via one-hot encoding, obtaining 90,445 binary features. The following examples are based on this dataset, which is referred to as ml-tag in file names and in the code. When the dataset is ready, the current directory should look like this:
```
- code
    - IFM.py
    - FM.py
    - LoadData.py
- data
    - ml-tag
        - ml-tag.train.libfm
        - ml-tag.validation.libfm
        - ml-tag.test.libfm
```
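The one-hot conversion described above can be sketched as follows. This is a hypothetical illustration, not the repo's actual preprocessing script: `build_index`, `to_libfm_line`, and the toy IDs are invented for the example. Users, movies, and tags share one global feature index space, which is how the 90,445 binary features arise.

```python
# Hypothetical sketch: converting one (user, movie, tag) tag application
# into a LibFM-format line "target idx:1 idx:1 idx:1".

def build_index(values, offset=0):
    """Map each distinct raw value to a global feature index."""
    return {v: offset + i for i, v in enumerate(sorted(values))}

def to_libfm_line(target, user, movie, tag, user_idx, movie_idx, tag_idx):
    """One tag application -> 'target uidx:1 midx:1 tidx:1'."""
    feats = sorted([user_idx[user], movie_idx[movie], tag_idx[tag]])
    return str(target) + " " + " ".join(f"{f}:1" for f in feats)

# Toy vocabulary; the real dataset has 90,445 features in total.
users = {"u1", "u2"}
movies = {"m1", "m2", "m3"}
tags = {"funny", "classic"}

user_idx = build_index(users)                                        # indices 0..1
movie_idx = build_index(movies, offset=len(user_idx))                # indices 2..4
tag_idx = build_index(tags, offset=len(user_idx) + len(movie_idx))   # indices 5..6

line = to_libfm_line(1, "u1", "m2", "funny", user_idx, movie_idx, tag_idx)
# line == "1 0:1 3:1 6:1"
```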
Use the following commands to train the models with the optimal parameters:
```shell
# step into the code folder
cd code
# train the FM model and save it as a pretrain file
python FM.py --dataset ml-tag --epoch 100 --pretrain -1 --batch_size 4096 --hidden_factor 256 --lr 0.01 --keep 0.3
# train the IFM model using the pretrained weights from FM
python IFM.py --dataset ml-tag --epoch 100 --pretrain 1 --batch_size 4096 --hidden_factor [8,256] --keep [1.0,0.9] --temp 10 --lr 0.1 --lamda_attention1 0.01
# train the INN model using the pretrained weights from FM
python INN.py --dataset ml-tag --epoch 100 --pretrain 1 --batch_size 4096 --hidden_factor [8,256] --keep [1.0,0.9] --temp 10 --lr 0.1 --lamda_attention1 0.01
```
Descriptions of the command-line arguments are given in the code (see the parse_args function).
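For readers without the code at hand, a minimal parse_args sketch that is consistent with the flags used in the commands above might look like this. The defaults and help strings here are illustrative assumptions, not the repo's actual values:

```python
import argparse

def parse_args(argv=None):
    # Sketch of an argument parser covering the flags from the example
    # commands above; defaults are illustrative only.
    parser = argparse.ArgumentParser(description="Run IFM.")
    parser.add_argument('--dataset', default='ml-tag', help='Dataset name under data/.')
    parser.add_argument('--epoch', type=int, default=100, help='Number of training epochs.')
    parser.add_argument('--pretrain', type=int, default=-1,
                        help='-1: train FM from scratch and save; 1: load pretrained FM weights.')
    parser.add_argument('--batch_size', type=int, default=4096, help='Mini-batch size.')
    parser.add_argument('--hidden_factor', default='[8,256]',
                        help='Attention factor and embedding size.')
    parser.add_argument('--keep', default='[1.0,0.9]', help='Dropout keep probabilities.')
    parser.add_argument('--temp', type=float, default=10, help='Softmax temperature.')
    parser.add_argument('--lr', type=float, default=0.1, help='Learning rate.')
    parser.add_argument('--lamda_attention1', type=float, default=0.01,
                        help='L2 regularization on the attention network.')
    return parser.parse_args(argv)

args = parse_args([])  # parse defaults; pass a list of strings to override
```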
The current implementation supports the regression task and optimizes RMSE.
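For reference, the RMSE objective is simply the root mean squared error between predictions and targets; a minimal standalone illustration:

```python
import numpy as np

def rmse(y_true, y_pred):
    """Root mean squared error between targets and predictions."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))

score = rmse([1, 0, 1, 1], [0.9, 0.2, 0.8, 1.0])  # approximately 0.15
```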
A reference implementation of DeepIFM is also released. Usage example:
```python
model_params = {
    "wide_columns": wide_columns,
    "deep_columns": deep_columns,
    "learning_rate": 0.001,
    "batch_norm_decay": 0.999,
    "l2_reg": 0.001,
    "interaction_factor_hidden": 6,
    "attention_size": 16,
    "embedding_size": embedding_size,
    "lamda_factorization": 0.001,
    "lamda_attention": 0.001,
    "dim_to_field": dim_to_field,
    "num_samples": 0.1,
    "temperature": 10,
    "num_field": num_field,
    "deep_layers": '100,75,50',
    "dropout": 0.7,
    "optimizer": 'Adagrad',
    "batch_norm": False
}
sifm_network.SIFMNetwork(
    model_dir=model_dir,
    model_params=model_params,
    config=run_config)
```
`dim_to_field` is a position-to-field map. For example, for a log `a b c d e`, `dim_to_field[2] = 1` means that `c` belongs to field 1.
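The mapping above can be made concrete with a small sketch. The field assignments below are invented for illustration; only the `c` → field 1 example comes from the text:

```python
# Hypothetical dim_to_field: maps each position in the log to a field id.
# Positions 0..4 correspond to the tokens of the log "a b c d e".
dim_to_field = {0: 0, 1: 0, 2: 1, 3: 2, 4: 2}

log = "a b c d e".split()
field_of = {token: dim_to_field[pos] for pos, token in enumerate(log)}

# 'c' sits at position 2, and dim_to_field[2] = 1, so 'c' belongs to field 1.
```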