Dynamic LM KB

Coarse prototype of LM with limited KB access. Assume data/brown.txt. Features are: (top 4 logits, top 4 softmax probs), prediction is (masked word in top 8 predictions). Currently context of 8+8 is given to the BERT model (bert-base-cased). The top 8 accuracy is 53.71%.

./src/create_features.py # first 100000 characters using BERT
./src/classifier.py

Classifier results (threshold p=0.5):

BCE: 0.6183, acc: 66.79%
TP: 41.01%, FP: 20.52%, TN: 25.78%, FN: 12.70%

zouharvi/dynamic-lm-kb

Dynamic LM KB