text-classification

Multi-class classification of short texts.
The model consists of a word-embedding layer, an LSTM (or GRU), and a fully connected layer, implemented in PyTorch.
Mini-batches are built by zero-padding the sequences and are processed as a torch.nn.utils.rnn.PackedSequence.
Training uses cross-entropy loss with the Adam optimizer.
Pretrained word embeddings (GloVe) are supported.
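
The pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the code in this repository: the class name `RNNClassifier`, the layer sizes, the toy token ids, and the learning rate are all assumptions chosen for the example.

```python
import torch
import torch.nn as nn
from torch.nn.utils.rnn import pack_padded_sequence, pad_sequence


class RNNClassifier(nn.Module):
    """Embedding -> LSTM -> fully connected layer (hypothetical sizes)."""

    def __init__(self, vocab_size, embed_dim, hidden_dim, num_classes, pad_idx=0):
        super().__init__()
        # padding_idx keeps the vector for the padding token fixed at zero.
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=pad_idx)
        self.rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.fc = nn.Linear(hidden_dim, num_classes)

    def forward(self, padded_ids, lengths):
        embedded = self.embedding(padded_ids)
        # Packing lets the LSTM skip the padded positions of each sequence.
        packed = pack_padded_sequence(
            embedded, lengths, batch_first=True, enforce_sorted=False
        )
        _, (h_n, _) = self.rnn(packed)
        # h_n[-1] is the final hidden state of the last layer, one row per sentence.
        return self.fc(h_n[-1])


# Toy batch of three sentences with different lengths (token ids are made up).
sentences = [torch.tensor([4, 7, 2, 9]), torch.tensor([5, 3]), torch.tensor([8, 6, 1])]
lengths = torch.tensor([len(s) for s in sentences])
padded = pad_sequence(sentences, batch_first=True, padding_value=0)
labels = torch.tensor([0, 2, 1])

model = RNNClassifier(vocab_size=10, embed_dim=8, hidden_dim=16, num_classes=3)
criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# One training step: forward pass, loss, backward pass, parameter update.
logits = model(padded, lengths)
loss = criterion(logits, labels)
optimizer.zero_grad()
loss.backward()
optimizer.step()
```

Swapping `nn.LSTM` for `nn.GRU` would require unpacking the return value as `_, h_n` instead, since a GRU has no cell state. Pretrained GloVe vectors could be loaded into the embedding layer with `nn.Embedding.from_pretrained`, given a weight tensor whose rows match the vocabulary order.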

Reference

paper:

Learning to Classify Short and Sparse Text & Web with Hidden Topics from Large-scale Data Collections

code:

python preprocess.py

Training data is stored at ./data/aminer_train.tsv as a tab-separated file:

label	sentence
<label>	<sentence>
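
A file in this format could be read along these lines. This is a sketch under assumptions: the helper `read_labeled_tsv` is hypothetical, the sample labels and sentences are invented, and the first line is assumed to be a `label`/`sentence` header row.

```python
import csv
import io


def read_labeled_tsv(handle):
    """Return (label, sentence) pairs from a tab-separated text stream."""
    # QUOTE_NONE treats quote characters inside sentences as ordinary text.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    pairs = []
    for index, row in enumerate(reader):
        if len(row) != 2:
            continue  # skip blank lines and rows without exactly two columns
        if index == 0 and row == ["label", "sentence"]:
            continue  # assumed header row
        pairs.append((row[0], row[1]))
    return pairs


# In-memory sample standing in for the real file; for the actual data, pass
# open("./data/aminer_train.tsv", encoding="utf-8", newline="") instead.
sample = io.StringIO(
    "label\tsentence\n"
    "ml\tdeep learning for text\n"
    "db\tquery optimization\n"
)
examples = read_labeled_tsv(sample)
```

Note that a sentence containing a tab character would produce more than two columns and be skipped by this sketch.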

python main.py

GET Method:

http://166.111.5.228:5012/query/<query>
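
Because the query is part of the URL path, it needs to be percent-encoded before the request is sent. The sketch below shows one way to build such a URL; the helper `build_query_url` and the example query are hypothetical, and it is an assumption that the server decodes percent-encoded path segments.

```python
from urllib.parse import quote

BASE_URL = "http://166.111.5.228:5012/query/"


def build_query_url(query, base_url=BASE_URL):
    """Append the query to the base URL as one percent-encoded path segment."""
    # safe="" also encodes "/" so the query cannot split into extra segments.
    return base_url + quote(query, safe="")


url = build_query_url("graph neural networks")
print(url)  # http://166.111.5.228:5012/query/graph%20neural%20networks
```

The resulting URL can then be fetched with any HTTP client, for example `urllib.request.urlopen(url)`. The response format is not documented here, so it is left out of the sketch.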

|- main.py
|- classify.py
|- [dir] glove (pretrained GloVe word vectors)
|- [dir] data (datasets)
|- [dir] gen (trained models)