DPI data analysis: text mining of search queries and browsing data
Data processing: decode query keywords from UTF-8 or GBK; fetch browsed page content via HTTP GET
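
The decoding step could look roughly like the sketch below. It is not the project's actual code: the function names (decode_bytes, decode_query, fetch_page) are hypothetical, and it assumes that query keywords arrive percent-encoded and that trying UTF-8 first with a GBK fallback is an acceptable heuristic. Some GBK byte sequences happen to be valid UTF-8, so this can occasionally guess wrong.

```python
from urllib.parse import unquote_to_bytes
from urllib.request import urlopen


def decode_bytes(data: bytes) -> str:
    """Decode bytes as UTF-8, falling back to GBK (heuristic, assumed order)."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        # Undecodable GBK bytes are replaced rather than raising an error.
        return data.decode("gbk", errors="replace")


def decode_query(raw: str) -> str:
    """Percent-decode a query keyword, then decode the resulting bytes.

    Note: unquote_to_bytes does not turn '+' into a space.
    """
    return decode_bytes(unquote_to_bytes(raw))


def fetch_page(url: str, timeout: float = 5.0) -> str:
    """Fetch a browsed page with HTTP GET and decode its body the same way."""
    with urlopen(url, timeout=timeout) as resp:
        return decode_bytes(resp.read())


if __name__ == "__main__":
    print(decode_query("%E4%B8%AD%E6%96%87"))  # UTF-8 encoded keyword
    print(decode_query("%D6%D0%CE%C4"))        # same keyword, GBK encoded
```

Trying UTF-8 first is deliberate: most GBK-encoded Chinese text is not valid UTF-8, so the strict UTF-8 decode fails quickly and the fallback takes over.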
Algorithms:
1. Bigrams with TF-IDF weighting: sparse features, fed to logistic regression
2. paragraph2vec: dense features, fed to XGBoost or a neural network
3. Character-level CNN (char-CNN) on raw text, NIPS 2015
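
As an illustration of item 1, here is a minimal standard-library sketch of bigram TF-IDF features. It makes several assumptions not stated above: bigrams are taken over characters (a common choice for Chinese queries, though word bigrams are equally plausible), term frequency is normalized by document length, and IDF is the unsmoothed log(N / df). The function names are hypothetical.

```python
import math
from collections import Counter


def char_bigrams(text: str) -> list:
    """Return overlapping character bigrams (assumed tokenization)."""
    return [text[i:i + 2] for i in range(len(text) - 1)]


def tfidf_features(docs: list) -> list:
    """Map each document to a sparse {bigram: tf-idf weight} dict."""
    counts = [Counter(char_bigrams(d)) for d in docs]

    # Document frequency: number of documents containing each bigram.
    df = Counter()
    for c in counts:
        df.update(c.keys())

    n = len(docs)
    features = []
    for c in counts:
        total = sum(c.values()) or 1  # avoid dividing by zero for short texts
        features.append(
            {g: (k / total) * math.log(n / df[g]) for g, k in c.items()}
        )
    return features


if __name__ == "__main__":
    for row in tfidf_features(["abc", "abd"]):
        print(row)
```

With unsmoothed IDF, a bigram that occurs in every document gets weight 0. Each dict is a sparse feature vector that can be converted to a sparse matrix before it is passed to a logistic regression implementation.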