/dpiDataAnalysis

dpi data analysis, textual mining of queries and browse data

Primary LanguagePython

dpiDataAnalysis

dpi data analysis, textual mining of queries and browse data

data processing: utf-8 and gbk decode of query key words, http get of browse content

algorithms:

1.bigram,tf-idf weighting, sparse features corresponding to Logistic Regression

2.paragraph2vec,dense features corresponding to xgboost or Neural Network

3.char-cnn of text data,NIPS2015