This project is my research group project, and it is also a study of TensorFlow, Deep Learning(CNN, RNN, LSTM, etc.).
The main objective of the project is to solve the multi-label text classification problem based on Convolutional Neural Networks. Thus, the format of the data label is like [0, 1, 0, ..., 1, 1] according to the characteristics of such problem.
- Python 3.x
- Tensorflow 1.0.0 +
- Numpy
- Gensim
Research data may attract copyright protection under China law. Thus, there is only code.
实验数据属于实验室与某公司的合作项目,涉及商业机密,在此不予提供,还望谅解。
- Make the data support Chinese and English.(Which use
gensim
seems easy) - Can use your own pre-trained word vectors.
- Add a new Highway Layer.
- Add parent label bind to limit the output of the prediction label.
- Can choose train the model directly or restore the model from checkpoint.
- Add model test code.
Use gensim
package to pre-train my data.
References:
References:
- Convolutional Neural Networks for Sentence Classification
- A Sensitivity Analysis of (and Practitioners' Guide to) Convolutional Neural Networks for Sentence Classification
References:
References:
References:
黄威,Randolph
SCU SE Bachelor; USTC CS Master
Email: chinawolfman@hotmail.com
My Blog: randolph.pro
LinkedIn: randolph's linkedin