chenergy1991

:-)

Pinned Repositories

BirthdayCard
Language:Java0 2 00
chenergy1991.github.io
My blog~~
Language:HTML0 2 00
Chinese-Text-Classification-Based-on-Naive-Bayes
The development of computer and communications technology has resulted in huge amount of data. The automatic text classification technique has become very significant. Naive Bayes algorithm is based on probabilistic model. It is an effective way to deal with automatic text classification. The main task of this paper is to discuss the theoretical basis of Naive Bayes text classifier and describe the process of using Java language to accomplish the classifier. We can divide the classifier into two parts: the feature extraction and the calculation according to the feature. In the feature extraction part, I use the Chinese word segmentation method and the stop words filtering. In the classification part, I calculate the prior probability, the likelihood function value and the maximum a posterior estimation. During the simple test, the author uses the Sogou laboratory’s text classification corpus as the training set and the test set. During the test, the accuracy is between 39% to 56 %. The results show that there is still room for improvement. The paper also includes the discussion of its improvement methods and wider application.
Language:Java6 2 02
DetectionScript
Language:Python0 2 00
docker-network-graph
Quickly visualize docker networks with graphviz.
Language:Python0 1 00
docs
documents worth spreading
0 2 00
P_QQ_Management
Language:Python0 2 00
SDPcontroller
Control Module for Software Defined Perimeter (SDP)
Language:JavaScript0 2 00
WebSecurityTestcases
Language:Java0 2 00
Youku-Android-APP-Sniffer
Language:Python2 2 00

chenergy1991's Repositories

chenergy1991/Chinese-Text-Classification-Based-on-Naive-Bayes
The development of computer and communications technology has resulted in huge amount of data. The automatic text classification technique has become very significant. Naive Bayes algorithm is based on probabilistic model. It is an effective way to deal with automatic text classification. The main task of this paper is to discuss the theoretical basis of Naive Bayes text classifier and describe the process of using Java language to accomplish the classifier. We can divide the classifier into two parts: the feature extraction and the calculation according to the feature. In the feature extraction part, I use the Chinese word segmentation method and the stop words filtering. In the classification part, I calculate the prior probability, the likelihood function value and the maximum a posterior estimation. During the simple test, the author uses the Sogou laboratory’s text classification corpus as the training set and the test set. During the test, the accuracy is between 39% to 56 %. The results show that there is still room for improvement. The paper also includes the discussion of its improvement methods and wider application.
Language:Java6 2 02
chenergy1991/Youku-Android-APP-Sniffer
Language:Python2 2 00
chenergy1991/BirthdayCard
Language:Java0 2 00
chenergy1991/chenergy1991.github.io
My blog~~
Language:HTML0 2 00
chenergy1991/DetectionScript
Language:Python0 2 00
chenergy1991/docker-network-graph
Quickly visualize docker networks with graphviz.
Language:Python0 1 00
chenergy1991/docs
documents worth spreading
0 2 00
chenergy1991/P_QQ_Management
Language:Python0 2 00
chenergy1991/SDPcontroller
Control Module for Software Defined Perimeter (SDP)
Language:JavaScript0 2 00
chenergy1991/WebSecurityTestcases
Language:Java0 2 00

chenergy1991

Pinned Repositories

BirthdayCard

chenergy1991.github.io

Chinese-Text-Classification-Based-on-Naive-Bayes

DetectionScript

docker-network-graph

docs

P_QQ_Management

SDPcontroller

WebSecurityTestcases

Youku-Android-APP-Sniffer

chenergy1991's Repositories

chenergy1991/Chinese-Text-Classification-Based-on-Naive-Bayes

chenergy1991/Youku-Android-APP-Sniffer

chenergy1991/BirthdayCard

chenergy1991/chenergy1991.github.io

chenergy1991/DetectionScript

chenergy1991/docker-network-graph

chenergy1991/docs

chenergy1991/P_QQ_Management

chenergy1991/SDPcontroller

chenergy1991/WebSecurityTestcases