/webHostSpotCap

通过分词 - 聚类 - 提取的思路,挖掘医学论坛中的热点信息

Primary LanguagePythonApache License 2.0Apache-2.0

webHostSpotCap

通过分词 - 聚类 - 提取的思路,挖掘医学论坛中的热点信息

请运行 Capture.py 而不是testMain.py 测试这个程序... 程序还在开发,现在的结果还有点醉人... 不信你自己跑跑看

For International users:

This project (maybe just a demo) based on the idea of word split-> cluster -> tag extrac, quite simple... But it only works with CHINESE!

If you wanna test this stupid project (or, demo) for fun, please run python Capture.py in your PC or Mac or any other stuff which can run Python.

Result file is so stupid now.

Relied libs: jieba, pandas, scikit-learn