/TBBT-transcripts-wordcloud-NLP

The wordcloud and NLP anlysis of The Big Bang Theory

Primary LanguagePython

TBBT-transcripts-wordcloud-NLP

The wordcloud and NLP anlysis of The Big Bang Theory, including transcripts and how to get them. Just a learning project.

《生活大爆炸》台词的词云和NLP分析,同时还包含台词文本和获取方式(爬虫代码),仅仅是个人兴趣的学习项目。

Thanks for the efforts of the founder whose website is my crawler source.

感谢该网站对《生活大爆炸》台词的收集,没有这些台词就不可能有这个项目。

Code in this repository accompanies with the CSDN blog(in Chinese).

代码和思路解析详见博客

Requirements:

  • Python 3.x
  • scipy
  • urllib
  • lxml
  • wordcloud
  • nltk

wordcloud