webing-crawlers
drawWordCloud.py
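A script that analyzes the body summaries of news articles found for a search keyword, extracts their morphemes (the smallest meaningful units of language), and renders them as a word cloud based on frequency.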
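
The frequency step described above can be sketched as follows. This is a minimal illustration, not the script's actual code: the function name `top_keywords`, the sample tokens, and the single-character filter are all assumptions, and the input is taken to be already split into morphemes (in practice, by a morphological analyzer such as one of the konlpy taggers).

```python
# -*- coding: utf-8 -*-
from collections import Counter


def top_keywords(morphemes, n):
    """Return the n most frequent morphemes as (morpheme, count) pairs."""
    # Skip single-character tokens, which are often particles or noise.
    # This filter is an assumption of the sketch, not a documented behavior.
    counts = Counter(m for m in morphemes if len(m) > 1)
    return counts.most_common(n)


# Hypothetical, already-tokenized input standing in for analyzer output.
sample = [u"대통령", u"오바마", u"대통령", u"미국", u"오바마", u"대통령", u"은"]
top = top_keywords(sample, 2)
```

The resulting pairs are the kind of data a word cloud renderer can scale font sizes from: the higher the count, the larger the word.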
###Requirements
The installation steps assume that you have the following things installed:
- Python 2.7
- Java 1.6+
- pip and setuptools Python packages. Nowadays pip requires and installs setuptools if not installed. Python 2.7.9 and later include pip by default, so you may have it already.
- lxml. Most Linux distributions ship prepackaged versions of lxml. Otherwise, refer to http://lxml.de/installation.html
- OpenSSL. This comes preinstalled in all operating systems, except Windows where the Python installer ships it bundled.

With those in place, install the remaining dependencies:

$ pip install JPype1  # dependency for konlpy
$ pip install konlpy  # a Python package for natural language processing (NLP) of the Korean language
$ brew install sdl sdl_image sdl_mixer sdl_ttf portmidi
$ brew tap samueljohn/python
$ brew install pygame
$ pip install git+https://github.com/e9t/PyTagCloud.git
###Installation & run
$ git clone https://github.com/we-bing/webing-crawlers.git
$ cd webing-crawlers
$ python drawWordCloud.py "<query for keywords>" <number of keywords>
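
For orientation, the two positional arguments could be read along these lines. This is only a sketch under assumptions: the names `parse_args`, `query`, and `count` are hypothetical, and the real script may parse its arguments differently (for example, directly from `sys.argv`).

```python
# -*- coding: utf-8 -*-
import argparse


def parse_args(argv=None):
    """Parse the search query and the number of keywords to draw."""
    parser = argparse.ArgumentParser(
        description="Draw a word cloud from news search results.")
    # Both argument names below are hypothetical, chosen for this sketch.
    parser.add_argument("query", help="search query for news articles")
    parser.add_argument("count", type=int,
                        help="number of keywords to include in the cloud")
    return parser.parse_args(argv)


# Equivalent to: python drawWordCloud.py "오바마 대통령" 10
args = parse_args(["오바마 대통령", "10"])
```

Quoting the query on the command line keeps a multi-word search such as "오바마 대통령" together as a single argument.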
####Examples
$ python drawWordCloud.py "오바마 대통령" 10
$ cat wordcloud.csv
####Reference