/textmining

Text mining resources and experiences.

Primary language: HTML. License: Apache License 2.0 (Apache-2.0).

Getting started

There is currently one test, which unpacks the output of a manual search of TheLens. The search returned about 100 patents, of which about 80 have a textual "description".

text_analytics dir

Contains the whole of the scikit-learn tutorial. To load the data, run:

 cd text_analytics/data/languages
 python -m fetch_data

This downloads the data.
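
To check that the download worked, a short script can count the files in each subfolder of the data directory. This is only a minimal sketch: the folder name `paragraphs` and the one-subfolder-per-language layout are assumptions about what `fetch_data` produces, and `count_documents` is a hypothetical helper, not part of this repository or of scikit-learn.

```python
from pathlib import Path


def count_documents(root):
    """Return {subfolder name: number of regular files} for each
    immediate subdirectory of `root`. Files directly in `root` are ignored."""
    counts = {}
    for sub in sorted(Path(root).iterdir()):
        if sub.is_dir():
            counts[sub.name] = sum(1 for f in sub.iterdir() if f.is_file())
    return counts


if __name__ == "__main__":
    # "paragraphs" is an assumed folder name; adjust it to whatever
    # fetch_data actually created in this directory.
    data_dir = Path("paragraphs")
    if data_dir.is_dir():
        for label, n in count_documents(data_dir).items():
            print(f"{label}: {n} files")
    else:
        print(f"{data_dir} not found; run fetch_data first")
```

Run it from the same directory as `fetch_data`. An empty result or a "not found" message suggests that the download did not complete or that the data lives under a different folder name.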