Code repository for Natural Language Processing Python and NLTK
##What You Will Learn:
- Get a glimpse of the complexity of natural languages and how they are processed by machines
- Clean and wrangle text using tokenization and chunking to help you better process data
- Tokenize text into sentences, and sentences into words
- Classify text and perform sentiment analysis
- Implement string matching algorithms and normalization techniques
- Understand and implement the concepts of information retrieval and text summarization
- Find out how to implement various NLP tasks in Python
Chapter number | Software required (with version) | Download links to the software | Hardware specifications | OS required |
---|---|---|---|---|
1-5 | Python/Anaconda NLTK | https://www.python.org/, http://continuum.io/downloads, http://www.nltk.org/ | Common Unix Printing System | any |
6 | scikit-learn and gensim | http://scikit-learn.org/stable/, https://radimrehurek.com/gensim/ | Common Unix Printing System | any |
7 | Scrappy | http://scrapy.org/ | Common Unix Printing System | any |
8 | NumPy, SciPy, pandas, and matplotlib | http://www.numpy.org/, http://www.scipy.org/, http://pandas.pydata.org/, http://matplotlib.org/ | Common Unix Printing System | any |
9 | Twitter Python APIs and Facebook python APIs | https://dev.twitter.com/overview/api/twitter-libraries, https://developers.facebook.com | Common Unix Printing System | any |
| Chapter number | Software required (with version) | Free/Proprietary | Download links to the software | | -------------- | -------------- |-------------- |-------------- |-------------- | | 1 | NLTK>=3.0a4, NLTK Data | Free | http://www.nltk.org, http://www.nltk.org/data.html | | 2 | pyenchant>=1.6.5 | Free | http://pythonhosted.org/pyenchant/ | | 3 | lockfile>=0.9.1, MongoDB >= 2.6, pymongo>=2.6.3 | Free | https://pypi.python.org/pypi/lockfile, http://www.mongodb.org/, https://pypi.python.org/pypi/pymongo/ | | 4 | NLTK-Trainer >= 0.9 | Free | https://github.com/japerk/nltk-trainer | | 7 | scikit-learn>=0.14.1 | Free |http://scikit-learn.org/stable/ | | 8 | Redis >= 2.8, redis>=2.8.0 , execnet>=1.1 | Free | http://redis.io/, https://pypi.python.org/pypi/redis/, https://codespeak.net/execnet/ | | 9 | python-dateutil>=2.0, beautifulsoup4>=4.3.2, lxml>=3.2.3, charade>=1.0.3 | Free | http://labix.org/python-dateutil, http://www.crummy.com/software/BeautifulSoup/, http://lxml.de/, https://pypi.python.org/pypi/charade |
Chapter number | Software required (with version) | Hardware Specifications | OS required |
---|---|---|---|
All chapters | Python 2.7 or 3.2+ | Install NLTK 3.0 either on 32-bit or 64-bit machine | Windows or Mac/Unix |
###Note Modules 1, 2 and 3 have code arranged by chapter (for the chapters that have code). Click here if you have any feedback or suggestions.