/NLP

Natural Language Processing using Python

Primary LanguageJupyter Notebook

💻 Natural Language Processing

Natural Language Processing (NLP) using Python

Text Preprocessing

  1. Sentence Tokenization
  2. Word Tokenization
  3. Stemming
  4. Lemmatization
  5. Stopwords removal
  6. Part of speech tagging

Installation

  1. Make sure python3 and git is installed. Install Python : From official website

Additional helpful commands

Install Python :

From official website

To check version of Python :

py --version
python --version

Install Natural Language ToolKit :

py -m pip install nltk

Create a virtual environment :

py -m venv blog_app

To activate virtual-environment :

.\Scripts\activate

To deactivate virtual-environment :

.\Scripts\deactivate

To check python is used in virtualenv :

pip -V

If you are running the virtual env, it'll show the path to the environment's location.

To install a package in virtual environment :

py -m pip install package_name

Touch alternative in windows powershell :

$null > file_name.cpp

Configure GIT :

git config --global user.email "user_mail"
git config --global user.name "user_name"
NOTE: Make .gitignore file after initialising the GIT repository

To checkout a particular branch:

git checkout <branch-name>

To list all the branches in the git repository:

git branch -a

To lists all the modules installed for the project:

py -m pip freeze
py -m pip list

To merge a branch into master branch:

Change to master branch

git checkout master
git merge <branch-name>

To see branch graphs in git:

git log --graph --oneline --decorate --all

To see all remotes

git remote -v

To push all branches to remote repository

git push --all <remote-origin>

To add a new remote to push changes to

git remote add new-remote-origin-name https://github.com/user_name/repo.git

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

MIT