/NLP-Natural-Language-Processing

Projects and useful articles / links

Primary LanguageJupyter NotebookMIT LicenseMIT

Hits

Constantly updated. Subscribe not to miss anything.

  • For full Data Science tasks, materials, etc. please check Data Science repository.
  • For Deep Learning algorithms please check Deep Learning repository.

💠 Natural Language Processing Tasks

Folders with all materials for specific task/domain

🎓 Courses

📚 Books

🎥 YouTube

Title Description
ACL at Vimeo
[Stanford CS224N: Natural Language Processing with Deep Learning Winter 2021](https://www.youtube.com/playlist?list=PLoROMvodv4rOSH4v6133s9LFPRHjEmbmJ)
Natural Language Processing (NLP) Zero to Hero by TensorFlow
Zero to Hero: NLP with Tensorflow and Keras (GDG Sofia meetup)
Natural Language Processing This content is based on Machine Learning University (MLU) Accelerated Natural Language Processing class. Slides, notebooks and datasets are available on GitHub

🖥️ Web Sites

📰 Atricles

:octocat: GitHub Repositories

Title Description
NVIDIA Deep Learning Examples for Tensor Cores - Natural Language Processing Deep Learning Examples
Natural Language Processing with Transformers Notebooks and materials for the O'Reilly book "Natural Language Processing with Transformers"
Awesome NLP References A curated list of resources dedicated to Knowledge Distillation, Recommendation System, especially Natural Language Processing (NLP)
NLP - Tutorial
Natural Language-Process Tutorials
NLP with Python Scikit-Learn, NLTK, Spacy, Gensim, Textblob and more...
NLP and Data Science GitHub Repository Spotlight Daily spotlights of some underrated NLP and Data Science GitHub repositories.
NLP 101: a Resource Repository for Deep Learning and Natural Language Processing This document is drafted for those who have enthusiasm for Deep Learning in natural language processing. If there are any good recommendations or suggestions, I will try to add more.
NLP-progress Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Hugging Face Public repo for HF blog posts
AllenNLP An open-source NLP research library, built on PyTorch. Allenai.org

🗣️ Conferences

  • NeurIPS - Neural Information Processing Systems
  • ACL - ACL Home Association for Computational Linguistics
    • ACL Anthology - The ACL Anthology currently hosts 80890 papers on the study of computational linguistics and natural language processing.

🛠️ Tools

NLP libraries, frameworks, modules

Title Description
Natural Language Toolkit (NLTK) NLTK - the Natural Language Toolkit - is a suite of open source Python modules, data sets, and tutorials supporting research and development in Natural Language Processing.
flair

    A very simple framework for state-of-the-art Natural Language Processing (NLP).

    Flair is:

  • A powerful NLP library. Flair allows you to apply our state-of-the-art natural language processing (NLP) models to your text, such as named entity recognition (NER), part-of-speech tagging (PoS), special support for biomedical data, sense disambiguation and classification, with support for a rapidly growing number of languages.
  • A text embedding library. Flair has simple interfaces that allow you to use and combine different word and document embeddings, including our proposed Flair embeddings, BERT embeddings and ELMo embeddings.
  • A PyTorch NLP framework. Our framework builds directly on PyTorch, making it easy to train your own models and experiment with new approaches using Flair embeddings and classes.
textacy textacy is a Python library for performing a variety of natural language processing (NLP) tasks, built on the high-performance spaCy library. With the fundamentals --- tokenization, part-of-speech tagging, dependency parsing, etc. --- delegated to another library, textacy focuses primarily on the tasks that come before and follow after.
AllenNLP NLP research library, built on PyTorch, for developing state-of-the-art deep learning models on a wide variety of linguistic tasks.
NLPGym NLPGym is a toolkit to bridge the gap between applications of RL and NLP. This aims at facilitating research and benchmarking of DRL application on natural language processing tasks. The toolkit provides interactive environments for standard NLP tasks such as sequence tagging, question answering, and sequence classification.
Gensim

👩🏻‍🏫 Tutorials

Title Description
nlp-tutorial

    Natural Language Processing Tutorial for Deep Learning Researchers

    nlp-tutorial is a tutorial for who is studying NLP(Natural Language Processing) using Pytorch. Most of the models in NLP were implemented with less than 100 lines of code.(except comments or blank lines)

Natural Language Processing in Python Tutorial comparing stand up comedians using natural language processing

🔝 Leaderboard for NLP

💠 Deep Learning architectures for NLP

🔹 100 Must-Read NLP Papers

🔹 Sci-Hub(Papers)

🔹 Stanford, NLP Seminar Schedule

🔹 CS224n: Natural Language Processing with Deep Learning

🔹 CIS 700-008 - Interactive Fiction and Text Generation

🔹 Harvard NLP

🔹 The Illustrated Transformer


🔺 Projects

Question Answering System using BiDAF Model on SQuAD

Implemented a Bidirectional Attention Flow neural network as a baseline on SQuAD, improving Chris Chute's model implementation, adding word-character inputs as described in the original paper and improving GauthierDmns' code.

Useful Articles

Some Concepts

  • Selectional Preference - (Katz and Fodor, 1963; Wilks, 1975; Resnik, 1993) are the tendency for a word to semantically select or constrain which other words may appear in a direct syntactic relation with it." In case this selection is expressed in binary term (allowed/not-allowed), it is also called selectional restriction (Séaghdha and Korhonen, 2014). SP can be contrasted with verb subcategorization "with subcategorization describing the syntactic arguments taken by a verb, and selectional preferences describing the semantic preferences verbs have for their arguments" (Van de Cruys et al., 2012)
  • Selectional Restrictions -