deswire's Stars
adbar/trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
juletx/corpus-linguistics
Corpus Linguistics slides, labs, assignments and data
notesjor/corpusexplorer2.0
Korpuslinguistik war noch nie so einfach...
notesjor/CorpusExplorer.Terminal.Console
Erlaubt anderen Programmen/Programmiersprachen den Zugriff auf Analysen/Daten des CorpusExplorer v2.0
sloria/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
nltk/nltk
NLTK Source
allenai/allennlp
An open-source NLP research library, built on PyTorch.
piskvorky/gensim
Topic Modelling for Humans
keon/awesome-nlp
:book: A curated list of resources dedicated to Natural Language Processing (NLP)
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools