n-grams
There are 210 repositories under n-grams topic.
patrickschur/language-detection
A language detection library for PHP. Detects the language from a given text string.
komodojp/tinyld
Simple and Performant Language detection library for NodeJS
andreekeberg/ml-classify-text-js
Machine learning based text classification in JavaScript using n-grams and cosine similarity
gpoulter/python-ngram
Python Set subclass that supports searching by ngram similarity
ezgisubasi/turkish-tweets-sentiment-analysis
This sentiment analysis project determines whether the tweets posted in the Turkish language on Twitter are positive or negative.
Data-on-the-Mind/2017-summer-workshop
Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
tienthanhdhcn/Vietnamese-Accent-Prediction
A simple/fast/accurate accent prediction for non-accented Vietnamese text
beowolx/haiku-search
Haiku-Search is a high-performance fuzzy search library designed for web applications. It is built using Rust and compiled to WebAssembly
KhaledAshrafH/Auto-Filling-Text
This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.
parvvaresh/Evaluation-of-machine-translation-by-NLP
To evaluate machine translation, they use several methods, some of which we fully implemented
ZavierYang/N-gram-model-for-Hangman-game
Use different orders of N-gram model to play Hangman game.
stathwang/POS-Taggers
Part-of-Speech Tagging Models in Python
DanielJohnBenton/Ngrams.java
:cake: A library for creating n-grams, skip-grams, bag of words, bag of n-grams, bag of skip-grams.
dohliam/hawaiian-corpus
Data from a corpus of written Hawaiian
go-generalize/volcago
Model Generator for Firestore
ltfschoen/AIND-Recognizer
Term 1 Project 3 Design a Sign Language Recognition System by Luke Schoen for Udacity Artificial Intelligence Nanodegree (AIND)
evan-l-munson/saotd
Sentiment Analysis of Twitter Data (saotd)
Al00X/LanguageDetector
Detect language from a text string in Swift!
chattylabs/language-detector
Package to detect the language of a given text (focusing on short "sms" type text used on tweets, facebook, WhatsApp, etc)
FutureComputing4AI/KiloGrams
KiloGram algorithm for finding the top-k most frequent n-grams for large values of n quickly with fixed memory.
abhinav-upadhyay/nbspell
New spell(1) implementation for NetBSD
drupol/phpngrams
Get N-Grams !
fredriko/metacurate-lexicon
A web service that exposes semantic similarity search via a web GUI and a RESTful API.
justachetan/nlp
Code written as a part of assignments for CSE556 Natural Language Processing taught by Dr. Tanmoy Chakraborty at IIIT Delhi in Monsoon 2018
hb20007/greek-dialect-classifier
Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek
vgherard/kgrams
k-grams, Language Models, and All That
cvcio/go-plagiarism
Plagiarism detection using stopwords n-grams
DanielJohnBenton/TownsAndVillages
:european_castle: Mapping British place names and other analysis
lemonzi/DateMyMusic
Predict the composition year of a given MIDI piece - Classical Music Hack Day 2013 @ Vienna. Live at:
OnlyChristmas/SMP-ETST-2018
SMP_ETST 2018 christmas
go-generalize/firestore-repo
Model Generator for Firestore(move to volcago)
keya-desai/Natural-Language-Processing
Python implementation of N-gram Models, Log linear and Neural Linear Models, Back-propagation and Self-Attention, HMM, PCFG, CRF, EM, VAE
spChalk/Vaccine-Sentiment-Classifier
:syringe: Vaccine Sentiment Classifier is a deep learning classifier trained on real world twitter data, that distinguishes 3 types of tweets: Neutral, Anti-vax & Pro-vax.
vaitybharati/Assignment-11-Text-Mining-01-Elon-Musk
Assignment-11-Text-Mining-01-Elon-Musk, Perform sentimental analysis on the Elon-musk tweets (Exlon-musk.csv), Text Preprocessing: remove both the leading and the trailing characters, removes empty strings, because they are considered in Python as False, Joining the list into one string/text, Remove Twitter username handles from a given twitter text. (Removes @usernames), Again Joining the list into one string/text, Remove Punctuation, Remove https or url within text, Converting into Text Tokens, Tokenization, Remove Stopwords, Normalize the data, Stemming (Optional), Lemmatization, Feature Extraction, Using BoW CountVectorizer, CountVectorizer with N-grams (Bigrams & Trigrams), TF-IDF Vectorizer, Generate Word Cloud, Named Entity Recognition (NER), Emotion Mining - Sentiment Analysis.
yoraghav/Automated_Hangman
Uses letter frequency and catboost classifier model in synchronous for guessing letters in hangman game instance. The model performance is evaluated on both seen words in the dictionary and words out of the dictionary.