n-grams

There are 222 repositories under the n-grams topic.

  • patrickschur/language-detection

    A language detection library for PHP. Detects the language from a given text string.

    Language: PHP
  • komodojp/tinyld

    Simple and Performant Language detection library for NodeJS

    Language: TypeScript
  • andreekeberg/ml-classify-text-js

    Machine learning based text classification in JavaScript using n-grams and cosine similarity

    Language: JavaScript
  • gpoulter/python-ngram

    Python Set subclass that supports searching by ngram similarity

    Language: Python
  • ezgisubasi/turkish-tweets-sentiment-analysis

    This sentiment analysis project determines whether the tweets posted in the Turkish language on Twitter are positive or negative.

    Language: Jupyter Notebook
  • Data-on-the-Mind/2017-summer-workshop

    Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)

    Language: HTML
  • tienthanhdhcn/Vietnamese-Accent-Prediction

    A simple/fast/accurate accent prediction for non-accented Vietnamese text

    Language: Java
  • beowolx/haiku-search

    Haiku-Search is a high-performance fuzzy search library designed for web applications. It is built using Rust and compiled to WebAssembly

    Language: Rust
  • dohliam/hawaiian-corpus

    Data from a corpus of written Hawaiian

  • ItrcAiLabs/eval_mt

    Implementations of several of the methods used to evaluate machine translation

    Language: Python
  • KhaledAshrafH/Auto-Filling-Text

    This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.

    Language: Python
  • ZavierYang/N-gram-model-for-Hangman-game

    Use different orders of N-gram model to play Hangman game.

    Language: Python
  • go-generalize/volcago

    Model Generator for Firestore

    Language: Go
  • stathwang/POS-Taggers

    Part-of-Speech Tagging Models in Python

    Language: Python
  • DanielJohnBenton/Ngrams.java

    🍰 A library for creating n-grams, skip-grams, bag of words, bag of n-grams, and bag of skip-grams.

    Language: Java
  • ltfschoen/AIND-Recognizer

    Term 1 Project 3 Design a Sign Language Recognition System by Luke Schoen for Udacity Artificial Intelligence Nanodegree (AIND)

    Language: Jupyter Notebook
  • Al00X/LanguageDetector

    Detect language from a text string in Swift!

    Language: Swift
  • evan-l-munson/saotd

    Sentiment Analysis of Twitter Data (saotd)

    Language: R
  • chattylabs/language-detector

    Package to detect the language of a given text (focusing on short "SMS"-type text used in tweets and on Facebook, WhatsApp, etc.)

    Language: JavaScript
  • abhinav-upadhyay/nbspell

    New spell(1) implementation for NetBSD

    Language: C
  • FutureComputing4AI/KiloGrams

    KiloGram algorithm for finding the top-k most frequent n-grams for large values of n quickly with fixed memory.

    Language: Java
  • drupol/phpngrams

    Get N-Grams !

    Language: PHP
  • fredriko/metacurate-lexicon

    A web service that exposes semantic similarity search via a web GUI and a RESTful API.

    Language: Python
  • hb20007/greek-dialect-classifier

    Classifier that identifies Greek text as Cypriot Greek or Standard Modern Greek

    Language: Jupyter Notebook
  • justachetan/nlp

    Code written as a part of assignments for CSE556 Natural Language Processing taught by Dr. Tanmoy Chakraborty at IIIT Delhi in Monsoon 2018

    Language: Jupyter Notebook
  • cvcio/go-plagiarism

    Plagiarism detection using stopwords n-grams

    Language: Go
  • vgherard/kgrams

    k-grams, Language Models, and All That

    Language: R
  • DanielJohnBenton/TownsAndVillages

    🏰 Mapping British place names and other analysis

    Language: Python
  • lemonzi/DateMyMusic

    Predict the composition year of a given MIDI piece - Classical Music Hack Day 2013 @ Vienna. Live at:

    Language: JavaScript
  • OnlyChristmas/SMP-ETST-2018

    SMP_ETST 2018 christmas

    Language: Python
  • keya-desai/Natural-Language-Processing

    Python implementation of N-gram Models, Log linear and Neural Linear Models, Back-propagation and Self-Attention, HMM, PCFG, CRF, EM, VAE

    Language: Python
  • spChalk/Vaccine-Sentiment-Classifier

    💉 Vaccine Sentiment Classifier is a deep learning classifier trained on real-world Twitter data that distinguishes three types of tweets: Neutral, Anti-vax, and Pro-vax.

    Language: Jupyter Notebook
  • t184256/ngram-keylogger

    ngram-keylogger: typing stats that don't leak passwords

    Language: Python
  • vaitybharati/Assignment-11-Text-Mining-01-Elon-Musk

    Assignment-11-Text-Mining-01-Elon-Musk: performs sentiment analysis on the Elon Musk tweets (Exlon-musk.csv). Text preprocessing: strip leading and trailing characters; remove empty strings (Python treats them as False); join the list into one string; remove Twitter username handles (@usernames); join the list into one string again; remove punctuation; remove https links and other URLs; tokenize; remove stopwords; normalize the data; stemming (optional); lemmatization. Feature extraction: BoW CountVectorizer, CountVectorizer with n-grams (bigrams and trigrams), TF-IDF vectorizer. Also generates a word cloud and performs named entity recognition (NER) and emotion mining (sentiment analysis).

    Language: Jupyter Notebook
  • yoraghav/Automated_Hangman

    Uses letter frequency together with a CatBoost classifier model to guess letters in a Hangman game instance. Model performance is evaluated both on words seen in the dictionary and on words outside it.

    Language: Jupyter Notebook