n-gram
There are 80 repositories under n-gram topic.
zhezhaoa/ngram2vec
Four word embedding models implemented in Python. Supporting arbitrary context features
rockymadden/stringmetric
:dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).
adrg/strutil
Go metrics for calculating string similarity and other string utility functions
andreekeberg/ml-classify-text-js
Machine learning based text classification in JavaScript using n-grams and cosine similarity
words/n-gram
Get n-grams from text
lucidrains/n-grammer-pytorch
Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch
princeton-nlp/calm-textgame
[EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games
inspiringz/XDet
基于机器学习的 Webshell 检测 (OPCode -> N-Gram -> TF-IDF -> XGBoost)
chao1224/n_gram_graph
N-Gram Graph: Simple Unsupervised Representation for Graphs, NeurIPS'19 (https://arxiv.org/abs/1806.09206)
ajenhl/tacl
Tool for performing basic text analysis on the CBETA corpus
wooorm/trigrams
Trigram files for 400+ languages
Gabriel-Ciriaco/super-profanity
An advanced profanity filter based on phonetics and normal detection that detects swears in 27 different languages!
KhaledAshrafH/Auto-Filling-Text
This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.
AyushSoni86/n-gram-model
This project contains n-gram generator which is build using HTML, CSS and Javascript
cyclone-github/spider
URL Spider - web crawler and wordlist / ngram generator
corazzon/inflearn-new-year-event-2020
기획자와 마케터를 위한 이벤트 댓글 분석 - feat. 인프런 새해 다짐 이벤트
Jed-Z/ngram-text-prediction
用 n-gram 语言模型进行新闻文本内容预测。https://www.jeddd.com/article/python-ngram-language-prediction.html
ServiceNow/bytesteady
A fast classification and tagging tool using byte-level n-gram embeddings. bytesteady is a ServiceNow Research project that was started at Element AI.
fayrose/MiddleEgyptianDataset
Parses 3 dictionaries from PDFs, reconstructs lost formatting using N-gram and visual computing methods, and serializes to a database for web display.
enix223/nlp-course
NLP Course stuff and algorithm implementations
fredriko/metacurate-lexicon
A web service that exposes semantic similarity search via a web GUI and a RESTful API.
chika-kasymov/WordSuggestion
A simple Objective-C implementation of word suggestion using n-gram probabilistic model
aallam/string-similarity-kotlin
Kotlin multiplatform library offering various algorithms to measure string similarity and distance
GatoY/Spelling-Correction-Methods
Comparison among four spelling correction methods. n-gram, Levenshtein, Jaro, Jaro_winkler
go-generalize/firestore-repo
Model Generator for Firestore(move to volcago)
t184256/ngram-keylogger
ngram-keylogger: typing stats that don't leak passwords
BioGenies/CancerGram
Predicts anticancer peptides using random forests trained on the n-gram encoded peptides. The implemented algorithm can be accessed from both the command line and shiny-based GUI.
sayarghoshroy/Language-Modelling
Language Modelling for various corpora, Natural Language Generation using LMs, Corpus Statistics Visualization
choosewhatulike/Chinese-Ngram-LM-Hadoop
A distributed chinese n-gram language model implementation for train and test on large corpus , using Hadoop MapReduce.
knowledge-express/skipgram
For all your n-gram and skip-gram needs 🔠
zahrasalarian/Persian-Poet-Prediction
An automatic Persian poet detector from poetry, using NLP algorithms and Backoff model, which identifies which poet a verse of a poem belongs to among Ferdowsi, Saadi, and Rumi.
emanuelzaymus/Diacritics
Diacritics reconstruction (restoration) for Slovak text. Bachelor's thesis
filippofinke/n-gram
📝 n-gram based text categorization written in php
Software-Research-Lab/dropsuit-ngram
The ngram function is a JavaScript and Node.js function that is part of the DropSuit NLP library. It is designed to generate N-grams from input strings. It's open-source and available under the Apache License 2.0.
tamannanazmin/Xenophobic-Behavior-on-Social-Media
Observeing the semantic evolution of Xenophobic slurs using word embedding's.
YassirMatrane/arabicTextClassification
After collecting 40 thousand tweets and preprocessing it, I used word embeddings with arabert and tf-idf along with two neural network architectures and 5 machine learning algorithms. Due to the huge size of the dataset, I chose Amazon SageMaker to train the models