n-gram

There are 80 repositories under n-gram topic.

  • zhezhaoa/ngram2vec

    Four word embedding models implemented in Python. Supporting arbitrary context features

    Language:Python8466323174
  • rockymadden/stringmetric

    :dart: String metrics and phonetic algorithms for Scala (e.g. Dice/Sorensen, Hamming, Jaccard, Jaro, Jaro-Winkler, Levenshtein, Metaphone, N-Gram, NYSIIS, Overlap, Ratcliff/Obershelp, Refined NYSIIS, Refined Soundex, Soundex, Weighted Levenshtein).

    Language:Scala484412980
  • adrg/strutil

    Go metrics for calculating string similarity and other string utility functions

    Language:Go3624423
  • ml-classify-text-js

    andreekeberg/ml-classify-text-js

    Machine learning based text classification in JavaScript using n-grams and cosine similarity

    Language:JavaScript12661111
  • words/n-gram

    Get n-grams from text

    Language:JavaScript786517
  • lucidrains/n-grammer-pytorch

    Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch

    Language:Python72731
  • princeton-nlp/calm-textgame

    [EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games

    Language:Python69487
  • inspiringz/XDet

    基于机器学习的 Webshell 检测 (OPCode -> N-Gram -> TF-IDF -> XGBoost)

    Language:PHP421010
  • chao1224/n_gram_graph

    N-Gram Graph: Simple Unsupervised Representation for Graphs, NeurIPS'19 (https://arxiv.org/abs/1806.09206)

    Language:Python38348
  • ajenhl/tacl

    Tool for performing basic text analysis on the CBETA corpus

    Language:Python3110629
  • wooorm/trigrams

    Trigram files for 400+ languages

    Language:JavaScript23403
  • Gabriel-Ciriaco/super-profanity

    An advanced profanity filter based on phonetics and normal detection that detects swears in 27 different languages!

    Language:JavaScript16252
  • KhaledAshrafH/Auto-Filling-Text

    This project is an auto-filling text program implemented in Python using N-gram models. The program suggests the next word based on the input given by the user. It utilizes N-gram models, specifically Trigrams and Bigrams, to generate predictions.

    Language:Python16414
  • AyushSoni86/n-gram-model

    This project contains n-gram generator which is build using HTML, CSS and Javascript

    Language:HTML13106
  • cyclone-github/spider

    URL Spider - web crawler and wordlist / ngram generator

    Language:Go12200
  • corazzon/inflearn-new-year-event-2020

    기획자와 마케터를 위한 이벤트 댓글 분석 - feat. 인프런 새해 다짐 이벤트

    Language:Jupyter Notebook114016
  • Jed-Z/ngram-text-prediction

    用 n-gram 语言模型进行新闻文本内容预测。https://www.jeddd.com/article/python-ngram-language-prediction.html

    Language:Jupyter Notebook10206
  • ServiceNow/bytesteady

    A fast classification and tagging tool using byte-level n-gram embeddings. bytesteady is a ServiceNow Research project that was started at Element AI.

    Language:C++10602
  • fayrose/MiddleEgyptianDataset

    Parses 3 dictionaries from PDFs, reconstructs lost formatting using N-gram and visual computing methods, and serializes to a database for web display.

    Language:C#9202
  • enix223/nlp-course

    NLP Course stuff and algorithm implementations

    Language:Jupyter Notebook8301
  • fredriko/metacurate-lexicon

    A web service that exposes semantic similarity search via a web GUI and a RESTful API.

    Language:Python8200
  • chika-kasymov/WordSuggestion

    A simple Objective-C implementation of word suggestion using n-gram probabilistic model

    Language:Objective-C7204
  • aallam/string-similarity-kotlin

    Kotlin multiplatform library offering various algorithms to measure string similarity and distance

    Language:Kotlin6200
  • GatoY/Spelling-Correction-Methods

    Comparison among four spelling correction methods. n-gram, Levenshtein, Jaro, Jaro_winkler

    Language:Python5202
  • go-generalize/firestore-repo

    Model Generator for Firestore(move to volcago)

    Language:Go54560
  • t184256/ngram-keylogger

    ngram-keylogger: typing stats that don't leak passwords

    Language:Python5311
  • BioGenies/CancerGram

    Predicts anticancer peptides using random forests trained on the n-gram encoded peptides. The implemented algorithm can be accessed from both the command line and shiny-based GUI.

    Language:R4300
  • sayarghoshroy/Language-Modelling

    Language Modelling for various corpora, Natural Language Generation using LMs, Corpus Statistics Visualization

    Language:Jupyter Notebook4203
  • choosewhatulike/Chinese-Ngram-LM-Hadoop

    A distributed chinese n-gram language model implementation for train and test on large corpus , using Hadoop MapReduce.

    Language:Java3100
  • knowledge-express/skipgram

    For all your n-gram and skip-gram needs 🔠

    Language:JavaScript3200
  • zahrasalarian/Persian-Poet-Prediction

    An automatic Persian poet detector from poetry, using NLP algorithms and Backoff model, which identifies which poet a verse of a poem belongs to among Ferdowsi, Saadi, and Rumi.

    Language:Python3100
  • emanuelzaymus/Diacritics

    Diacritics reconstruction (restoration) for Slovak text. Bachelor's thesis

    Language:C#2011
  • filippofinke/n-gram

    📝 n-gram based text categorization written in php

    Language:PHP230
  • Software-Research-Lab/dropsuit-ngram

    The ngram function is a JavaScript and Node.js function that is part of the DropSuit NLP library. It is designed to generate N-grams from input strings. It's open-source and available under the Apache License 2.0.

    Language:JavaScript2200
  • tamannanazmin/Xenophobic-Behavior-on-Social-Media

    Observeing the semantic evolution of Xenophobic slurs using word embedding's.

    Language:Jupyter Notebook2100
  • YassirMatrane/arabicTextClassification

    After collecting 40 thousand tweets and preprocessing it, I used word embeddings with arabert and tf-idf along with two neural network architectures and 5 machine learning algorithms. Due to the huge size of the dataset, I chose Amazon SageMaker to train the models

    Language:Jupyter Notebook2202