emanjavacas's Stars
allenai/allennlp
An open-source NLP research library, built on PyTorch.
brightmart/text_classification
all kinds of text classification models and more with deep learning
dennybritz/deeplearning-papernotes
Summaries and notes on Deep Learning research papers
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
ekzhu/datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble and HNSW
chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
hauntsaninja/pyp
Easily run Python at the shell! Magical, but never mysterious.
masmu/pulseaudio-dlna
A lightweight streaming server which brings DLNA / UPNP and Chromecast support to PulseAudio and Linux
maciejkula/glove-python
Toy Python implementation of http://www-nlp.stanford.edu/projects/glove/
anfederico/flaskex
Simple flask example for quick prototypes and small applications
gregversteeg/corex_topic
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
eladhoffer/seq2seq.pytorch
Sequence-to-Sequence learning using PyTorch
neubig/nlptutorial
A Tutorial about Programming for Natural Language Processing
Stonesjtu/Pytorch-NCE
The Noise Contrastive Estimation for softmax output written in Pytorch
locuslab/pytorch_fft
PyTorch wrapper for FFTs
falk0069/sony-pm-alt
Transfer pictures wirelessly for a Sony camera without using Playmemories (Sony PM Alternative)
manifoldai/merf
Mixed Effects Random Forest
volumio/volumio-plugins
rdspring1/PyTorch_GBW_LM
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
dasmiq/passim
Detect and align similar passages
smilli/kneser-ney
Kneser-Ney implementation in Python
rjagerman/pytorchltr
Learning to Rank in PyTorch
a1da4/paper-survey
Summary of machine learning papers
eliranwong/LXX-Rahlfs-1935
Septuagint database based on Rahlfs 1935 edition
proycon/python-ucto
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser written in C++ (http://ilk.uvt.nl/ucto).
ajaech/calm
Context Aware Language Models
fbkarsdorp/melodic-similarity
Source code for "Learning Similarity Metrics for Melody Retrieval"
riedlma/sequence_tagging
Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German
avjves/textreuse-blast
A software to detect text reuse with BLAST.
TechnionTDK/jngram