fjsj's Stars
fastai/fastbook
The fastai book, published as Jupyter Notebooks
caprover/caprover
Scalable PaaS (automated Docker+nginx) - aka Heroku on Steroids
explosion/thinc
đź”® A refreshing functional take on deep learning, compatible with your favorite libraries
hauntsaninja/pyp
Easily run Python at the shell! Magical, but never mysterious.
J535D165/recordlinkage
A powerful and modular toolkit for record linkage and duplicate detection in Python
philipperemy/name-dataset
The Python library for names.
goldsborough/ipc-bench
:racehorse: Benchmarks for Inter-Process-Communication Techniques
villasv/aws-airflow-stack
Turbine: the bare metals that gets you Airflow
Bergvca/string_grouper
Super Fast String Matching in Python
mattilyra/LSH
Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
LinkedInAttic/scanns
A scalable nearest neighbor search library in Apache Spark
MajorTal/DeepSpell
a Deep Learning based Speller
ticki/eudex
A blazingly fast phonetic reduction/hashing algorithm.
scify/JedAIToolkit
An open source, high scalability toolkit in Java for Entity Resolution.
chrislit/abydos
Abydos NLP/IR library for Python
IntuitionEngineeringTeam/chars2vec
Character-based word embeddings model based on RNN for handling real world texts
RUSH-LAB/Flash
LSH-GPU ANN package
xinyandai/string-embed
string embed for fast edit distance computation, codes for [Convolutional Embedding for Edit Distance (SIGIR 20)].
GiulioRossetti/TILES
TILES: an algorithm for community discovery in dynamic social networks
markokr/pghashlib
Stable hash functions for Postgres
cid-harvard/pandas-to-postgres
Copy Pandas DataFrames and HDF5 files to PostgreSQL database
mdcramer/Deep-Speeling
Deep Learning neural network for correcting spelling
Lettria/Char2Vec
zhao1701/extending-deep-ER
This project focuses on DeepER, a deep learning framework for entity resolution (record deduplication). It examines how DeepER performs on benchmark datasets under a variety of conditions and also tests a number of extensions designed to improve DeepER's accuracy.
J535D165/recordlinkage-annotator
A browser user interface for manual labeling of record pairs.
GiulioRossetti/DEMON
DEMON: a local-first discovery method for overlapping communities.
GiulioRossetti/Eva
Eva: Community Discovery for Labeled Graphs (networkx implementation)
arnimarj/py-judy
Opquast/Opquast-Web-Quality
The template to create your checklist on Devchecklists. https://devchecklists.com
paolociccarese/simile-vicino
Automatically exported from code.google.com/p/simile-vicino