vrozenbaum's Stars
google-research/bert
TensorFlow code and pre-trained models for BERT
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
piskvorky/gensim
Topic Modelling for Humans
smicallef/spiderfoot
SpiderFoot automates OSINT for threat intelligence and mapping your attack surface.
jina-ai/clip-as-service
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
marcotcr/lime
Lime: Explaining the predictions of any machine learning classifier
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
kojino/120-Data-Science-Interview-Questions
Answers to 120 commonly asked data science interview questions.
NikolaiT/GoogleScraper
A Python module to scrape several search engines (such as Google, Yandex, Bing, DuckDuckGo, ...), including asynchronous networking support.
chiphuyen/lazynlp
Library to scrape and clean web pages to create massive datasets.
asweigart/pyperclip
Python module for cross-platform clipboard functions.
luozhouyang/python-string-similarity
A library implementing different string similarity and distance measures using Python.
sbharadwajj/awesome-zero-shot-learning
A curated list of papers, code, and resources pertaining to zero-shot learning
tasos-py/Search-Engines-Scraper
Search Google, Bing, Yahoo, and other search engines with Python
mozilla/agithub
Agnostic GitHub client API -- An EDSL for connecting to REST servers
sujitpal/eeap-examples
Code for Document Similarity on Reuters dataset using Encode, Embed, Attend, Predict recipe
Deffro/text-preprocessing-techniques
16 Text Preprocessing Techniques in Python for Twitter Sentiment Analysis.
hamelsmu/ktext
Utilities for preprocessing text for deep learning with Keras
neo4j-contrib/neo4j-tableau
Neo4j Tableau Integration via WDC
fmpr/texttk
Text Preprocessing in Python
ankushbhatia2/community-detect
Community detection using attribute and structural similarities.
kohjiaxuan/NLP-Model-for-Corpus-Similarity
An NLP algorithm I developed to determine the similarity or relation between two documents or Wikipedia articles. Inspired by the cosine similarity algorithm and built on WordNet.
IanHGray/graph_analysis
Some scripts/guides for working with Neo4j in Python.
jcconnell/python-spotcrime
A Python wrapper for the Spot Crime API
ankushbhatia2/Document_similarity_research_notebook
Jupyter notebook for my research in Document Similarity.
jjedMoriAnktah/FeatureSpaceTree
An easy way to perform preprocessing tasks in text classification
jieliang/analyzing_presidency_similarity
Python script analyzing 130K presidential documents to calculate presidency similarity in terms of most covered topics.
cipher982/Document-Phrase-Matching-Alrogithm
Python word matching to score the similarity of two documents
sbhatt14/Similarity-Matrix-by-text-mining
Generate a similarity matrix of documents using NLTK. A brief description of a firm/organization is used as an example. Term Frequency-Inverse Document Frequency (TF-IDF) vectors of the documents are created. Each matrix score is based on cosine similarity.
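
The last entry in the list describes building TF-IDF vectors for a set of documents and scoring each pair with cosine similarity. The following is a minimal sketch of that general idea, not that repository's code: the function names (`tokenize`, `tfidf_vectors`, `cosine`, `similarity_matrix`) are hypothetical, tokenization is a plain whitespace split rather than NLTK, and the smoothed IDF formula is an assumption chosen for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase + whitespace split stands in for an NLTK tokenizer.
    return text.lower().split()


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict of term -> weight) per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    # Assumption: smoothed IDF, log((1 + N) / (1 + df)) + 1.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in df.items()}

    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        total = len(tokens)
        vectors.append({t: (c / total) * idf[t] for t, c in tf.items()})
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def similarity_matrix(docs):
    """Pairwise cosine similarity of the documents' TF-IDF vectors."""
    vecs = tfidf_vectors(docs)
    return [[cosine(a, b) for b in vecs] for a in vecs]


if __name__ == "__main__":
    # Made-up firm descriptions, used only to exercise the functions.
    descriptions = [
        "acme builds industrial robots",
        "acme builds robots for factories",
        "a bakery selling fresh bread",
    ]
    for row in similarity_matrix(descriptions):
        print(["%.2f" % score for score in row])
```

Documents that share terms score above zero, documents with no terms in common score exactly zero, and every document scores approximately 1 against itself.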