shawndr's Stars
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use: evaluate your models across 15+ diverse IR datasets.
qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
o19s/quepid
Improve search quality for Elasticsearch, OpenSearch, Solr, Vectara, Algolia, and custom search engines.
SeaseLtd/rated-ranking-evaluator
Search Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Yelp/xgboost-predictor-java
Pure Java implementation of XGBoost predictor for online prediction tasks.
THUxiexiaohui/SIGIR-2020-Tutorial
Modeling User Behavior for Vertical Search: Images, Apps and Products
NTMC-Community/awesome-neural-models-for-semantic-match
A curated list of papers dedicated to neural text (semantic) matching.
NTMC-Community/MatchZoo
Facilitating the design, comparison and sharing of deep text matching models.
markovi/PyClick
varepsilon/clickmodels
ClickModels is a small set of Python scripts for user click models initially developed at Yandex. A click model is a probabilistic graphical model used to predict search engine click data from past observations. This project deals with click models used in Information Retrieval (see next README.md) and is intended to be easy to read and easy to modify. If it's not, please let me know how to improve it :)
wenyuer/POG
Datasets for the POG paper.
bmitra-msft/NDRM
A repository for Neural Document Ranking Models.
nmslib/nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
rspeer/python-ftfy
Fixes mojibake and other glitches in Unicode text, after the fact.
tensorflow/similarity
TensorFlow Similarity is a python package focused on making similarity learning quick and easy.
zurutech/anomaly-toolbox
Anomaly detection using GANs.
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet given an image.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
VinAIResearch/BERTweet
BERTweet: A pre-trained language model for English Tweets (EMNLP-2020)
facebookresearch/NeuralDB
Database Reasoning Over Text project for ACL paper
scikit-learn-contrib/hdbscan
A high performance implementation of HDBSCAN clustering.
whylabs/whylogs
An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
facebookresearch/luckmatters
Understanding Training Dynamics of Deep ReLU Networks
google-research/sloe-logistic
linkedin/greykite
A flexible, intuitive and fast forecasting library
appsmithorg/appsmith
Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.
dsternlicht/RESTool
RESTool is an open source UI tool for managing RESTful APIs. It could save you time developing your own internal tools. A live example:
facebookresearch/suncet
Code to reproduce the results in the FAIR research papers "Semi-Supervised Learning of Visual Features by Non-Parametrically Predicting View Assignments with Support Samples" https://arxiv.org/abs/2104.13963 and "Supervision Accelerates Pre-training in Contrastive Semi-Supervised Learning of Visual Representations" https://arxiv.org/abs/2006.10803
facebookresearch/dino
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
jupyterhub/repo2docker
Turn repositories into Jupyter-enabled Docker images