meshidenn's Stars
explosion/tokenizations
Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/
allenai/pdffigures2
Given a scholarly PDF, extract figures, tables, captions, and section titles.
thunlp/MetaAdaptRank
Code and data of the ACL 2021 paper: Few-Shot Text Ranking with Meta Adapted Synthetic Weak Supervision
facebookresearch/contriever
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
cl-tohoku/AIO2_DPR_baseline
https://www.nlp.ecei.tohoku.ac.jp/projects/aio/
UKPLab/gpl
Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: Generative Pseudo Labeling for Unsupervised Domain Adaptation of Dense Retrieval" https://arxiv.org/abs/2112.07577
recsyslab/recsys-python
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
elsevierlabs/OA-STM-Corpus
Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine.
drken1215/book_algorithm_solution
拙著「問題解決力を鍛える!アルゴリズムとデータ構造」の補足資料。ソースコードと、章末問題への略解を掲載。
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
CalculatedContent/WeightWatcher
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
naver/splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
javascript-tutorial/ja.javascript.info
現代の JavaScript チュートリアル
allenai/ir_datasets
Provides a common interface to many IR ranking datasets.
goodfeli/dlbook_notation
LaTeX files for the Deep Learning book notation
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
nullnull/simstring
A Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.
rinatz/python-book
ゼロから学ぶ Python
shibuiwilliam/ml-system-in-actions
machine learning system examples
microsoft/ML-For-Beginners
12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
taku910/cabocha
Yet Another Japanese Dependency Structure Analyzer
dvgodoy/dl-visuals
Over 200 figures and diagrams of the most popular deep learning architectures and layers FREE TO USE in your blog posts, slides, presentations, or papers.
terrier-org/terrier-core
Terrier IR Platform
IlyaGrebnov/libsais
libsais is a library for linear time suffix array, longest common prefix array and burrows wheeler transform construction based on induced sorting algorithm.
hatena/solr-tutorial
Solrの導入資料です。LAMP構成に特化しています。
luyug/COIL
NAACL2021 - COIL Contextualized Lexical Retriever
osirrc/anserini-bm25prf-docker
OSIRRC Docker Image for Anserini-bm25prf
matsui528/faiss_tips
Some useful tips for faiss