casually-PYlearner's Stars
UKPLab/sentence-transformers
State-of-the-Art Text Embeddings
NVIDIA/FasterTransformer
Transformer-related optimization, including BERT, GPT
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
castorini/pyserini
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
salesforce/WikiSQL
A large annotated semantic parsing corpus for developing natural language interfaces.
tczhangzhi/pytorch-distributed
A quickstart and benchmark for pytorch distributed training.
google-research/tapas
End-to-end neural table-text understanding models.
dorianbrown/rank_bm25
A Collection of BM25 Algorithms in Python
PaddlePaddle/RocketQA
🚀 RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.
luhua-rain/MRC_Competition_Dureader
Machine reading comprehension: champion/runner-up competition code and Chinese pre-trained MRC models
texttron/tevatron
Tevatron - A flexible toolkit for neural retrieval research and development.
NTMC-Community/MatchZoo-py
Facilitating the design, comparison and sharing of deep text matching models.
SpursGoZmy/Tabular-LLM
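This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it to strengthen their understanding of tabular data, ultimately building large language models dedicated to table intelligence tasks.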
caiyinqiong/Semantic-Retrieval-Models
A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).
deepset-ai/haystack-tutorials
Here you can find all the Tutorials for Haystack 📓
Sleepychord/CogLTX
The source code of NeurIPS 2020 paper "CogLTX: Applying BERT to Long Texts"
luyug/Reranker
Build Text Rerankers with Deep Language Models
RUCAIBox/DenseRetrieval
ZhuiyiTechnology/TableQA
NL2SQL competition dataset
OpenMatch/OpenMatch
An Open-Source Package for Information Retrieval
Georgetown-IR-Lab/OpenNIR
An end-to-end neural ad-hoc ranking pipeline.
sherlcok314159/ChineseMRC-Data
A collection of the extractive MRC datasets available so far for Chinese
FeiWang96/GTR
[SIGIR 2021] Retrieving Complex Tables with Multi-Granular Graph Representation Learning.
castorini/anserini-tools
Evaluation tools shared across anserini, pyserini, and pygaggle
medtray/StruBERT
StruBERT: Structure-aware BERT for Table Search and Matching
HITsz-TMG/Hansel
Code and data of WSDM 2023 paper "Hansel: A Chinese Few-Shot and Zero-Shot Entity Linking Benchmark".
sean0042/Open_WikiTable
Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table
BDBC-KG-NLP/Chinese-Pretrain-MRC-Model
SpursGoZmy/IM-TQA
Dataset and Code for ACL 2023 paper: "IM-TQA: A Chinese Table Question Answering Dataset with Implicit and Multi-type Table Structures". We proposed a new TQA problem which aims at real application scenarios, together with a supporting dataset and a baseline method.
svjack/tableQA-Chinese
Unsupervised tableQA and databaseQA on Chinese finance question and tabular data