Pinned Repositories
AmbigPrompt
Answering Ambiguous Questions via Iterative Prompting
ckgc
"Conversations Powered by Cross-Lingual Knowledge" in SIGIR'21
GenKS
Code for "Generative Knowledge Selection for Knowledge-Grounded Dialogues"
GenRet
Learning to Tokenize for Generative Retrieval (NeurIPS 2023)
MAIR
MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]
MetaSim
Metaphorical User Simulation for Task-Oriented Dialogue Evaluation
MixCL
Contrastive Learning Reduces Hallucination in Conversations
RankGPT
Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]
spanish-reddit-dialogues-corpus
A Spanish Reddit dialogues corpus, constructed using Reddit comments of 2019.
user-satisfaction-simulation
"Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems" in SIGIR'21
sunnweiwei's Repositories
sunnweiwei/RankGPT
Is ChatGPT Good at Search? LLMs as Re-Ranking Agent [EMNLP 2023 Outstanding Paper Award]
sunnweiwei/GenRet
Learning to Tokenize for Generative Retrieval (NeurIPS 2023)
sunnweiwei/user-satisfaction-simulation
"Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems" in SIGIR'21
sunnweiwei/MixCL
Contrastive Learning Reduces Hallucination in Conversations
sunnweiwei/AmbigPrompt
Answering Ambiguous Questions via Iterative Prompting
sunnweiwei/MAIR
MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]
sunnweiwei/GenKS
Code for "Generative Knowledge Selection for Knowledge-Grounded Dialogues"
sunnweiwei/ckgc
"Conversations Powered by Cross-Lingual Knowledge" in SIGIR'21
sunnweiwei/MetaSim
Metaphorical User Simulation for Task-Oriented Dialogue Evaluation
sunnweiwei/spanish-reddit-dialogues-corpus
A Spanish Reddit dialogues corpus, constructed using Reddit comments of 2019.
sunnweiwei/ckgc-system
Test system for CKGC.
sunnweiwei/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
sunnweiwei/evals
Evals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
sunnweiwei/mteb
MTEB: Massive Text Embedding Benchmark
sunnweiwei/sunnweiwei-old.github.io
sunnweiwei/sunnweiwei.github.io
sunnweiwei/sunnweiwei.v2.github.io
sunnweiwei/sunweiwei-academic
sunnweiwei/template-sunnweiwei.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes