Pinned Repositories
do-not-answer
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
libra-eval
M4
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
SemEval2024-task8
SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
Empathic-Similarity
Factcheck-GPT
Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.
OpenFactCheck
Regularise-Regression-Noisy-Labels
Code of the paper: Noisy Label Regularisation for Textual Regression
Uncertainty-regression
USTS
This work explores collective human opinions in Semantic Textual Similarity, with a new uncertainty-aware STS dataset, USTS released.
yuxiaw's Repositories
yuxiaw/Factcheck-GPT
Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.
yuxiaw/OpenFactCheck
yuxiaw/Uncertainty-regression
yuxiaw/USTS
This work explores collective human opinions in Semantic Textual Similarity, with a new uncertainty-aware STS dataset, USTS released.
yuxiaw/Empathic-Similarity
yuxiaw/Regularise-Regression-Noisy-Labels
Code of the paper: Noisy Label Regularisation for Textual Regression