yuxiaw's Stars
gradio-app/gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
brightmart/nlp_chinese_corpus
Large Scale Chinese Corpus for NLP
esbatmop/MNBVC
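MNBVC (Massive Never-ending BT Vast Chinese corpus): a very-large-scale Chinese corpus, targeting the 40T of data used to train chatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures, and even "Martian script" (火星文) text. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.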
Libr-AI/OpenFactVerification
Loki: Open-source solution designed to automate the process of verifying factuality
GAIR-NLP/factool
FacTool: Factuality Detection in Generative AI
chatopera/efaqa-corpus-zh
❤️ Emotional First Aid Dataset: a psychological counseling Q&A and chatbot corpus
facebookresearch/EmpatheticDialogues
Dialogue model that produces empathetic responses when trained on the EmpatheticDialogues dataset.
shmsw25/FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
Sahandfer/EMPaper
This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.
Libr-AI/do-not-answer
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
behavioral-data/Empathy-Mental-Health
Repository containing code and dataset access instructions for the EMNLP 2020 paper on empathy in text-based mental health support
Libr-AI/OpenRedTeaming
Papers about red teaming LLMs and Multimodal models.
yuxiaw/Factcheck-GPT
Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.
mbzuai-nlp/SemEval2024-task8
SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection
marslanm/Multimodality-Representation-Learning
This repository provides a comprehensive collection of research papers on multimodal representation learning, all of which are cited and discussed in the accepted survey (https://dl.acm.org/doi/abs/10.1145/3617833).
behavioral-data/PARTNER
Repository containing code for the WWW 2021 paper on empathic rewriting
hkust-nlp/felm
GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
Nanami18/Snowballed_Hallucination
anthonywchen/RARR
RARR: Researching and Revising What Language Models Say, Using Language Models
yuxiaw/OpenFactCheck
ryuryukke/OUTFOX
[AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples"
lanluyu/zhihu
zhihu is a crawler for Zhihu topic content; it can crawl the Q&A content related to all Zhihu topics
oaimli/PeerSum
The dataset and code for PeerSum at EMNLP'23.
mitmedialab/empathic-stories
yuxiaw/USTS
This work explores collective human opinions in Semantic Textual Similarity and releases USTS, a new uncertainty-aware STS dataset.