acewjh's Stars
HqWu-HITCS/Awesome-Chinese-LLM
A curated collection of open-source Chinese LLMs, focusing on smaller-scale models that can be privately deployed at low training cost, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
vectara/hallucination-leaderboard
Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents
snwfdhmp/awesome-gpt-prompt-engineering
A curated list of awesome resources, tools, and other shiny things for LLM prompt engineering.
Libr-AI/OpenFactVerification
Loki: Open-source solution designed to automate the process of verifying factuality
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
ThuCCSLab/Awesome-LM-SSP
A reading list on large model safety, security, and privacy (covering Awesome LLM Security, Safety, etc.).
EdinburghNLP/awesome-hallucination-detection
List of papers on hallucination detection in LLMs.
google-deepmind/long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
potsawee/selfcheckgpt
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
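The core idea behind SelfCheckGPT is that facts an LLM actually knows tend to reappear consistently across stochastically sampled responses, while hallucinated sentences do not. A minimal sketch of that consistency check, using plain string similarity as a stand-in (the actual variants in the repository use BERTScore, NLI, or QA-based scoring instead):

```python
from difflib import SequenceMatcher

def selfcheck_score(sentence: str, samples: list[str]) -> float:
    """Inconsistency score in [0, 1] for one sentence of a response.

    Compares the sentence against N independently sampled responses;
    low similarity to all of them suggests possible hallucination.
    (Illustrative proxy only: string similarity replaces the learned
    scorers used by SelfCheckGPT proper.)
    """
    sims = [
        SequenceMatcher(None, sentence.lower(), s.lower()).ratio()
        for s in samples
    ]
    # 0.0 = the sentence matches some sample exactly (well supported),
    # 1.0 = it resembles none of the samples (likely hallucinated).
    return 1.0 - max(sims, default=0.0)
```

Scoring is per-sentence, so a long response can be flagged at fine granularity rather than accepted or rejected wholesale.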
showlab/Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
amazon-science/RefChecker
RefChecker provides an automatic checking pipeline and a benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
thu-coai/BPO
InternLM/OpenAOE
LLM Group Chat Framework: chat with multiple LLMs at the same time.
zjunlp/IEPile
[ACL 2024] IEPile: A Large-Scale Information Extraction Corpus
xieyuquanxx/awesome-Large-MultiModal-Hallucination
😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.
junyangwang0410/AMBER
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
yinzhangyue/SelfAware
Do Large Language Models Know What They Don’t Know?
zjunlp/FactCHD
[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
OpenKG-ORG/EasyDetect
An Easy-to-use Hallucination Detection Framework for LLMs.
javyduck/KnowHalu
terrierteam/pyterrier_doc2query
Ki-Seki/chat_prompt_templates
Collection of Basic Prompt Templates for Various Chat LLMs
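Different chat LLMs expect different wrapping around the same instruction, which is what a template collection like this standardizes. A minimal sketch of the pattern (the template names and formats below are illustrative assumptions, not the repository's actual contents):

```python
# Hypothetical templates keyed by model family; real chat formats
# vary per model and should be taken from the model's documentation.
TEMPLATES = {
    "alpaca": "### Instruction:\n{instruction}\n\n### Response:\n",
    "vicuna": "USER: {instruction} ASSISTANT: ",
}

def build_prompt(style: str, instruction: str) -> str:
    """Render a raw instruction into the chosen chat template."""
    return TEMPLATES[style].format(instruction=instruction)
```

Keeping templates as data rather than code makes it easy to swap model families without touching the inference loop.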
AIFlames/Flames
Flames is a highly adversarial Chinese benchmark for evaluating LLM harmlessness, developed by Shanghai AI Lab and the Fudan NLP Group.
WhitzardIndex/WhitzardBench-2024A
Fudan Whitzard LLM safety benchmark suite (Summer 2024 edition)
webis-de/ACL-22
jiazhen-code/IntrinsicHallu
kinit-sk/disinformation-capabilities
Implementation of the paper "Disinformation Capabilities of Large Language Models"