qcwthu's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
jessevig/bertviz
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
eseckel/ai-for-grant-writing
A curated list of resources for using LLMs to develop more competitive grant applications.
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
multimodal-art-projection/MAP-NEO
metauto-ai/GPTSwarm
🐝 GPTSwarm: LLM agents as (Optimizable) Graphs
google-deepmind/recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
google-deepmind/long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
potsawee/selfcheckgpt
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
RUCAIBox/HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
shmsw25/FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
Re-Align/URIAL
LuckyyySTA/Awesome-LLM-hallucination
LLM hallucination paper list
OpenBMB/Eurus
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
IINemo/lm-polygraph
wang-chen/thesis_template_ntu
Thesis Latex Template for Nanyang Technological University (NTU)
google-deepmind/mishax
oneal2000/DRAGIN
Source code of DRAGIN, ACL 2024 main conference Long Paper
FateScript/token_visualizer
Token level visualization tools for large language models
activatedgeek/calibration-tuning
oneal2000/MIND
Source code of our paper MIND, ACL 2024 Long Paper
hnmr293/llama-viz
The attention map viewer for LLaMA models.
kttian/llm_factuality_tuning
katiekang1998/llm_hallucinations
JiaqiLi404/Know_the_Unknown
code for paper: Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning