qcwthu's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
meta-llama/llama3
The official Meta Llama 3 GitHub site
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
triton-lang/triton
Development repository for the Triton language and compiler
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
steven2358/awesome-generative-ai
A curated list of modern Generative Artificial Intelligence projects and services
AutoGPTQ/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
jiaweizzhao/GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
AmbroseX/Awesome-AISourceHub
本仓库收集AI科技领域高质量信息源。 可以起到一个同步信息源的作用,避免信息差和信息茧房。
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
multimodal-art-projection/MAP-NEO
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
metauto-ai/GPTSwarm
🐝 GPTSwarm: LLM agents as (Optimizable) Graphs
google-deepmind/recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
RUCAIBox/HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
LuckyyySTA/Awesome-LLM-hallucination
LLM hallucination paper list
OpenBMB/Eurus
YuxiXie/MCTS-DPO
This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.
BobXWu/TopMost
A Topic Modeling System Toolkit
zorazrw/awesome-tool-llm
wang-chen/thesis_template_ntu
Thesis Latex Template for Nanyang Technological University (NTU)
FateScript/token_visualizer
Token level visualization tools for large language models
ntunlp/LLMSanitize
An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).
katiekang1998/llm_hallucinations
JiaqiLi404/Know_the_Unknown
code for paper: Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning