cytan17726's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Also supports a number of ready-to-use inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
QwenLM/Qwen
The official repo of Qwen (通义千问), the chat & pretrained large language model proposed by Alibaba Cloud.
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
xusenlinzy/api-for-open-llm
An OpenAI-style API for open large language models: use open-source LLMs just like ChatGPT! Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc. A unified backend interface for open-source large models.
ydyjya/Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
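As a reminder of what these human-aware loss functions look like, here is a minimal sketch of the DPO objective for a single preference pair. This is an illustrative reimplementation from the published formula, not the HALOs library's API; the function name and argument names are my own.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) response pair.

    Inputs are total log-probabilities of each response under the trained
    policy and the frozen reference model; beta scales the implicit rewards.
    (Hypothetical sketch for illustration, not the HALOs implementation.)
    """
    chosen_reward = beta * (policy_chosen_logp - ref_chosen_logp)
    rejected_reward = beta * (policy_rejected_logp - ref_rejected_logp)
    margin = chosen_reward - rejected_reward
    # -log(sigmoid(margin)): shrinks as the policy prefers the chosen
    # response more strongly than the reference model does
    return -math.log(1.0 / (1.0 + math.exp(-margin)))
```

When the policy and reference agree exactly, the margin is zero and the loss sits at log 2; it decreases as the policy's preference for the chosen response grows relative to the reference.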
haonan-li/CMMLU
CMMLU: Measuring massive multitask language understanding in Chinese
sylinrl/TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
IAAR-Shanghai/UHGEval
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
ictnlp/TruthX
Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"
Spico197/Mirror
🪞A powerful toolkit for almost all the Information Extraction tasks.
shizhediao/R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"
canghongjian/beam_retriever
[NAACL 2024] End-to-End Beam Retrieval for Multi-Hop Question Answering
yinzhangyue/SelfAware
Do Large Language Models Know What They Don’t Know?
OpenMOSS/Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
dki-lab/Pangu
Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments
huiyeruzhou/arxiv_crawler
An efficient, fast arXiv paper crawler: it scrapes papers from a specified time range, on specified topics, and containing specified keywords to local storage, and translates their titles and abstracts into Chinese.
Spico197/MoE-SFT
🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts
zjysteven/mink-plus-plus
Min-K%++: Improved baseline for detecting pre-training data of LLMs https://arxiv.org/abs/2404.02936
intuit-ai-research/DCR-consistency
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language Models
thunlp/FalseQA
Repo for ACL2023 paper "Won't Get Fooled Again: Answering Questions with False Premises"
zhliu0106/probing-lm-data
Official Implementation of "Probing Language Models for Pre-training Data Detection"
genglinliu/UnknownBench
Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge
zhliu0106/learning-to-refuse
Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"
amayuelas/knowledge-of-knowledge
Spico197/server-remote-control
Remote power control by accessing BMI.