oneal2000's Stars
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
ymcui/Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
lixin4ever/Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
attardi/wikiextractor
A tool for extracting plain text from Wikipedia dumps
HillZhang1999/llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"
zjunlp/KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
jianzhnie/LLamaTuner
Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.
potsawee/selfcheckgpt
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
xv44586/Chinese-instruction-datasets
中文 Instruction tuning datasets
AtomEcho/AtomBulb
旨在对当前主流LLM进行一个直观、具体、标准的评测
CoinCheung/gdGPT
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
oneal2000/DRAGIN
Source code of DRAGIN, ACL 2024 main conference Long Paper
NJU-LegalAI/Legal-ChatGLM
基于中文法律知识的ChatGLM指令微调
oneal2000/MIND
Source code of our paper MIND, ACL 2024 Long Paper
oneal2000/Wikiformer
Code for AAAI 2024 paper Wikiformer
andy-yangz/Awesome-Chinese-Instruction-Datasets
中文 Instruction 相关数据集整理
THUlawtech/LegalAttack
oneal2000/STARD
StaRD: Statute Retrieval Dataset based on Real-World Legal Consultation
ict-bigdatalab/utility_judgments
oneal2000/EntityHallucination
oneal2000/Caseformer
Source code of our long paper: Caseformer: Pre-training for Legal Case Retrieval