wangwisdom's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
google/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
openai/openai-python
The official Python library for the OpenAI API
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
huggingface/trl
Train transformer language models with reinforcement learning.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
facebookresearch/metaseq
Repo for external large-scale work
THUDM/CogVLM
A state-of-the-art open visual language model | multimodal pre-trained model
yangjianxin1/Firefly
Firefly: a large language model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
microsoft/vscode-docs
Public documentation for Visual Studio Code
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
lyhue1991/torchkeras
Pytorch❤️ Keras 😋😋
openai/openai-quickstart-python
Python example app from the OpenAI API quickstart tutorial
laekov/fastmoe
A fast MoE impl for PyTorch
THUDM/AgentTuning
AgentTuning: Enabling Generalized Agent Abilities for LLMs
THUDM/SwissArmyTransformer
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
tangqiaoyu/ToolAlpaca
The official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"
IEIT-Yuan/Yuan-2.0
Yuan 2.0 Large Language Model
suzgunmirac/BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
yangjianxin1/Firefly-LLaMA2-Chinese
Firefly Chinese LLaMA-2 large model, supporting incremental pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models
genetics-statistics/GEMMA
Genome-wide Efficient Mixed Model Association
open-chinese/alpaca-chinese-dataset
Alpaca Chinese Dataset -- Chinese instruction fine-tuning dataset [human + GPT4o, continuously updated]
jakecyr/openai-function-calling
Helper functions to generate JSON schema dicts for OpenAI ChatGPT function calling requests.
xinzhanguo/hellollm
Pre-train a new LLM
mindspore-ai/zidongtaichu
pengwei-iie/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | open-source bilingual dialogue language model
dogyman/Chinese-Guanaco
Chinese Guanaco large language model: QLoRA quantized training + local CPU/GPU deployment (Chinese Guanaco QLoRA: Efficient Finetuning of Quantized LLMs)