victorsungo's Stars
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
mckaywrigley/chatbot-ui
AI chat for any model.
facebookresearch/fastText
Library for fast text representation and classification.
Vision-CAIR/MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model resources
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
meta-llama/codellama
Inference code for CodeLlama models
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A, supports inference solutions such as HF TGI and vLLM for local or cloud deployment, and includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
THUDM/ChatGLM2-6B
ChatGLM2-6B: an open-source bilingual chat LLM
QwenLM/Qwen
The official repo of Qwen (通义千问), the chat and pretrained large language models proposed by Alibaba Cloud.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
huggingface/trl
Train transformer language models with reinforcement learning.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
bigcode-project/starcoder
Home of StarCoder: fine-tuning & inference!
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
xorbitsai/inference
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference lets you run inference with any open-source language, speech-recognition, or multimodal model, whether in the cloud, on-premises, or on your laptop.
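A minimal sketch of the "single line of code" swap this description refers to: point the standard OpenAI Python client at Xinference's OpenAI-compatible endpoint instead of api.openai.com. The port (9997) and the model name below are assumptions for a default local deployment, not values from this listing.

# Assumes a local Xinference server with an OpenAI-compatible API and one model already launched.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:9997/v1",  # the "single line" change: local Xinference instead of OpenAI
    api_key="not-needed",                  # placeholder; a local server typically ignores the key
)

response = client.chat.completions.create(
    model="my-launched-model",  # hypothetical model UID; use whichever model you launched in Xinference
    messages=[{"role": "user", "content": "Summarize what Xinference does."}],
)
print(response.choices[0].message.content)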
InternLM/MindSearch
🔍 An LLM-based multi-agent framework for web search engines (similar to Perplexity.ai Pro and SearchGPT)
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
ricklamers/gpt-code-ui
An open-source implementation of OpenAI's ChatGPT Code Interpreter
google-research/t5x
ise-uiuc/magicoder
[ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct
WisdomShell/codeshell
A series of code large language models developed by PKU-KCL
EleutherAI/math-lm
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)
abacaj/code-eval
Run evaluations of LLMs on the HumanEval benchmark
mzbac/wizardCoder-vsc
Visual Studio Code extension for WizardCoder