yowenter

吾尝终日而思，不如须臾之所学也！

Meituan

yowenter's Stars

ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Language:Go105k 605 5.3k8.3k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.3k 352 1.8k4.6k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language:Python36.8k 219 5.6k4.5k
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python29.7k 344 2694.1k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python27.6k 230 2733.2k
mudler/LocalAI
:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference
Language:Go27.3k 192 9042k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda24.8k 252 1412.8k
microsoft/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
Language:C#22.4k 272 3.4k3.4k
chroma-core/chroma
the AI-native open-source embedding database
Language:Rust16k 90 1.2k1.4k
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
Language:Python15.4k 83 3.8k1.8k
1Panel-dev/MaxKB
💬 Ready-to-use, flexible RAG Chatbot.
Language:Python12.2k 79 8681.6k
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python11k 70 108697
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
Language:Jupyter Notebook10.1k 85 249826
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）
Language:HTML8k 108 442761
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
Language:SystemVerilog7.2k 69 24547
prasadgujar/low-level-design-primer
Dedicated Resources for the Low-Level System Design. Learn how to design and implement large-scale systems. Prep for the system design interview.
6.6k 171 22.3k
yangjianxin1/Firefly
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Language:Python6k 55 281533
pytorch/torchtune
PyTorch native post-training library
Language:Python4.5k 48 804468
FedML-AI/FedML
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.
Language:Python4.2k 117 327786
facebookresearch/fairscale
PyTorch extensions for high performance and large scale training.
Language:Python3.2k 50 360283
iusztinpaul/hands-on-llms
🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴
Language:Jupyter Notebook3.1k 48 21493
young-geng/EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.
Language:Python2.4k 43 88260
ashishpatel26/LLM-Finetuning
LLM Finetuning with peft
Language:Jupyter Notebook2.3k 33 3620
VerticalResearchGroup/miaow
An open source GPU based off of the AMD Southern Islands ISA.
Language:Verilog1.1k 146 16238
hughperkins/VeriGPU
OpenSource GPU, in Verilog, loosely based on RISC-V ISA
Language:SystemVerilog862 30 1694
microsoft/Freeflow
High performance container overlay networks on Linux. Enabling RDMA (on both InfiniBand and RoCE) and accelerating TCP to bare metal performance. Freeflow requires zero modification on application code/binary.
Language:C608 34 2292
huggingface/llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
Language:Python465 53 035
k8snetworkplumbingwg/sriov-network-device-plugin
SRIOV network device plugin for Kubernetes
Language:Go412 33 227177
NVIDIA/k8s-dra-driver
Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes
Language:Go295 17 4556
hkproj/transformer-from-scratch-notes
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
229 3 153

yowenter

yowenter's Stars

ollama/ollama

lm-sys/FastChat

hiyouga/LLaMA-Factory

tatsu-lab/stanford_alpaca

meta-llama/llama3

mudler/LocalAI

karpathy/llm.c

microsoft/semantic-kernel

chroma-core/chroma

BerriAI/litellm

1Panel-dev/MaxKB

microsoft/LoRA

artidoro/qlora

LianjiaTech/BELLE

adam-maj/tiny-gpu

prasadgujar/low-level-design-primer

yangjianxin1/Firefly

pytorch/torchtune

FedML-AI/FedML

facebookresearch/fairscale

iusztinpaul/hands-on-llms

young-geng/EasyLM

ashishpatel26/LLM-Finetuning

VerticalResearchGroup/miaow

hughperkins/VeriGPU

microsoft/Freeflow

huggingface/llm_training_handbook

k8snetworkplumbingwg/sriov-network-device-plugin

NVIDIA/k8s-dra-driver

hkproj/transformer-from-scratch-notes