andywy110's Stars
NVIDIA/DIGITS
Deep Learning GPU Training System
NVIDIA/Cosmos
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. Cosmos is purpose built for physical AI. The Cosmos repository will enable end users to run the Cosmos models, run inference scripts and generate videos.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
pytorch/torchtitan
A PyTorch native library for large model training
Kwai-Kolors/Kolors
Kolors Team
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
krahets/hello-algo
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
karpathy/llm.c
LLM training in simple, raw C/CUDA
karpathy/LLM101n
LLM101n: Let's build a Storyteller
andrewyng/translation-agent
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
NVIDIA/GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
below/HelloSilicon
An introduction to ARM64 assembly on Apple Silicon Macs
NVIDIA/TensorRT-Model-Optimizer
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.
huggingface/optimum-nvidia
datawhalechina/self-llm
《开源大模型食用指南》针对**宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
kelseyhightower/kubernetes-the-hard-way
Bootstrap Kubernetes the hard way. No scripts.
NVIDIA/NVPLSamples
NVIDIA Performance Libraries: Sample code
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
stas00/ml-engineering
Machine Learning Engineering Open Book
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
iamadamdev/bypass-paywalls-chrome
Bypass Paywalls web browser extension for Chrome and Firefox.
NVIDIA/NeMo-Framework-Launcher
Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
mshumer/gpt-llm-trainer
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence