GuWei007's Stars
triton-lang/triton
Development repository for the Triton language and compiler
pytorch/torchtune
A Native-PyTorch Library for LLM Fine-tuning
pytorch/torchtitan
A native PyTorch Library for large model training
NVIDIA/nccl
Optimized primitives for collective multi-GPU communication
pytorch/tutorials
PyTorch tutorials.
linux-test-project/lcov
LCOV
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
pytorch/ao
Create and integrate custom data types, layouts and kernels with up to 2x speedups with 65% less VRAM for inference and training
albanD/pytorch_dev_env_setup
TimDettmers/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
NVIDIA/nccl-tests
NCCL Tests
PlexPt/awesome-chatgpt-prompts-zh
A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to get it to do what you ask.
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
NVIDIA/NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
pytorch/torchrec
PyTorch domain library for recommendation systems
rusty1s/pytorch_scatter
PyTorch Extension Library of Optimized Scatter Operations
pytorch/test-infra
This repository hosts code that supports the testing infrastructure for the main PyTorch repo. For example, this repo hosts the logic to track disabled tests and slow tests, as well as our continuous integration jobs HUD/dashboard.
THUDM/ChatGLM2-6B
ChatGLM2-6B: An Open Bilingual Chat LLM
bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
NVIDIA/DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Ascend/AscendSpeed
liu-jianhao/Cpp-Design-Patterns
C++ Design Patterns
forthespada/InterviewGuide
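🔥🔥 "InterviewGuide" is A Xiu's record of years of self-taught computer science study, from campus to the workplace, together with a collection of articles by fellow students summarizing their campus and autumn recruitment experiences. It includes, but is not limited to, study notes on C/C++, Golang, JavaScript, Vue, operating systems, data structures, computer networks, MySQL, and Redis. Keep learning, keep growing!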
doocs/leetcode
🔥LeetCode solutions in any programming language | Solutions in multiple programming languages to LeetCode, "Coding Interviews" (剑指 Offer, 2nd Edition), and "Cracking the Coding Interview" (6th Edition)
MegEngine/MegEngine
MegEngine is a fast, scalable, easy-to-use deep learning framework with support for automatic differentiation
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible