xuao1's Stars
torvalds/linux
Linux kernel source tree
huggingface/transformers
π€ Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Light-City/CPlusPlusThings
C++ι£δΊδΊ
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
slidevjs/slidev
Presentation Slides for Developers
blackmatrix7/ios_rule_script
εζ΅θ§εγιεεθ§εεθζ¬γ
NVIDIA/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
ccfddl/ccf-deadlines
β° Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features from various papers
flashinfer-ai/flashinfer
FlashInfer: Kernel Library for LLM Serving
CS-BAOYAN/CSSummerCamp2023
google-deepmind/open_x_embodiment
AmberLJC/LLMSys-PaperList
Large Language Model (LLM) Systems Paper List
opendilab/LMDrive
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
tkestack/vcuda-controller
wayveai/Driving-with-LLMs
PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"
envytools/envytools
Tools for people envious of nvidia's blob driver.
lucidrains/robotic-transformer-pytorch
Implementation of RT1 (Robotic Transformer) in Pytorch
shinpei0208/gdev
First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.
IrohXu/Awesome-Multimodal-LLM-Autonomous-Driving
[WACV 2024 Survey Paper] Multimodal Large Language Models for Autonomous Driving
NVlabs/NVBit
NTHU-LSALAB/KubeShare
Share GPU between Pods in Kubernetes
Bruce-Lee-LY/cuda_hook
Hooked CUDA-related dynamic libraries by using automated code generation tools.
lucidrains/mirasol-pytorch
Implementation of π» Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
NTHU-LSALAB/Gemini
An efficient GPU resource sharing system with fine-grained control for Linux platforms.
CPFL/gdev
First-Class GPU Resource Management: Device Drivers, Runtimes, and CUDA Compilers for Nouveau.
perceptionpoint/suprotect
Changing memory protection in an arbitrary process
sjtu-epcc/Laius
The source code of the paper"Laius: Towards Latency Awareness and Improved Utilization of Spatial Multitasking Accelerators in Datacenters" in ICS 2019.
ZSL98/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.