0uMuMu0's Stars
meta-llama/llama
Inference code for Llama models
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
ray-project/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
huggingface/text-generation-inference
Large Language Model Text Generation Inference
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
facontidavide/PlotJuggler
The Time Series Visualization Tool that you deserve.
OpenDriveLab/UniAD
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
OpenDriveLab/End-to-end-Autonomous-Driving
[IEEE T-PAMI] All you need for End-to-end Autonomous Driving
catapult-project/catapult
Deprecated Catapult GitHub. Please instead use http://crbug.com "Speed>Benchmarks" component for bugs and https://chromium.googlesource.com/catapult for downloading and editing source code.
danijar/dreamerv3
Mastering Diverse Domains through World Models
ModelTC/MQBench
Model Quantization Benchmark
OpenGVLab/OmniQuant
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
SqueezeAILab/SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
opendilab/InterFuser
[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
mit-han-lab/qserve
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
NM512/dreamerv3-torch
Implementation of Dreamer v3 in PyTorch.
wayveai/mile
PyTorch code for the paper "Model-Based Imitation Learning for Urban Driving".
OpenDriveLab/TCP
[NeurIPS 2022] Trajectory-guided Control Prediction for End-to-end Autonomous Driving: A Simple yet Strong Baseline.
zhejz/carla-roach
Roach: End-to-End Urban Driving by Imitating a Reinforcement Learning Coach. ICCV 2021.
j96w/MimicPlay
"MimicPlay: Long-Horizon Imitation Learning by Watching Human Play" code repository
ikostrikov/rlpd
RupertLuo/Valley
The official repository of "Video assistant towards large language model makes everything easy"
OpenDriveLab/DriveAdapter
[ICCV 2023 Oral] A New Paradigm for End-to-end Autonomous Driving to Alleviate Causal Confusion
SJTU-ReArch-Group/Paper-Reading-List