songhan
Song Han is an associate professor at MIT EECS and a distinguished scientist at NVIDIA. His research focuses on efficient AI computing.
MIT, NVIDIA
songhan's Stars
home-assistant/core
Open source home automation that puts local control and privacy first.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
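A minimal sketch of wrapping a model with DeepSpeed's public `deepspeed.initialize` API for ZeRO-style training; the config shows only common, well-known fields, and the tiny model is illustrative.

```python
import deepspeed
import torch.nn as nn

model = nn.Sequential(nn.Linear(1024, 4096), nn.ReLU(), nn.Linear(4096, 10))

# Minimal config: fp16 plus ZeRO stage 2 optimizer-state sharding.
ds_config = {
    "train_batch_size": 32,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)
# In the loop: loss = ...; engine.backward(loss); engine.step()
# Run under the launcher, e.g. `deepspeed train.py`.
```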
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
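The core trick is keeping a few initial "attention sink" tokens plus a sliding window of recent tokens in the KV cache. A toy sketch of that eviction policy (function and argument names are hypothetical, not the repo's API):

```python
import torch

def evict_kv(keys, values, n_sink=4, window=1020):
    # keys/values: [batch, heads, seq_len, head_dim]; hypothetical names.
    seq_len = keys.size(2)
    if seq_len <= n_sink + window:
        return keys, values  # cache still fits
    keep = torch.cat([
        torch.arange(n_sink, device=keys.device),                     # attention sinks
        torch.arange(seq_len - window, seq_len, device=keys.device),  # recent tokens
    ])
    return keys[:, :, keep], values[:, :, keep]
```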
gaogaotiantian/viztracer
VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.
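A quick usage sketch with VizTracer's documented context-manager API; the traced function is arbitrary:

```python
from viztracer import VizTracer

def fib(n):
    return n if n < 2 else fib(n - 1) + fib(n - 2)

# Trace a region and save a report; open it with `vizviewer fib.json`.
with VizTracer(output_file="fib.json"):
    fib(15)
```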
mit-han-lab/bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
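AWQ's idea is to protect salient weight channels, identified from activation magnitudes, by scaling them up before quantization and folding the inverse scale into the activations. A toy int8 sketch of that scaling (not the repo's API; AWQ itself targets grouped 4-bit quantization with a searched exponent):

```python
import torch

def awq_style_scale(weight, act_amax, alpha=0.5):
    # weight: [out_features, in_features]; act_amax: per-input-channel
    # max |activation|, shape [in_features]. Hypothetical names.
    s = act_amax.clamp(min=1e-5).pow(alpha)  # larger scale for salient channels
    w_scaled = weight * s                    # scale weight columns up
    step = w_scaled.abs().max() / 127        # int8 here for simplicity
    q = torch.clamp((w_scaled / step).round(), -127, 127)
    # At inference the matching activations are divided by s (or s is
    # folded into the previous op), so the layer's output is unchanged.
    return q, step, s
```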
mit-han-lab/temporal-shift-module
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
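TSM shifts a fraction of the channels one step along the time axis so a 2D CNN can exchange temporal information at zero extra FLOPs. A minimal sketch of the shift, mirroring the operation described in the paper:

```python
import torch

def temporal_shift(x, n_segments, fold_div=8):
    # x: [batch * time, channels, h, w]; shift 1/fold_div of channels
    # one step backward in time and another 1/fold_div one step forward.
    nt, c, h, w = x.shape
    x = x.view(nt // n_segments, n_segments, c, h, w)
    fold = c // fold_div
    out = torch.zeros_like(x)
    out[:, :-1, :fold] = x[:, 1:, :fold]                  # shift toward the past
    out[:, 1:, fold:2 * fold] = x[:, :-1, fold:2 * fold]  # shift toward the future
    out[:, :, 2 * fold:] = x[:, :, 2 * fold:]             # untouched channels
    return out.view(nt, c, h, w)
```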
mit-han-lab/once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
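Once-for-All trains one elastic network from which subnetworks of varying depth, width, and kernel size can be sliced without retraining. A toy sketch of sampling an elastic convolution (the repo additionally learns kernel transformation matrices and trains with progressive shrinking; names here are hypothetical):

```python
import torch.nn.functional as F

def sample_elastic_conv(full_weight, x, k=3, out_ch=16, in_ch=16):
    # full_weight: [C_out_max, C_in_max, 7, 7]. Take the leading
    # channels and the centered k x k window of the shared kernel.
    start = (full_weight.size(-1) - k) // 2
    w = full_weight[:out_ch, :in_ch, start:start + k, start:start + k]
    return F.conv2d(x, w, padding=k // 2)
```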
mit-han-lab/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
intel/intel-extension-for-pytorch
A Python package that extends official PyTorch to unlock additional performance on Intel platforms.
mit-han-lab/data-efficient-gans
[NeurIPS 2020] Differentiable Augmentation for Data-Efficient GAN Training
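The method applies the same differentiable augmentation to both real and generated images inside the GAN losses, so gradients still reach the generator. A sketch of the discriminator loss under this scheme, assuming a non-saturating logistic loss and an arbitrary differentiable `augment` policy (hypothetical names):

```python
import torch
import torch.nn.functional as F

def d_loss_with_diffaugment(D, real, fake, augment):
    # The SAME differentiable transform is applied to real and fake
    # images, e.g. a brightness jitter from the paper's color policy:
    # augment = lambda x: x + torch.rand(x.size(0), 1, 1, 1, device=x.device) - 0.5
    loss_real = F.softplus(-D(augment(real))).mean()
    loss_fake = F.softplus(D(augment(fake))).mean()
    return loss_real + loss_fake
```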
mit-han-lab/torchquantum
A PyTorch-based framework for quantum-classical simulation, quantum machine learning, quantum neural networks, and parameterized quantum circuits, with support for easy deployment on real quantum computers.
mit-han-lab/torchsparse
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
mit-han-lab/gan-compression
[CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
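SmoothQuant migrates quantization difficulty from activations to weights with per-channel factors s_j = max|X_j|^alpha / max|W_j|^(1-alpha). A sketch of computing those factors (not the repo's API):

```python
import torch

def smoothing_factors(act_amax, weight, alpha=0.5):
    # act_amax: per-input-channel max |X_j|; weight: [out, in].
    w_amax = weight.abs().amax(dim=0)
    s = act_amax.pow(alpha) / w_amax.pow(1 - alpha)
    return s.clamp(min=1e-5)

# Folding keeps the math exact: divide activations by s (usually fused
# into the preceding LayerNorm) and multiply the matching weight
# columns by s, so X @ W.T is unchanged while outliers are flattened.
```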
Efficient-Large-Model/VILA
VILA: a multi-image visual language model with training, inference, and evaluation recipes, deployable from cloud to edge (Jetson Orin and laptops).
mit-han-lab/anycost-gan
[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing
mit-han-lab/tinyengine
[NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning; [NeurIPS 2022] MCUNetV3: On-Device Training Under 256KB Memory
mit-han-lab/tinyml
mit-han-lab/pvcnn
[NeurIPS 2019, Spotlight] Point-Voxel CNN for Efficient 3D Deep Learning
mit-han-lab/TinyChatEngine
TinyChatEngine: On-Device LLM Inference Library
mit-han-lab/lite-transformer
[ICLR 2020] Lite Transformer with Long-Short Range Attention
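Long-Short Range Attention splits the channels between a global attention branch and a local convolution branch. A toy block in that spirit (the paper uses lightweight/dynamic convolutions; a plain Conv1d stands in here):

```python
import torch
import torch.nn as nn

class LSRABlock(nn.Module):
    # Toy block: half the channels go through global self-attention
    # (long range), half through a local conv (short range).
    def __init__(self, dim, heads=4, kernel=3):
        super().__init__()  # dim // 2 must be divisible by heads
        self.attn = nn.MultiheadAttention(dim // 2, heads, batch_first=True)
        self.conv = nn.Conv1d(dim // 2, dim // 2, kernel, padding=kernel // 2)

    def forward(self, x):  # x: [batch, seq, dim]
        a, c = x.chunk(2, dim=-1)
        a, _ = self.attn(a, a, a)                         # long-range branch
        c = self.conv(c.transpose(1, 2)).transpose(1, 2)  # short-range branch
        return torch.cat([a, c], dim=-1)
```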
mit-han-lab/spvnas
[ECCV 2020] Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution
mit-han-lab/amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
mit-han-lab/tiny-training
[NeurIPS 2022] On-Device Training Under 256KB Memory
mit-han-lab/offsite-tuning
Offsite-Tuning: Transfer Learning without Full Model
mit-han-lab/hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
mit-han-lab/inter-operator-scheduler
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
mit-han-lab/parallel-computing-tutorial
mit-han-lab/tinychat-tutorial