andoorve

Previously also: @aws-murandoo

CentML

Pinned Repositories

cupy
NumPy & SciPy for GPU
Language:Python00
cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++00
FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
Language:Python00
Neural-Net
Language:Python00
Neural-Net-2
An optimized version of Neural Net
Language:Python00
neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
Language:Python00
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python00
rfcs
PyTorch RFCs (experimental)
00
Tempo
Memory footprint reduction for transformer models
Language:Python00
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Language:Python01

andoorve's Repositories

andoorve/cupy
NumPy & SciPy for GPU
Language:Python00
andoorve/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++00
andoorve/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
Language:Python00
andoorve/Neural-Net
Language:Python00
andoorve/Neural-Net-2
An optimized version of Neural Net
Language:Python00
andoorve/neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
Language:Python00
andoorve/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python00
andoorve/rfcs
PyTorch RFCs (experimental)
00
andoorve/Tempo
Memory footprint reduction for transformer models
Language:Python00
andoorve/transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
Language:Python01
andoorve/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python00