Pinned Repositories
deep-learning-pytorch-huggingface
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
DeepSpeedExamples
Example models using DeepSpeed
ds_repro_2746
tohtana.github.io
validate_zero
tohtana's Repositories
tohtana/tohtana.github.io
tohtana/validate_zero
tohtana/deep-learning-pytorch-huggingface
tohtana/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
tohtana/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
tohtana/DeepSpeedExamples
Example models using DeepSpeed
tohtana/ds_repro_2746
tohtana/ds_repro_4295
tohtana/flash-attention
Fast and memory-efficient exact attention
tohtana/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
tohtana/rannc
RaNNC is an automatic parallelization middleware used to train very large-scale neural networks.
tohtana/test_repos2
tohtana/repro_leaf_module
tohtana/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
tohtana/triton
Development repository for the Triton language and compiler
tohtana/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs