Marks101

German Aerospace Center (DLR)

Pinned Repositories

conan-center-index
Recipes for the ConanCenter repository
Language: Python 950 20 5.1k 1.7k
conan-center-index
Recipes for the ConanCenter repository
Language: Python 0 0
Megatron-LM
Ongoing research training transformer models at scale
Language: Python 0 0
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ 0 0
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language: Python 0 0
Megatron-LM
Ongoing research training transformer models at scale
Language: Python 10.1k 163 726 2.3k
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ 8.3k 88 1.8k 920
TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language: Python 1.8k 35 323 308
triton
Development repository for the Triton language and compiler
Language: C++ 12.9k 191 1.4k 1.6k

Marks101's Repositories

Marks101/conan-center-index
Recipes for the ConanCenter repository
Language: Python 0 0
Marks101/Megatron-LM
Ongoing research training transformer models at scale
Language: Python 0 0
Marks101/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ 0 0
Marks101/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language: Python 0 0
