Pinned Repositories
cutlass
CUDA Templates for Linear Algebra Subroutines
DirectML
DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers, including all DirectX 12-capable GPUs from vendors such as AMD, Intel, NVIDIA, and Qualcomm.
kubeedge
Kubernetes Native Edge Computing Framework (project under CNCF)
LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
llama.cpp
LLM inference in C/C++
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
mlrun
Machine Learning automation and tracking
neural-speed
An innovative library for efficient LLM inference via low-bit quantization
intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms ⚡
anthony-intel's Repositories
anthony-intel/kubeedge
Kubernetes Native Edge Computing Framework (project under CNCF)
anthony-intel/llama.cpp
LLM inference in C/C++
anthony-intel/cutlass
CUDA Templates for Linear Algebra Subroutines
anthony-intel/DirectML
DirectML is a high-performance, hardware-accelerated DirectX 12 library for machine learning. DirectML provides GPU acceleration for common machine learning tasks across a broad range of supported hardware and drivers, including all DirectX 12-capable GPUs from vendors such as AMD, Intel, NVIDIA, and Qualcomm.
anthony-intel/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
anthony-intel/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
anthony-intel/mlrun
Machine Learning automation and tracking
anthony-intel/neural-speed
An innovative library for efficient LLM inference via low-bit quantization