Pinned Repositories
llama.cpp
LLM inference in C/C++
intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
llvm
Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.
neural-speed
An innovative library for efficient LLM inference via low-bit quantization
onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
luoyu-intel's Repositories
luoyu-intel/llama.cpp
LLM inference in C/C++
luoyu-intel/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator