Pinned Repositories
ollama: Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
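To illustrate, here is a minimal sketch that builds a request body for ollama's local HTTP generate endpoint and only prints it; the model tag and prompt are placeholder assumptions (any model pulled with `ollama pull` works), and actually sending it requires a running ollama server.

```python
import json

# Sketch of a non-streaming text-generation request for ollama's local
# HTTP API (the server listens on http://localhost:11434 by default).
payload = {
    "model": "llama3.3",               # assumption: any pulled model tag works
    "prompt": "Why is the sky blue?",  # placeholder prompt
    "stream": False,                   # one JSON object instead of a token stream
}
body = json.dumps(payload)

# To actually send it (requires a running ollama server):
#   import urllib.request
#   req = urllib.request.Request("http://localhost:11434/api/generate",
#                                data=body.encode(), method="POST")
#   print(json.load(urllib.request.urlopen(req))["response"])
print(body)
```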
auto-round: SOTA weight-only quantization algorithm for LLMs; the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs".
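The signed-gradient-descent rounding idea named in that paper title can be sketched in a few lines of NumPy. This is a toy illustration, not the repository's implementation: the tensor shapes, 4-bit grid, learning rate, and straight-through gradient below are all illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(8, 8))        # weights to quantize
X = rng.normal(size=(8, 32))       # calibration activations
scale = np.abs(W).max() / 7.0      # symmetric 4-bit grid, integer levels -8..7

def dequant(V):
    # Quantize with a learnable rounding offset V, then dequantize.
    q = np.clip(np.round(W / scale + V), -8, 7)
    return q * scale

def loss(V):
    # Mean squared error of the quantized layer's outputs.
    d = dequant(V) @ X - W @ X
    return float((d * d).mean())

V = np.zeros_like(W)               # rounding offsets, kept in [-0.5, 0.5]
best_V, best = V.copy(), loss(V)   # V = 0 is plain round-to-nearest
lr = 0.01
for _ in range(300):
    d = dequant(V) @ X - W @ X
    # Straight-through estimator: pretend round() is the identity.
    grad = (d @ X.T) * scale
    V = np.clip(V - lr * np.sign(grad), -0.5, 0.5)   # SignSGD step
    if loss(V) < best:
        best_V, best = V.copy(), loss(V)
# By construction, `best` is never worse than round-to-nearest (V = 0).
```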
bitsandbytes: 8-bit CUDA functions for PyTorch.
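As a rough illustration of what 8-bit tensor quantization looks like, here is a blockwise absmax int8 round trip in NumPy. This is a simplified sketch, not the library's code (which implements such kernels in CUDA); the block size of 64 is an assumption for the example.

```python
import numpy as np

def quantize_blockwise(x, block=64):
    """Absmax int8 quantization per block: each block of `block` values is
    scaled by its own max magnitude, then rounded onto the int8 grid."""
    x = x.reshape(-1, block)
    absmax = np.abs(x).max(axis=1, keepdims=True)
    absmax[absmax == 0] = 1.0                      # avoid division by zero
    q = np.round(x / absmax * 127).astype(np.int8)
    return q, absmax

def dequantize_blockwise(q, absmax):
    # Invert the scaling; the result differs from the input by at most
    # half a quantization step per element.
    return (q.astype(np.float32) / 127.0) * absmax

x = np.random.default_rng(0).normal(size=256).astype(np.float32)
q, absmax = quantize_blockwise(x)
x_hat = dequantize_blockwise(q, absmax).reshape(-1)
```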
data-parallel-CPP: Source code for 'Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL' by James Reinders, Ben Ashbaugh, James Brodman, Michael Kinsner, John Pennycook, Xinmin Tian (Apress, 2020).
intel-extension-for-pytorch: A Python package that extends the official PyTorch to easily obtain extra performance on Intel platforms.
neural-speed
oneDNN: oneAPI Deep Neural Network Library (oneDNN).
zhewang1-intc's Repositories
zhewang1-intc/ollama
zhewang1-intc/auto-round
zhewang1-intc/bitsandbytes
zhewang1-intc/data-parallel-CPP
zhewang1-intc/intel-extension-for-pytorch
zhewang1-intc/neural-speed
zhewang1-intc/oneDNN