Pinned Repositories
abseil-cpp
Abseil Common Libraries (C++)
ao
PyTorch native quantization and sparsity for training and inference
AutoDiffusion1
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
blis
BLAS-like Library Instantiation Software Framework
CLBlast
Tuned OpenCL BLAS
cpp-httplib
A C++ header-only HTTP/HTTPS server and client library
cpuinfo
CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)
cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
cutlass
CUDA Templates for Linear Algebra Subroutines
sihouzi21c's Repositories
sihouzi21c/abseil-cpp
Abseil Common Libraries (C++)
sihouzi21c/ao
PyTorch native quantization and sparsity for training and inference
sihouzi21c/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
sihouzi21c/cpp-httplib
A C++ header-only HTTP/HTTPS server and client library
sihouzi21c/cpuinfo
CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)
sihouzi21c/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
sihouzi21c/cutlass
CUDA Templates for Linear Algebra Subroutines
sihouzi21c/executorch
On-device AI across mobile, embedded and edge for PyTorch
sihouzi21c/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
sihouzi21c/fbjni
A library designed to simplify the usage of the Java Native Interface
sihouzi21c/flatbuffers
FlatBuffers: Memory Efficient Serialization Library
sihouzi21c/fmt
A modern formatting library
sihouzi21c/googletest
GoogleTest - Google Testing and Mocking Framework
sihouzi21c/ideep
Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.
sihouzi21c/json
JSON for Modern C++
sihouzi21c/kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
sihouzi21c/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models
sihouzi21c/mimalloc
mimalloc is a compact general purpose allocator with excellent performance.
sihouzi21c/NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
sihouzi21c/onnx
Open standard for machine learning interoperability
sihouzi21c/opentelemetry-cpp
The OpenTelemetry C++ Client
sihouzi21c/protobuf
Protocol Buffers - Google's data interchange format
sihouzi21c/pybind11
Seamless operability between C++11 and Python
sihouzi21c/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
sihouzi21c/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
sihouzi21c/sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
sihouzi21c/TMPQ-DM
sihouzi21c/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
sihouzi21c/VulkanMemoryAllocator
Easy to integrate Vulkan memory allocation library
sihouzi21c/XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web