sihouzi21c

Pinned Repositories

abseil-cpp
Abseil Common Libraries (C++)
Language:C++0 0 00
ao
PyTorch native quantization and sparsity for training and inference
Language:Python0 0 00
AutoDiffusion1
0 0 00
AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Language:Python00
blis
BLAS-like Library Instantiation Software Framework
Language:C0 0 00
CLBlast
Tuned OpenCL BLAS
Language:C++0 0 00
cpp-httplib
A C++ header-only HTTP/HTTPS server and client library
Language:C++00
cpuinfo
CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)
Language:C00
cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
Language:C++00
cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++00

sihouzi21c's Repositories

sihouzi21c/abseil-cpp
Abseil Common Libraries (C++)
Language:C++0 0 00
sihouzi21c/ao
PyTorch native quantization and sparsity for training and inference
Language:Python0 0 00
sihouzi21c/AutoGPTQ
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
Language:Python00
sihouzi21c/cpp-httplib
A C++ header-only HTTP/HTTPS server and client library
Language:C++00
sihouzi21c/cpuinfo
CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)
Language:C00
sihouzi21c/cudnn-frontend
cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it
Language:C++00
sihouzi21c/cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++00
sihouzi21c/executorch
On-device AI across mobile, embedded and edge for PyTorch
sihouzi21c/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Language:C++0 0
sihouzi21c/fbjni
A library designed to simplify the usage of the Java Native Interface
Language:C++0 0
sihouzi21c/flatbuffers
FlatBuffers: Memory Efficient Serialization Library
sihouzi21c/fmt
A modern formatting library
Language:C++0 0
sihouzi21c/googletest
GoogleTest - Google Testing and Mocking Framework
sihouzi21c/ideep
Intel® Optimization for Chainer*, a Chainer module providing numpy like API and DNN acceleration using MKL-DNN.
Language:C++0 0
sihouzi21c/json
JSON for Modern C++
sihouzi21c/kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
sihouzi21c/LLM4Decompile
Reverse Engineering: Decompiling Binary Code with Large Language Models
sihouzi21c/mimalloc
mimalloc is a compact general purpose allocator with excellent performance.
Language:C0 0
sihouzi21c/NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resources in your applications.
Language:C0 0
sihouzi21c/onnx
Open standard for machine learning interoperability
sihouzi21c/opentelemetry-cpp
The OpenTelemetry C++ Client
Language:C++0 0
sihouzi21c/protobuf
Protocol Buffers - Google's data interchange format
Language:C++0 0
sihouzi21c/pybind11
Seamless operability between C++11 and Python
Language:C++0 0
sihouzi21c/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python0 0
sihouzi21c/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Language:C++0 0
sihouzi21c/sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
Language:C0 0
sihouzi21c/TMPQ-DM
Language:Python1 0
sihouzi21c/torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
Language:Python0 0
sihouzi21c/VulkanMemoryAllocator
Easy to integrate Vulkan memory allocation library
Language:C0 0
sihouzi21c/XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web