jasonsi1993's Stars
simd-everywhere/simde
Implementations of SIMD instruction sets for systems which don't natively support them.
FMInference/FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
jasonppy/VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Stability-AI/generative-models
Generative Models by Stability AI
ggerganov/llama.cpp
LLM inference in C/C++
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ENOT-AutoDL/onnx2torch
Convert ONNX models to PyTorch.
onnx/onnx-mlir
Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
microsoft/Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
OpenPPL/ppl.nn
A primitive library for neural network
ZhangGe6/onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
DLTcollab/sse2neon
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
oneapi-src/oneDNN
oneAPI Deep Neural Network Library (oneDNN)
herumi/xbyak
A JIT assembler for x86/x64 architectures supporting MMX, SSE (1-4), AVX (1-2, 512), FPU, APX, and AVX10.2
llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
openpifpaf/openpifpaf
Official implementation of "OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association" in PyTorch.
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
LeChangCS/Outdoor-Gear-Master
A person-to-person rental web app