Pinned Repositories
ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
bcc
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
chunk-attention
Deep-Learning-with-TensorFlow-book
深度学习入门开源书,基于TensorFlow 2.0案例实战。Open source Deep Learning book, based on TensorFlow 2.0 framework.
deeplearning-with-tensorflow-notes
龙曲良《TensorFlow深度学习》学习笔记及代码,采用TensorFlow2.0.0版本
dpdk
Data Plane Development Kit
ebooks-1
faiss
A library for efficient similarity search and clustering of dense vectors.
FasterTransformer
Transformer related optimization, including BERT, GPT
fastertransformer_backend
sniperxyp's Repositories
sniperxyp/ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
sniperxyp/bcc
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
sniperxyp/chunk-attention
sniperxyp/Deep-Learning-with-TensorFlow-book
深度学习入门开源书,基于TensorFlow 2.0案例实战。Open source Deep Learning book, based on TensorFlow 2.0 framework.
sniperxyp/deeplearning-with-tensorflow-notes
龙曲良《TensorFlow深度学习》学习笔记及代码,采用TensorFlow2.0.0版本
sniperxyp/dpdk
Data Plane Development Kit
sniperxyp/ebooks-1
sniperxyp/faiss
A library for efficient similarity search and clustering of dense vectors.
sniperxyp/FasterTransformer
Transformer related optimization, including BERT, GPT
sniperxyp/fastertransformer_backend
sniperxyp/gperftools
Main gperftools repository
sniperxyp/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
sniperxyp/HugeCTR
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
sniperxyp/fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
sniperxyp/jstorm
Java Storm
sniperxyp/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
sniperxyp/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
sniperxyp/LookaheadDecoding
sniperxyp/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
sniperxyp/redis
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, Streams, HyperLogLogs, Bitmaps.
sniperxyp/rocksdb
A library that provides an embeddable, persistent key-value store for fast storage.
sniperxyp/seastar
High performance server-side application framework
sniperxyp/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
sniperxyp/sniperxypgit
tes1
sniperxyp/spark
Mirror of Apache Spark
sniperxyp/test
sniperxyp/triton
Development repository for the Triton language and compiler
sniperxyp/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs