Pinned Repositories
algorithms4_cpp
C++ version of Robert Sedgewick's Algorithms4 book
aop-helper
基于 aspectjweaver AOP 实现的 Annotation Profiling 和 一些 HDFS 和 Spark helper 方法
automl
Google Brain AutoML
dp90219.github.io
incremental_spark_als_mf
incremental matrix factoration in spark
itsliupeng.github.io
Blog
online_http_pytorch
Gunicorn aiohttp PyTorch, A concurrent HTTP server in order to inference using PyTorch.
tf_nn_function
Using tf to build neural network just like pytorch
torchnvjpeg
Decode JPEG image on GPU using PyTorch
TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
itsliupeng's Repositories
itsliupeng/1Panel
🔥🔥🔥 Web-based linux server management control panel. / 现代化、开源的 Linux 服务器运维管理面板。
itsliupeng/BitNet
Official inference framework for 1-bit LLMs
itsliupeng/cfx-article-src
itsliupeng/cutlass
CUDA Templates for Linear Algebra Subroutines
itsliupeng/cutlass-kernels
itsliupeng/cutlass_learn
itsliupeng/EAGLE
Official Implementation of EAGLE-1 and EAGLE-2
itsliupeng/flash-attention
Fast and memory-efficient exact attention
itsliupeng/folotoy-doc
All Documents for FoloToys
itsliupeng/Friend
AI wearable necklace
itsliupeng/GPTs
leaked prompts of GPTs
itsliupeng/libflash_attn
Standalone Flash Attention v2 kernel without libtorch dependency
itsliupeng/marlin
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.
itsliupeng/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
itsliupeng/Megatron-Energon
Megatron's multi-modal data loader
itsliupeng/Megatron-LM
Ongoing research training transformer models at scale
itsliupeng/mochi-models
The best OSS video generation models
itsliupeng/mytv-android
使用Android原生开发的电视直播软件
itsliupeng/ncmdump
转换网易云音乐 ncm 到 mp3 / flac. Convert Netease Cloud Music ncm files to mp3/flac files.
itsliupeng/sarathi-serve
A low-latency & high-throughput serving engine for LLMs
itsliupeng/sensitive-word
👮♂️The sensitive word tool for java.(敏感词/违禁词/违法词/脏词。基于 DFA 算法实现的高性能 java 敏感词过滤工具框架。请勿发布涉及政治、广告、营销、翻墙、违反国家法律法规等内容。高性能敏感词检测过滤组件,附带繁体简体互换,支持全角半角互换,汉字转拼音,模糊搜索等功能。)
itsliupeng/sglang
SGLang is yet another fast serving framework for large language models and vision language models.
itsliupeng/snippet
snippet for code
itsliupeng/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
itsliupeng/stopwords
中文常用停用词表(哈工大停用词表、百度停用词表等)
itsliupeng/surya
OCR, layout analysis, reading order, table recognition in 90+ languages
itsliupeng/ThunderKittens
Tile primitives for speedy kernels
itsliupeng/triton
Development repository for the Triton language and compiler
itsliupeng/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
itsliupeng/voice-pro
The best gradio web-ui for ai transcription, translation and TTS. Automatic subtitle creation using faster whisper. Easy one click installation. Fully portable.