Pinned Repositories
adjustVideo2FitAudio
cupy
NumPy & SciPy for GPU
FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
Game-Programmer-Study-Notes
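:anchor: A collection of reading notes from my career as a game programmer. You can think of it as an enhanced blog. Covers graphics, real-time rendering, programming practice, GPU programming, design patterns, software engineering, and more. Keep Reading, Keep Writing, Keep Coding.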
hiq
HiQ - A Modern Observability System
huggingbench
Find the optimal model serving solution for 🤗 Hugging Face models 🚀
jianbo27
jianbo27.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
llm-transparency-tool
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
FlexGen-Extension
Modifications and enhancements to the original FlexGen.
jianbo27's Repositories
jianbo27/adjustVideo2FitAudio
jianbo27/cupy
NumPy & SciPy for GPU
jianbo27/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
jianbo27/Game-Programmer-Study-Notes
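:anchor: A collection of reading notes from my career as a game programmer. You can think of it as an enhanced blog. Covers graphics, real-time rendering, programming practice, GPU programming, design patterns, software engineering, and more. Keep Reading, Keep Writing, Keep Coding.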
jianbo27/hiq
HiQ - A Modern Observability System
jianbo27/huggingbench
Find the optimal model serving solution for 🤗 Hugging Face models 🚀
jianbo27/jianbo27
jianbo27/jianbo27.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
jianbo27/llm-transparency-tool
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
jianbo27/lsd
The next gen ls command
jianbo27/minas
Framework to manage memory affinity in large scale hierarchical shared memory multi-core platforms
jianbo27/mojo
The Mojo Programming Language
jianbo27/pChase
Pointer-chasing memory benchmark; I want to improve it.
jianbo27/diloco-sim
jianbo27/efficient-llm.cpp
jianbo27/numa_memory_latency
A very simple tool to measure memory latency in a NUMA environment
jianbo27/OLMo
Modeling, training, eval, and inference code for OLMo
jianbo27/pcm
Intel® Performance Counter Monitor (Intel® PCM)
jianbo27/phxpaxos
The Paxos library implemented in C++ that has been used in the WeChat production environment.
jianbo27/scheduler-plugins
Repository for out-of-tree scheduler plugins based on scheduler framework.
jianbo27/volcano
A Kubernetes Native Batch System (Project under CNCF)