Tom-CaoZH
Interested in KV Stores, Cache, Disaggregated Memory(RDMA and CXL) and LLM.
Xidian UniversityChina
Pinned Repositories
SAS-Cache
[MSST '24] SAS-Cache: A Semantic-Aware Secondary Cache for LSM-based Key-Value Stores
6.s081_20fa
6.s081_21fa
CMU15-213
my solutions to CMU15-213 (updating)
CS106B
my sections and homeworks of the standford class
cs106L
cs50_ai
to record my study in cs50_ai class
cuckoofilter
CXL-101
Contain some materials about CXL.
xalloc
This lib is used to allocate normal DRAM-based memory and CXL-based memory using Rust.
Tom-CaoZH's Repositories
Tom-CaoZH/CXL-101
Contain some materials about CXL.
Tom-CaoZH/TinyDB
Just a very simple database
Tom-CaoZH/xalloc
This lib is used to allocate normal DRAM-based memory and CXL-based memory using Rust.
Tom-CaoZH/cuckoofilter
Tom-CaoZH/curve
Curve is a high-performance, lightweight-operation, cloud-native open source distributed storage system. Curve can be applied to: 1) mainstream cloud-native infrastructure platforms OpenStack and Kubernetes; 2) high-performance storage for cloud-native databases; 3) cloud storage middleware using S3-compatible object storage as a data storage.
Tom-CaoZH/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
Tom-CaoZH/paper_readings
Keep track of the papers I have read and to be read
Tom-CaoZH/DistServe
Disaggregated serving system for Large Language Models (LLMs).
Tom-CaoZH/Ditto
This is the implementation repository of our SOSP'23 paper: Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System.
Tom-CaoZH/LearningOS_Record
Record my daily process when learning os-comp2022-winter
Tom-CaoZH/Leetcode
my solutions to some leetcode problems
Tom-CaoZH/LevelDBRead
To record some notes when I read the leveldb source code
Tom-CaoZH/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Tom-CaoZH/llama.cpp
LLM inference in C/C++
Tom-CaoZH/memkind
Memkind is an easy-to-use, general-purpose allocator which helps to fully utilize various kinds of memory available in the system, including DRAM, NVDIMM, and HBM
Tom-CaoZH/mit_6.824
to record my study of mit 6.824
Tom-CaoZH/notes-pictures
Tom-CaoZH/OncoMatcher
Tom-CaoZH/opendal
OpenDAL: Access data freely, painlessly, and efficiently
Tom-CaoZH/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Tom-CaoZH/RocksDBRead
To record some notes when I read the rocksdb source code
Tom-CaoZH/rpc_imp
implement a rpc framework using golang, just for exercise
Tom-CaoZH/runc
CLI tool for spawning and running containers according to the OCI specification
Tom-CaoZH/TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
Tom-CaoZH/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Tom-CaoZH/tests
Tom-CaoZH/Tom-CaoZH.github.io
This is my homepage.
Tom-CaoZH/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Tom-CaoZH/XD_EE_DSA_2022
my solution to XDU EE data structure and algorithm
Tom-CaoZH/zenfs
ZenFS is a storage backend for RocksDB that enables support for ZNS SSDs and SMR HDDs.