Tom-CaoZH

Interested in KV stores, caching, disaggregated memory (RDMA and CXL), and LLMs.

Xidian University, China

Pinned Repositories

SAS-Cache
[MSST '24] SAS-Cache: A Semantic-Aware Secondary Cache for LSM-based Key-Value Stores
6.s081_20fa
6.s081_21fa
CMU15-213
my solutions to CMU15-213 (updating)
Language: C
CS106B
my sections and homework for the Stanford class
Language: Roff
cs106L
Language: C++
cs50_ai
to record my study in cs50_ai class
cuckoofilter
Language: C++
CXL-101
Contains some materials about CXL.
xalloc
This lib is used to allocate normal DRAM-based memory and CXL-based memory using Rust.
Language: Rust

Tom-CaoZH's Repositories

Tom-CaoZH/CXL-101
Contains some materials about CXL.
Tom-CaoZH/TinyDB
Just a very simple database
Language: C++
Tom-CaoZH/xalloc
This lib is used to allocate normal DRAM-based memory and CXL-based memory using Rust.
Language: Rust
Tom-CaoZH/cuckoofilter
Language: C++
Tom-CaoZH/curve
Curve is a high-performance, lightweight-operation, cloud-native open source distributed storage system. Curve can be applied to: 1) mainstream cloud-native infrastructure platforms OpenStack and Kubernetes; 2) high-performance storage for cloud-native databases; 3) cloud storage middleware using S3-compatible object storage as a data storage.
Language: C++
Tom-CaoZH/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
Language: Python
Tom-CaoZH/paper_readings
Keeps track of the papers I have read and plan to read
Tom-CaoZH/DistServe
Disaggregated serving system for Large Language Models (LLMs).
Language: Jupyter Notebook
Tom-CaoZH/Ditto
This is the implementation repository of our SOSP'23 paper: Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System.
Language: C++
Tom-CaoZH/LearningOS_Record
Record of my daily progress while learning os-comp2022-winter
Tom-CaoZH/Leetcode
my solutions to some leetcode problems
Language: C++
Tom-CaoZH/LevelDBRead
Notes taken while reading the LevelDB source code
Language: C++
Tom-CaoZH/LLaMA-Factory
Unify Efficient Fine-Tuning of 100+ LLMs
Language: Python
Tom-CaoZH/llama.cpp
LLM inference in C/C++
Language: C++
Tom-CaoZH/memkind
Memkind is an easy-to-use, general-purpose allocator which helps to fully utilize various kinds of memory available in the system, including DRAM, NVDIMM, and HBM
Language: C
Tom-CaoZH/mit_6.824
to record my study of mit 6.824
Language: Go
Tom-CaoZH/notes-pictures
Tom-CaoZH/OncoMatcher
Language: Python
Tom-CaoZH/opendal
OpenDAL: Access data freely, painlessly, and efficiently
Language: Rust
Tom-CaoZH/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Language: C
Tom-CaoZH/RocksDBRead
Notes taken while reading the RocksDB source code
Tom-CaoZH/rpc_imp
implement an RPC framework in Go, just for practice
Language: Go
Tom-CaoZH/runc
CLI tool for spawning and running containers according to the OCI specification
Language: Go
Tom-CaoZH/TensorRT
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
Language: C++
Tom-CaoZH/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++
Tom-CaoZH/tests
Language: C++
Tom-CaoZH/Tom-CaoZH.github.io
This is my homepage.
Language: Python
Tom-CaoZH/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python
Tom-CaoZH/XD_EE_DSA_2022
my solutions to XDU EE data structures and algorithms
Language: C++
Tom-CaoZH/zenfs
ZenFS is a storage backend for RocksDB that enables support for ZNS SSDs and SMR HDDs.
Language: C++