thu-pacman
Parallel Architecture & Compiler technology of Mobile, Accelerated, and Networked systems
Beijing, China
Pinned Repositories
chitu
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
FasterMoE
GeminiGraph
A computation-centric distributed graph processing system.
GridGraph
Out-of-core graph processing on a single machine.
gscholar-citations-crawler
Crawl all your citations from Google Scholar
HyQuas
A hybrid partitioner based quantum circuit simulation system on GPU
LiveGraph
LiveGraph: a transactional graph storage system with purely sequential adjacency list scans
PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
SmartMoE-AE
ATC23 AE
TriCache
A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs
thu-pacman's Repositories
thu-pacman/chitu
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
thu-pacman/GeminiGraph
A computation-centric distributed graph processing system.
thu-pacman/GridGraph
Out-of-core graph processing on a single machine.
thu-pacman/PET
PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections
thu-pacman/FasterMoE
thu-pacman/TriCache
A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs
thu-pacman/gscholar-citations-crawler
Crawl all your citations from Google Scholar
thu-pacman/LiveGraph
LiveGraph: a transactional graph storage system with purely sequential adjacency list scans
thu-pacman/HyQuas
A hybrid partitioner based quantum circuit simulation system on GPU
thu-pacman/SmartMoE-AE
ATC23 AE
thu-pacman/GraphPi
thu-pacman/RisGraph
RisGraph: A Real-Time Streaming System for Evolving Graphs to Support Sub-millisecond Per-update Analysis at Millions Ops/s
thu-pacman/Spindle
thu-pacman/lab-guide
Everything about PACMAN!
thu-pacman/VAPRO
Light-weight Performance Variance Detection for Production-run Parallel Applications
thu-pacman/UniQ
UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation
thu-pacman/PerFlow
Domain-specific framework for performance analysis of parallel programs
thu-pacman/self-checkpoint
An in-memory checkpoint method using less space.
thu-pacman/AIPerf
thu-pacman/mpi-profiler
A simple and easy-to-use profiler for MPI programs. It profiles CPU time and MPI time for each process. No source code modification is needed; just re-link the program with this library.
thu-pacman/AIPerf-MoE
MoE Model Benchmark of AIPerf
thu-pacman/LiveGraph-Binary
LiveGraph: a transactional graph storage system with purely sequential adjacency list scans
thu-pacman/CYPRESS
CYPRESS: Combining Static and Dynamic Analysis for Top-Down Communication Trace Compression
thu-pacman/Mat2Stencil
A Modular Matrix-Based DSL for Explicit and Implicit Matrix-Free PDE Solvers on Structured Grid.
thu-pacman/tprint
tprint is a printing library specially designed for the SW architecture. It currently provides C and Fortran APIs.
thu-pacman/thrill
Thrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++
thu-pacman/Uberun
Spread-n-Share: Improving Application Performance and Cluster Throughput with Resource-aware Job Placement
thu-pacman/environment-eaglecitrine-production
thu-pacman/environment-eaglecitrine-staging
thu-pacman/husky
A more expressive and, most importantly, more efficient system for distributed data analytics.