Huyuwei's Stars
cornell-zhang/GARNET
GARNET: Reduced-Rank Topology Learning for Robust and Scalable Graph Neural Networks
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
cornell-zhang/HiSparse
High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS
cornell-zhang/GraphLily
A graph linear algebra overlay
fishmingyu/RecModel
plasma-umass/scalene
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
memkind/memkind
Memkind is an easy-to-use, general-purpose allocator which helps to fully utilize various kinds of memory available in the system, including DRAM, NVDIMM, and HBM
garrettj403/SciencePlots
Matplotlib styles for scientific plotting
pythonprofilers/memory_profiler
Monitor Memory usage of Python code
NVSL/OptaneStudy
intel/ipmctl
awslabs/dgl-ke
High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
matiassingers/awesome-readme
A curated list of awesome READMEs
amazon-science/FeatGraph
alibaba/GraphScope
🔨 🍇 💻 🚀 GraphScope: A One-Stop Large-Scale Graph Computing System from Alibaba | 一站式图计算系统
Xtra-Computing/ThunderGP
HLS-based Graph Processing Framework on FPGAs
llvm/circt
Circuit IR Compilers and Tools
fastmachinelearning/hls4ml
Machine learning on FPGAs using HLS
KarypisLab/METIS
METIS - Serial Graph Partitioning and Fill-reducing Matrix Ordering
Hanjun-Dai/graph_comb_opt
Implementation of "Learning Combinatorial Optimization Algorithms over Graphs"
tgmattso/GraphBLAS
Materials for a GraphBLAS tutorial
tensor-compiler/taco-bench
Repository to reproduce the results from paper "The Tensor Algebra Compiler".
gunrock/graphblast
High-Performance Linear Algebra-based Graph Primitives on GPUs
facebookresearch/PyTorch-BigGraph
Generate embeddings from large-scale graph-structured data.
GraphIt-DSL/graphit
GraphIt - A High-Performance Domain Specific Language for Graph Analytics
tensorflow/lucid
A collection of infrastructure and tools for research in neural network interpretability.
tensor-compiler/taco
The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs