Pinned Repositories
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Awesome-GPU
Awesome resources for GPUs
basis-embedding
basis embedding: a product quantization based model compression method for language models.
pytorch-build-mpi-cuda
Docker image for pytorch with openmpi and cuda support
pytorch-learning
learning notes when learning the source code of pytorch
Pytorch-NCE
The Noise Contrastive Estimation for softmax output written in Pytorch
pytorch_memlab
Profiling and inspecting memory in pytorch
traj-lstm
pytorch's implementation of Layer Trajectory LSTM
gpustat
📊 A simple command-line utility for querying and monitoring GPU status
Stonesjtu's Repositories
Stonesjtu/pytorch_memlab
Profiling and inspecting memory in pytorch
Stonesjtu/basis-embedding
basis embedding: a product quantization based model compression method for language models.
Stonesjtu/Awesome-GPU
Awesome resources for GPUs
Stonesjtu/Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stonesjtu/AI-Chip
A list of ICs and IPs for AI, Machine Learning and Deep Learning.
Stonesjtu/AMD-GCN-ISA-for-CLRX
Syntax highlighting for AMD GCN ISA, specifically suitable for CLRadeonExtender (https://github.com/CLRX/CLRX-mirror).
Stonesjtu/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Stonesjtu/chisel3
Chisel 3: A Modern Hardware Design Language
Stonesjtu/FlameGraph
Stack trace visualizer
Stonesjtu/gprof2dot
Converts profiling output to a dot graph. (Add xtensa xt-gprof output format)
Stonesjtu/how-to-optimize-gemm
Stonesjtu/hydra
Hydra is a framework for elegantly configuring complex applications
Stonesjtu/InstantID
InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥
Stonesjtu/Interstellar-CNN-scheduler
Tool for optimize CNN blocking
Stonesjtu/litex
Build your hardware, easily!
Stonesjtu/llvm-tutorial
Stonesjtu/ModernCppStarter
🚀 Kick-start your C++! A template for modern C++ projects using CMake, CI, code coverage, clang-format, reproducible dependency management and much more.
Stonesjtu/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
Stonesjtu/Neural-Networks-on-Silicon
This is originally a collection of papers on neural network accelerators. Now it's more like my selection of research on deep learning and computer architecture.
Stonesjtu/onnx
Open standard for machine learning interoperability
Stonesjtu/optimas
Stonesjtu/picovoice
The end-to-end platform for building voice products at scale
Stonesjtu/PipeCNN
An OpenCL-based FPGA Accelerator for Convolutional Neural Networks
Stonesjtu/porcupine
On-device wake word detection powered by deep learning.
Stonesjtu/py-spy
Sampling profiler for Python programs
Stonesjtu/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Stonesjtu/roofline
Stonesjtu/setk
Tools for Speech Enhancement integrated with Kaldi
Stonesjtu/sru
Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755)
Stonesjtu/xls
XLS: Accelerated HW Synthesis