zyt1024's Stars
THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
qiuapeng921/DnfHelper-Java
Java-地下城与勇士-dnf工具
1995chen/dnf
qiuapeng921/dnf
HLSTransform/submission
Xilinx/Alveo-PYNQ
Introductory examples for using PYNQ with Alveo
Gabriele-bot/ALVEO-PYNQ_ML
Neural network inferences on Alveo cards with hls4ml framework
selwyn96/Alveo-tutorial
Tutorial for deploying models on Alveo boards
IntelLabs/distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
yoshitomo-matsubara/torchdistill
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.
Jai2500/particlenet
A simple Implementation of ParticleNet in Pytorch Geometric
pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
suisuisi/FPGAandCNN
基于FPGA的数字识别-实时视频处理的定点卷积神经网络实现
QShen3/CNN-FPGA
使用Verilog实现的CNN模块,可以方便的在FPGA项目中使用
Z-Siqi/Clash-for-Windows_Chinese
clash for windows汉化版. 提供clash for windows的汉化版, 汉化补丁及汉化版安装程序
Light-City/CPlusPlusThings
C++那些事
Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
li-plus/chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
wangzhaode/mnn-llm
llm deploy project based mnn.
chenzomi12/AISystem
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
dhm2013724/yolov2_xilinx_fpga
A demo for accelerating YOLOv2 in xilinx's fpga pynq/zedboard
heymesut/SJTU_microe
A FPGA-based neural network inference accelerator, which won the third place in DAC-SDC
jgoeders/dac_sdc_2020_designs
Designs for finalist teams of the DAC System Design Contest
Yang-YiFan/DiracDeltaNet
PyTorch implementation of DiracDeltaNet from paper Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs
DNN-Accelerators/Open-Source-IPs
sharc-lab/DGNN-Booster