zyt1024

zyt1024's Stars

THUDM/CodeGeeX
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Language:Python8.2k598
qiuapeng921/DnfHelper-Java
Java-地下城与勇士-dnf工具
Language:Java818
1995chen/dnf
Language:Shell1k334
qiuapeng921/dnf
32
HLSTransform/submission
Language:C647
Xilinx/Alveo-PYNQ
Introductory examples for using PYNQ with Alveo
Language:Jupyter Notebook4817
Gabriele-bot/ALVEO-PYNQ_ML
Neural network inferences on Alveo cards with hls4ml framework
Language:Ada71
selwyn96/Alveo-tutorial
Tutorial for deploying models on Alveo boards
Language:Jupyter Notebook52
IntelLabs/distiller
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
Language:Jupyter Notebook4.3k799
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Language:Python2.2k252
yoshitomo-matsubara/torchdistill
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring the reproducibiliy and benchmark.
Language:Python1.4k131
Jai2500/particlenet
A simple Implementation of ParticleNet in Pytorch Geometric
Language:Python63
pyg-team/pytorch_geometric
Graph Neural Network Library for PyTorch
Language:Python21.1k3.6k
suisuisi/FPGAandCNN
基于FPGA的数字识别-实时视频处理的定点卷积神经网络实现
Language:Verilog26365
QShen3/CNN-FPGA
使用Verilog实现的CNN模块，可以方便的在FPGA项目中使用
Language:Verilog482107
Z-Siqi/Clash-for-Windows_Chinese
clash for windows汉化版. 提供clash for windows的汉化版, 汉化补丁及汉化版安装程序
Language:JavaScript20.4k2.7k
Light-City/CPlusPlusThings
C++那些事
Language:C++39k8.5k
Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Language:Cuda808127
PaddlePaddle/Paddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）
Language:C++22.1k5.6k
li-plus/chatglm.cpp
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
Language:C++2.9k334
apache/tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Language:Python11.7k3.5k
labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python54.4k5.6k
wangzhaode/mnn-llm
llm deploy project based mnn.
Language:C++1.4k156
chenzomi12/AISystem
AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Language:Jupyter Notebook10.6k1.5k
dhm2013724/yolov2_xilinx_fpga
A demo for accelerating YOLOv2 in xilinx's fpga pynq/zedboard
Language:C766230
heymesut/SJTU_microe
A FPGA-based neural network inference accelerator, which won the third place in DAC-SDC
Language:C284
jgoeders/dac_sdc_2020_designs
Designs for finalist teams of the DAC System Design Contest
Language:Objective-C3419
Yang-YiFan/DiracDeltaNet
PyTorch implementation of DiracDeltaNet from paper Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs
Language:Python3010
DNN-Accelerators/Open-Source-IPs
Language:C++3214
sharc-lab/DGNN-Booster
Language:Python152