zijkx's Stars
mohammadasim98/Xilinx-DPUV3.0-Vivado-Proj
Deep Learning Processing Unit (DPU IP) integration with Application Processing Unit (APU) using (Zynq-7000 PS) in Xilinx Vivado Design Suite
t-kuha/dpu
Xilinx DPU/DNNDK example; moved to https://github.com/t-kuha/vai
sumilao/Zynq-7000-DPU-TRD
Zynq-7000 DPU TRD
ModelTC/MQBench
Model Quantization Benchmark
CLUEbenchmark/SuperCLUE
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
sefaburakokcu/quantized-yolov5
Low Precision(quantized) Yolov5
Xilinx/brevitas
Brevitas: neural network quantization in PyTorch
ModelTC/llmc
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
jeinlee1991/chinese-llm-benchmark
中文大模型能力评测榜单:目前已囊括115个大模型,覆盖chatgpt、gpt4o、百度文心一言、阿里通义千问、讯飞星火、商汤senseChat、minimax等商用模型, 以及百川、qwen2、glm4、yi、书生internLM2、llama3等开源大模型,多维度能力评测。不仅提供能力评分排行榜,也提供所有模型的原始输出结果!
booniebears/CoMN
Xilinx/PYNQ_Workshop
sefaburakokcu/finn-quantized-yolo
Low-Precision YOLO on PYNQ with FINN
JinChen-tw/pynqz2_dpu140
This TRD is implement DPU v1.4.0 on PYNQ-Z2 board
Xilinx/finn
Dataflow compiler for QNN inference on FPGAs
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
SET-Scheduling-Project/GEMINI-HPCA2024
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
SET-Scheduling-Project/SET-ISCA2023
The framework for the paper "Inter-layer Scheduling Space Definition and Exploration for Tiled Accelerators" in ISCA 2023.
aliemo/transfomers-silicon-research
Research and Materials on Hardware implementation of Transformer Model
OpenPPL/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
HewlettPackard/cacti
An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model
Accelergy-Project/accelergy-neurosim-plug-in
mit-emze/raella
Accelergy-Project/accelergy-timeloop-infrastructure
Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop
fire717/movenet.pytorch
A Pytorch implementation of MoveNet from Google. Include training code and pre-trained model.
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
jeonsworld/ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
doonny/basic_knowledge
Things to learn for new students in the Lab for AI chips and systems of BJTU .
soDLA-publishment/somnia
Learn NVDLA by SOMNIA
stanford-mast/nn_dataflow
Explore the energy-efficient dataflow scheduling for neural networks.