zijkx

Computer Architecture

Guangzhou, China

zijkx's Stars

mohammadasim98/Xilinx-DPUV3.0-Vivado-Proj
Deep Learning Processing Unit (DPU IP) integration with Application Processing Unit (APU) using (Zynq-7000 PS) in Xilinx Vivado Design Suite
Language:VHDL61
t-kuha/dpu
Xilinx DPU/DNNDK example; moved to https://github.com/t-kuha/vai
Language:C++57
sumilao/Zynq-7000-DPU-TRD
Zynq-7000 DPU TRD
Language:Verilog4518
ModelTC/MQBench
Model Quantization Benchmark
Language:Shell757137
CLUEbenchmark/SuperCLUE
SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese
2.9k94
sefaburakokcu/quantized-yolov5
Low Precision(quantized) Yolov5
Language:Python317
Xilinx/brevitas
Brevitas: neural network quantization in PyTorch
Language:Python1.2k192
ModelTC/llmc
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".
Language:Python24127
jeinlee1991/chinese-llm-benchmark
中文大模型能力评测榜单：目前已囊括115个大模型，覆盖chatgpt、gpt4o、百度文心一言、阿里通义千问、讯飞星火、商汤senseChat、minimax等商用模型，以及百川、qwen2、glm4、yi、书生internLM2、llama3等开源大模型，多维度能力评测。不仅提供能力评分排行榜，也提供所有模型的原始输出结果！
2.5k119
booniebears/CoMN
Language:C++9
Xilinx/PYNQ_Workshop
Language:Jupyter Notebook395158
sefaburakokcu/finn-quantized-yolo
Low-Precision YOLO on PYNQ with FINN
Language:Jupyter Notebook276
JinChen-tw/pynqz2_dpu140
This TRD is implement DPU v1.4.0 on PYNQ-Z2 board
Language:V4321
Xilinx/finn
Dataflow compiler for QNN inference on FPGAs
Language:Python723230
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
3.5k143
SET-Scheduling-Project/GEMINI-HPCA2024
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
Language:C++4710
SET-Scheduling-Project/SET-ISCA2023
The framework for the paper "Inter-layer Scheduling Space Definition and Exploration for Tiled Accelerators" in ISCA 2023.
Language:C++425
aliemo/transfomers-silicon-research
Research and Materials on Hardware implementation of Transformer Model
Language:Jupyter Notebook20229
OpenPPL/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Language:Python1.5k228
HewlettPackard/cacti
An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model
Language:C++392132
Accelergy-Project/accelergy-neurosim-plug-in
Language:C++4
mit-emze/raella
Language:Jupyter Notebook82
Accelergy-Project/accelergy-timeloop-infrastructure
Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop
Language:Dockerfile4426
fire717/movenet.pytorch
A Pytorch implementation of MoveNet from Google. Include training code and pre-trained model.
Language:Python37487
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
Language:C++5.8k889
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python13.6k1.3k
jeonsworld/ViT-pytorch
Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
Language:Jupyter Notebook1.9k364
doonny/basic_knowledge
Things to learn for new students in the Lab for AI chips and systems of BJTU .
Language:Python15633
soDLA-publishment/somnia
Learn NVDLA by SOMNIA
Language:Scala2611
stanford-mast/nn_dataflow
Explore the energy-efficient dataflow scheduling for neural networks.
Language:Python21482

zijkx

zijkx's Stars

mohammadasim98/Xilinx-DPUV3.0-Vivado-Proj

t-kuha/dpu

sumilao/Zynq-7000-DPU-TRD

ModelTC/MQBench

CLUEbenchmark/SuperCLUE

sefaburakokcu/quantized-yolov5

Xilinx/brevitas

ModelTC/llmc

jeinlee1991/chinese-llm-benchmark

booniebears/CoMN

Xilinx/PYNQ_Workshop

sefaburakokcu/finn-quantized-yolo

JinChen-tw/pynqz2_dpu140

Xilinx/finn

deepseek-ai/DeepSeek-V2

SET-Scheduling-Project/GEMINI-HPCA2024

SET-Scheduling-Project/SET-ISCA2023

aliemo/transfomers-silicon-research

OpenPPL/ppq

HewlettPackard/cacti

Accelergy-Project/accelergy-neurosim-plug-in

mit-emze/raella

Accelergy-Project/accelergy-timeloop-infrastructure

fire717/movenet.pytorch

NVIDIA/FasterTransformer

Dao-AILab/flash-attention

jeonsworld/ViT-pytorch

doonny/basic_knowledge

soDLA-publishment/somnia

stanford-mast/nn_dataflow