zhuzilin

☀️ SDE @Tencent WeChat AI, focusing on MLSys

Tencent, Beijing

Pinned Repositories

OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language: Python 2.1k 21 249206
bun
Incredibly fast JavaScript runtime, bundler, test runner, and package manager – all in one
Language: Zig 73.4k 607 8.8k 2.7k
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language: Python 82.5k 1.7k 45.3k 22.2k
tensorflow
An Open Source Machine Learning Framework for Everyone
Language: C++ 186k 7.6k 39.8k 74.2k
es
A JavaScript interpreter from scratch, supporting ES5 syntax.
Language: C++ 25 5 26
faster-nougat
An implementation of Nougat that focuses on processing PDFs locally.
Language: Python 69 4 12
NP_ML
A tool library of classical machine learning algorithms with only numpy.
Language: Python 222 13 269
pdf-with-its-own-md5
A PDF template that contains its own MD5!
Language: TeX 36 3 03
ring-flash-attention
Ring attention implementation with flash attention
Language: Python 538 10 3241
whisper-openvino
OpenVINO version of openai/whisper
Language: Jupyter Notebook 158 6 014

zhuzilin's Repositories

zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
Language: Python 538 10 3241
zhuzilin/whisper-openvino
OpenVINO version of openai/whisper
Language: Jupyter Notebook 158 6 014
zhuzilin/faster-nougat
An implementation of Nougat that focuses on processing PDFs locally.
Language: Python 69 4 12
zhuzilin/pdf-with-its-own-md5
A PDF template that contains its own MD5!
Language: TeX 36 3 03
zhuzilin/es
A JavaScript interpreter from scratch, supporting ES5 syntax.
Language: C++ 25 5 26
zhuzilin/chatgpt-desktop
Desktop version of ChatGPT; supports manually setting the cookie
Language: JavaScript 16 2 13
zhuzilin/aqt-pytorch
Language: Python 7
zhuzilin/wandb-discord-bot
A Discord bot for monitoring wandb projects and runs.
Language: JavaScript 6 2 10
zhuzilin/blog
my blog~
Language: JavaScript 2 3 0
zhuzilin/llama
Inference code for LLaMA models
Language: Python 2 1 0
zhuzilin/torchrec_mapper
Language: C++ 2 3 0
zhuzilin/zhuzilin
2 3 0
zhuzilin/electron-fc
An Electron-based Famicom (NES) emulator
Language: JavaScript 1 2 0
zhuzilin/flash-attention
Fast and memory-efficient exact attention
Language: Python 1 1 0
zhuzilin/nvcc_daxpy_example
Language: C++ 1 3 0
zhuzilin/scattermoe
Triton-based implementation of Sparse Mixture of Experts.
Language: Python 1
zhuzilin/triton
Development repository for the Triton language and compiler
Language: C++ 1 1 0
zhuzilin/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
1
zhuzilin/base64-img
Language: Python 2 0
zhuzilin/bun
Incredibly fast JavaScript runtime, bundler, transpiler and package manager – all in one.
Language: Zig 1 0
zhuzilin/chibicc
A small C compiler
Language: C 2 0
zhuzilin/FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Language: C++ 1 0
zhuzilin/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
Language: Python 2 0
zhuzilin/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
Language: Cuda 0 0
zhuzilin/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
Language: Python 1 0
zhuzilin/megablocks
Language: Python
zhuzilin/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language: Python
zhuzilin/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language: C++ 2 0
zhuzilin/qiskit-translations
Home of Qiskit documentation translations
Language: Shell 2 0
zhuzilin/torchrec
PyTorch domain library for recommendation systems
Language: Python 1 0