Pinned Repositories
M4U
Code for the Paper M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models.
awesome-detection-transformer
A collection of papers on Transformers for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
deep-learning-for-image-processing
Deep learning for image processing, including classification, object detection, etc.
detr
End-to-End Object Detection with Transformers
detrex
detrex is a research platform for Transformer-based Instance Recognition algorithms including DETR (ECCV 2020), Deformable-DETR (ICLR 2021), Conditional-DETR (ICCV 2021), DAB-DETR (ICLR 2022), DN-DETR (CVPR 2022), DINO (ICLR 2023), H-DETR (CVPR 2023), MaskDINO (CVPR 2023), etc.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
flash-attention
Fast and memory-efficient exact attention
gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
torchscale
Transformers at any scale
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
ustcwhy's Repositories
ustcwhy/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
ustcwhy/awesome-detection-transformer
A collection of papers on Transformers for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
ustcwhy/deep-learning-for-image-processing
Deep learning for image processing, including classification, object detection, etc.
ustcwhy/detr
End-to-End Object Detection with Transformers
ustcwhy/detrex
detrex is a research platform for Transformer-based Instance Recognition algorithms including DETR (ECCV 2020), Deformable-DETR (ICLR 2021), Conditional-DETR (ICCV 2021), DAB-DETR (ICLR 2022), DN-DETR (CVPR 2022), DINO (ICLR 2023), H-DETR (CVPR 2023), MaskDINO (CVPR 2023), etc.
ustcwhy/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
ustcwhy/flash-attention
Fast and memory-efficient exact attention
ustcwhy/gpt-fast
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
ustcwhy/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
ustcwhy/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
ustcwhy/MT-Reading-List
A machine translation reading list maintained by Tsinghua Natural Language Processing Group
ustcwhy/torchscale
Transformers at any scale
ustcwhy/JARVIS
JARVIS, a system to connect LLMs with the ML community
ustcwhy/LMOps
General technology for enabling AI capabilities w/ LLMs and MLLMs
ustcwhy/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
ustcwhy/OpenChatKit
ustcwhy/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
ustcwhy/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
ustcwhy/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
ustcwhy/ustcwhy.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
ustcwhy/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
ustcwhy/WorkingTime