Pinned Repositories
bagua
Bagua speeds up PyTorch.
bagua-net
High-performance NCCL plugin for Bagua.
ACM-ICPC-api-service
ACM-ICPC-frontend
blueprint-trainer
Scaffolding for sequence model training research.
c4-dataset-script
Inspired by Google's C4 dataset, a set of data-cleaning scripts for processing CommonCrawl, including the Chinese data processing and cleaning methods from MassiveText.
mamba-jax
megabyte
A PyTorch implementation of MEGABYTE. This multi-scale transformer architecture is tokenization-free and uses sub-quadratic attention. Paper: https://arxiv.org/abs/2305.07185
ocr_game
shu
Collection and curation of Chinese books.
shjwudp's Repositories
shjwudp/shu
Collection and curation of Chinese books.
shjwudp/c4-dataset-script
Inspired by Google's C4 dataset, a set of data-cleaning scripts for processing CommonCrawl, including the Chinese data processing and cleaning methods from MassiveText.
shjwudp/mamba-jax
shjwudp/megabyte
A PyTorch implementation of MEGABYTE. This multi-scale transformer architecture is tokenization-free and uses sub-quadratic attention. Paper: https://arxiv.org/abs/2305.07185
shjwudp/blueprint-trainer
Scaffolding for sequence model training research.
shjwudp/apex
A PyTorch extension: tools for easy mixed precision and distributed training in PyTorch
shjwudp/bagua-core
Core communication lib for Bagua.
shjwudp/BLOOM-COT
Ongoing research training transformer language models at scale, including: BERT & GPT-2
shjwudp/conversational-datasets
shjwudp/do-we-need-attention
shjwudp/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model
shjwudp/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
shjwudp/GPU-math
🤯 GPU math & benchmarks, branched from mli/transformers-benchmarks
shjwudp/grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
shjwudp/Huggingface-Model-Service
shjwudp/hyena-jax
JAX/Flax implementation of the Hyena Hierarchy
shjwudp/juicefs
JuiceFS is a distributed POSIX file system built on top of Redis and S3.
shjwudp/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
shjwudp/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
shjwudp/NeMo
NeMo: a toolkit for conversational AI
shjwudp/OptimalShardedDataParallel
An automated parallel training system that combines the advantages of data and model parallelism. If you are interested, please visit/star/fork https://github.com/Youhe-Jiang/OptimalShardedDataParallel
shjwudp/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
shjwudp/S5
shjwudp/safari
Convolutions for Sequence Modeling
shjwudp/shjwudp.github.io
shjwudp/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
shjwudp/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
shjwudp/Titans
A collection of models built with ColossalAI
shjwudp/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
shjwudp/twitter-dialogue