Pinned Repositories
adaptive-computation-time-pytorch
Alex Graves' Adaptive Computation Time in PyTorch
bert_on_stilts
Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs
extratrees_cuda
Optimizing Extremely Randomized Trees with GPUs
lrqa
minimal-gpt-neox-20b
minimal-llama
minimal-opt
saliency_investigation
Code for "Investigating and Simplifying Masking-based Saliency Methods for Model Interpretability" (https://arxiv.org/abs/2010.09750)
transformers
Code and models for BERT on STILTs
usc_dae
Repository for Unsupervised Sentence Compression using Denoising Auto-Encoders
zphang's Repositories
zphang/minimal-llama
zphang/minimal-gpt-neox-20b
zphang/bert_on_stilts
Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs
zphang/minimal-opt
zphang/transformers
Code and models for BERT on STILTs
zphang/usc_dae
Repository for Unsupervised Sentence Compression using Denoising Auto-Encoders
zphang/lrqa
zphang/hyperllama
zphang/llm_feedback
zphang/minimal-t5
zphang/my_pefty_llama
Minimal implementation of multiple PEFT methods for LLaMA fine-tuning
zphang/sndict
Structured Nested Dictionaries
zphang/hpt
zphang/llama_peft
zphang/zphang.github.io
GitHub Pages site
zphang/architecture-objective
zphang/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
zphang/doc-chat-ui
zphang/FLAN
zphang/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
zphang/GPTQ-for-LLaMa
4-bit quantization of LLaMA using GPTQ
zphang/hf_benchmark_sample
zphang/jiant
The jiant toolkit for general-purpose text understanding models
zphang/lm_evaluation_harness
zphang/Megatron-DeepSpeed
Ongoing research on training transformer language models at scale, including BERT and GPT-2
zphang/mpi4py
Python bindings for MPI
zphang/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
zphang/pegasus
zphang/summarization_experiments
zphang/t5x