Pinned Repositories
0plot
Use 0plot to automatically build matplotlib plots using ChatGPT.
attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022)
PartialShuffle
sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper "Improving Transformer Models by Reordering their Sublayers".
self-ask
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
tensorflow_with_latest_papers
Implementation of Newest RNN and Seq2Seq Features
tstl_t5_bias
This is our implementation of the T5 bias for fairseq.
UsingTheOutputEmbedding
Code for the EACL paper "Using the Output Embedding to Improve Language Models" by Ofir Press and Lior Wolf
YouMayNotNeedAttention
Code for the Eager Translation Model from the paper "You May Not Need Attention"
ofirpress's Repositories
ofirpress/attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022)
ofirpress/self-ask
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
ofirpress/YouMayNotNeedAttention
Code for the Eager Translation Model from the paper "You May Not Need Attention"
ofirpress/shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
ofirpress/sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper "Improving Transformer Models by Reordering their Sublayers".
ofirpress/UsingTheOutputEmbedding
Code for the EACL paper "Using the Output Embedding to Improve Language Models" by Ofir Press and Lior Wolf
ofirpress/0plot
Use 0plot to automatically build matplotlib plots using ChatGPT.
ofirpress/PartialShuffle
ofirpress/tstl_t5_bias
This is our implementation of the T5 bias for fairseq.
ofirpress/tensorflow_with_latest_papers
Implementation of Newest RNN and Seq2Seq Features
ofirpress/awd-lstm-lm
ofirpress/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
ofirpress/composer
A library of algorithms to speed up neural network training
ofirpress/dl4mt-tutorial
ofirpress/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
ofirpress/LeViT_ALiBi
LeViT + ALiBi
ofirpress/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
ofirpress/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
ofirpress/ofirpress.github.io
Build a Jekyll blog in minutes, without touching the command line.
ofirpress/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
ofirpress/RecurrentHighwayNetworks
Recurrent Highway Networks - Author implementation for Tensorflow and Torch
ofirpress/SciCode
A benchmark that challenges language models to code solutions for scientific problems
ofirpress/Snowballed_Hallucination
ofirpress/sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
ofirpress/tensorflow
Computation using data flow graphs for scalable machine learning
ofirpress/the-gan-zoo
A list of all named GANs!