ofirpress

Modeling language

@uwnlp

Pinned Repositories

0plot
Use 0plot to automatically build matplotlib plots using ChatGPT.
Language:JavaScript19 1 40
attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022)
Language:Python507 12 1939
PartialShuffle
Language:Python14 2 02
sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer Models by Reordering their Sublayers.
Language:Python55 3 22
self-ask
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
Language:Jupyter Notebook300 6 431
shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
Language:Python145 4 48
tensorflow_with_latest_papers
Implementation of Newest RNN and Seq2Seq Features
Language:Python1 2 00
tstl_t5_bias
This is our implementation of the T5 bias for fairseq.
Language:Python2 1 00
UsingTheOutputEmbedding
Code for the EACL paper "Using the Output Embedding to Improve Language Models" by Ofir Press and Lior Wolf
Language:Lua45 4 07
YouMayNotNeedAttention
Code for the Eager Translation Model from the paper You May Not Need Attention
Language:Python294 15 228

ofirpress's Repositories

ofirpress/attention_with_linear_biases
Code for the ALiBi method for transformer language models (ICLR 2022)
Language:Python507 12 1939
ofirpress/self-ask
Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"
Language:Jupyter Notebook300 6 431
ofirpress/YouMayNotNeedAttention
Code for the Eager Translation Model from the paper You May Not Need Attention
Language:Python294 15 228
ofirpress/shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
Language:Python145 4 48
ofirpress/sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer Models by Reordering their Sublayers.
Language:Python55 3 22
ofirpress/UsingTheOutputEmbedding
Code for the EACL paper "Using the Output Embedding to Improve Language Models" by Ofir Press and Lior Wolf
Language:Lua45 4 07
ofirpress/0plot
Use 0plot to automatically build matplotlib plots using ChatGPT.
Language:JavaScript19 1 40
ofirpress/PartialShuffle
Language:Python14 2 02
ofirpress/tstl_t5_bias
This is our implementation of the T5 bias for fairseq.
Language:Python2 1 00
ofirpress/tensorflow_with_latest_papers
Implementation of Newest RNN and Seq2Seq Features
Language:Python1 2 00
ofirpress/awd-lstm-lm
Language:Python0 2 00
ofirpress/BIG-bench
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Language:Python0 0
ofirpress/composer
library of algorithms to speed up neural network training
Language:Python0 0
ofirpress/dl4mt-tutorial
Language:Python1 0
ofirpress/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Language:Python1 0
ofirpress/LeViT_ALiBi
LeViT + ALiBi
Language:Python3 0
ofirpress/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Language:Python0 0
ofirpress/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Language:Python1 0
ofirpress/ofirpress.github.io
Build a Jekyll blog in minutes, without touching the command line.
Language:SCSS1 01
ofirpress/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python2 0
ofirpress/RecurrentHighwayNetworks
Recurrent Highway Networks - Author implementation for Tensorflow and Torch
Language:Python2 0
ofirpress/SciCode
A benchmark that challenges language models to code solutions for scientific problems
ofirpress/Snowballed_Hallucination
0 0
ofirpress/sockeye
Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet
Language:Python1 0
ofirpress/tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++1 0
ofirpress/the-gan-zoo
A list of all named GANs!
Language:Python2 0

ofirpress

Pinned Repositories

0plot

attention_with_linear_biases

PartialShuffle

sandwich_transformer

self-ask

shortformer

tensorflow_with_latest_papers

tstl_t5_bias

UsingTheOutputEmbedding

YouMayNotNeedAttention

ofirpress's Repositories

ofirpress/attention_with_linear_biases

ofirpress/self-ask

ofirpress/YouMayNotNeedAttention

ofirpress/shortformer

ofirpress/sandwich_transformer

ofirpress/UsingTheOutputEmbedding

ofirpress/0plot

ofirpress/PartialShuffle

ofirpress/tstl_t5_bias

ofirpress/tensorflow_with_latest_papers

ofirpress/awd-lstm-lm

ofirpress/BIG-bench

ofirpress/composer

ofirpress/dl4mt-tutorial

ofirpress/examples

ofirpress/LeViT_ALiBi

ofirpress/Megatron-DeepSpeed

ofirpress/NLP-progress

ofirpress/ofirpress.github.io

ofirpress/pytorch

ofirpress/RecurrentHighwayNetworks

ofirpress/SciCode

ofirpress/Snowballed_Hallucination

ofirpress/sockeye

ofirpress/tensorflow

ofirpress/the-gan-zoo