/transformers-mathematics

transformers for mathematical reasoning

Primary LanguagePython

transformers-mathematics

GPT-2 sucks at third grade math, I wonder if we can do better

We will be using this dataset.

Joint work by Helen Ngo, Joseph Palermo and Michael Jia, with support from Rayhane Mama.

Benchmarks TODO

All character-level.

  • LSTM with teacher forcing
  • tiny Transformer
    • add Encoder
  • regular Transformer
  • 1558M GPT-2 finetune, but there's some nuance here

Other resources for mathematical reasoning

This is mostly a to-read list.