misc SizeTest: gpt-neo to gpt2 This script allows converting a transformers gpt-neo model into a gpt2 model. Local attention is implemented through the bias buffer and stored in the weights.