Figure 3 explanation

Question

Figure 3 explanation

Muennighoff opened this issue 2 years ago · 2 comments

Sorry, a few things are unclear to me about Fig 3:

What is generated by the model? Everything starting from <work>? Including Answer?
Is it zero-shot like in the image?
Is it a mistake from the model that it generates « & >?
Does the model literally predict the output of the program (0.0051) or did you run the program externally and then add it to the image?

Thanks 🤗

Answer 1 · 2022-11-19T12:06:18.000Z

Thanks for the comment: for inference, it generates everything after including Answer. It is "zero-shot" in the sense you just need to prompt with at test-time.

Answer 2 · 2022-11-19T13:56:22.000Z

Interesting, so the calculation 0.0051 was not done using an external program, but the model did it itself!
Thanks 👍