LLM experiments How good is GPT-4 at math?. It gets arithmetic ~30-40% right on small datasets (25 rows, 5 columns). So, not very good.