/receipt-parser

A fuzzy (supermarket) receipt parser written in Python using tesseract

Primary LanguagePythonApache License 2.0Apache-2.0

A fuzzy receipt parser written in Python

Updating your housekeeping book is a tedious task: You need to manually find the shop name, the date and the total from every receipt. Then you need to write it down. At the end you want to calculate a sum of all bills. Nasty. So why not let a machine do it?

This is a fuzzy receipt parser written in Python. You give it any dirty old receipt lying around and it will try its best to find the correct data for you.

It started as a hackathon project. Read more about it on the trivago techblog.
Also read the comments on HackerNews
Oh hey! And there's also a talk online now if you're the visual kind of person.

Future Plans

The plan is to write the parsed receipt data into a CSV file. This is enough to create a graph with GnuPlot or any spreadsheet tool. If you want to get fancy, write an output for ElasticSearch and create a nice Kibana dashboard. I'm happy for any pull request.