Receipt OCR
What OCR actually reads off a receipt, where it still fails in 2026, and how Starlog is built so a misread never sneaks into your books.
Last updated: 2026-06-11
Every receipt app promises some version of “just snap a photo and we extract everything automatically.” It's a good pitch because it's mostly true — and the “mostly” is exactly where your bookkeeping either holds up or quietly goes wrong.
This page explains what receipt OCR is, what Starlog's OCR does and deliberately doesn't do, and how to get the most out of it. No claimed accuracy percentages, no “AI magic” — just how the thing actually works.
Receipt OCR (optical character recognition) is software that reads the text on a photographed receipt and turns it into structured expense data — merchant, amount, date — so you don't type it yourself. It's what makes capturing an expense a five-second job instead of a sixty-second one.
Where it runs matters more than most people realise. Many apps upload your receipt to a cloud OCR service to read it — which means a copy of every supplier, price, and purchase passes through someone else's infrastructure. The alternative is on-device OCR: the photo is read by your phone itself, and never leaves it for the sake of text extraction.
Starlog uses Google ML Kit's text recognition, running entirely on your phone. When you snap a receipt:
PDF receipts work too: Starlog extracts the first page and runs the same on-device recognition over it — handy for email invoices and ride-hailing receipts.
Because recognition happens on-device, Starlog doesn't ship your receipt to a third-party OCR API to read it. And once captured, the original image is filed into your own Google Drive — sorted by business, year, and month — not into a vendor's database you'd have to export your way out of someday.
That's a deliberate position: the extracted text is a convenience, but the image is the record — the thing an audit asks for in three years. Records like that belong in storage you control. More on the full posture on the security & privacy page.
An honest list, because real receipts aren't the crisp ones in the demos:
The full breakdown is in What OCR still gets wrong on real receipts.
There are two ways for OCR to be wrong, and they're not equally bad. Failing visibly — “can you check this total?” — costs you a glance. Failing silently — confidently reading ₹240 from a receipt that says ₹2,400 — sails into your records looking finished, and surfaces months later as a number that won't reconcile.
Starlog is designed around that asymmetry: OCR drafts, you confirm, corrections take a tap, and the original image is never thrown away. A human stays in the loop by design — which is the correct division of labour for records you're legally responsible for.
OCR quality is mostly photo quality. Three habits cover most of it:
The complete checklist is in How to photograph a receipt so the OCR actually works.