What a multimodal AI sees when it reads your restaurant receipt, why it gets things right (and occasionally wrong), and how to spot mistakes.
Receipt scanning used to be the kind of feature that almost worked. You'd snap a photo, the app would catch the total maybe 60% of the time, miss the date, hallucinate a line item, and you'd spend longer fixing it than typing the receipt manually.
That changed in about 2024. Multimodal AI models can now read a crumpled, sideways, slightly-wet restaurant bill in two seconds and produce a structured list of items with prices, tax, tip, and the merchant name. It's not magic - it's linear algebra and a lot of training data - but the practical effect is that splitting an itemised dinner takes 30 seconds instead of 5 minutes.
Here's what's actually happening when you point your phone at a receipt, why it works now when it didn't before, and where it still trips up.
Multimodal language models read the image in a single inference pass and emit structured JSON. Accuracy on clean receipts is ~95%; on photos taken in a moving car after three drinks, somewhat less. Always sanity-check the total against the printed bill before you commit.
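That last check is easy to automate on the app side too. A minimal sketch - the field names are illustrative, not EvenRound's actual schema - that flags a scan whose parts don't add up to the printed total:

```typescript
type ScannedReceipt = {
  merchant: string;
  currency: string;
  lineItems: { description: string; price: number }[];
  tax: number;
  tip: number;
  total: number;
};

// Flag scans where items + tax + tip drift from the stated total.
// A small tolerance absorbs rounding; anything bigger usually means
// the model misread a digit and a human should look at it.
function totalLooksRight(r: ScannedReceipt, tolerance = 0.01): boolean {
  const computed =
    r.lineItems.reduce((sum, item) => sum + item.price, 0) + r.tax + r.tip;
  return Math.abs(computed - r.total) <= tolerance;
}
```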
Before 2023, receipt scanning meant OCR plus rules. The OCR engine (Tesseract or one of its commercial cousins) would convert the image to text. Then a rule-based parser would try to identify the total ("look for 'TOTAL', take the number to its right"), the line items ("each row with a quantity and a price"), and the merchant ("first non-numeric line at the top").
It worked for receipts that looked like the receipts the rules were written for. It failed catastrophically the moment a receipt deviated from that template: crumpled, sideways, laid out differently, or labelled in a language the rules didn't expect.
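To see why the approach was so brittle, here's a caricature of those rules in code - a hypothetical sketch of the general technique, not any particular app's parser:

```typescript
// Naive rules over OCR'd text lines - the pre-2023 approach, roughly.
function parseReceipt(ocrLines: string[]) {
  // "Look for 'TOTAL', take the number to its right."
  // (Happily matches 'SUBTOTAL' too - one of many ways this breaks.)
  const totalLine = ocrLines.find((l) => /total/i.test(l));
  const total = totalLine?.match(/(\d+[.,]\d{2})/)?.[1];

  // "Each row with a quantity and a price is a line item."
  const items = ocrLines
    .filter((l) => /^\s*\d+\s+.+\s+\d+[.,]\d{2}\s*$/.test(l))
    .map((l) => l.trim());

  // "The merchant is the first non-numeric line at the top."
  const merchant = ocrLines.find((l) => l.trim() && !/\d/.test(l));

  return { merchant, items, total };
}
```

Every quoted rule is a guess about layout, and every guess fails on some real receipt.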
Splitwise, Tricount, and most receipt-scanning expense apps used OCR-plus-rules until 2024. Hence the 60% accuracy.
Two things, simultaneously.
First: multimodal models. GPT-4V, Claude, Gemini - all of them learned to "see" images and text in the same representation space. Instead of converting the image to text and then reasoning over the text, they reason over the image directly. That means the model can use spatial information: "this number is directly below this label and aligned to the right, so they belong to the same row".
Second: structured output. The same models can emit JSON that matches a schema you define. So instead of asking "what does this receipt say?" and parsing prose, the app asks "fill in this schema" and gets back a structured object: merchant, total, line items, currency, date.
These two together collapse the OCR-plus-rules pipeline into one inference pass. The accuracy on real-world receipts goes from ~60% to ~95% on clean images, ~85% on bad ones.
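In code, that collapse is visible: the whole pipeline becomes one API call. Here's a minimal sketch against Anthropic's Messages API - the model name, tool name, and schema fields are illustrative assumptions, not EvenRound's production code:

```typescript
import Anthropic from "@anthropic-ai/sdk";
import { readFileSync } from "node:fs";

const client = new Anthropic(); // reads ANTHROPIC_API_KEY from the environment

async function scanReceipt(imagePath: string) {
  const response = await client.messages.create({
    model: "claude-3-5-sonnet-latest", // assumed model; use whatever you like
    max_tokens: 1024,
    // Forcing a tool call is the standard way to get schema-shaped JSON back.
    tools: [
      {
        name: "record_receipt",
        description: "Record the structured contents of a receipt image.",
        input_schema: {
          type: "object",
          properties: {
            merchant: { type: "string" },
            date: { type: "string" },
            currency: { type: "string" },
            line_items: {
              type: "array",
              items: {
                type: "object",
                properties: {
                  description: { type: "string" },
                  price: { type: "number" },
                },
                required: ["description", "price"],
              },
            },
            tax: { type: "number" },
            tip: { type: "number" },
            total: { type: "number" },
          },
          required: ["merchant", "total", "line_items"],
        },
      },
    ],
    tool_choice: { type: "tool", name: "record_receipt" },
    messages: [
      {
        role: "user",
        content: [
          {
            type: "image",
            source: {
              type: "base64",
              media_type: "image/jpeg",
              data: readFileSync(imagePath).toString("base64"),
            },
          },
          { type: "text", text: "Read this receipt and record its contents." },
        ],
      },
    ],
  });

  // The structured object comes back as the tool call's input.
  const toolUse = response.content.find((block) => block.type === "tool_use");
  return toolUse?.type === "tool_use" ? toolUse.input : null;
}
```

No OCR step, no rules: the image goes in, a schema-shaped object comes out.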
Step by step:

1. You snap the photo from the new-expense form.
2. The image travels from your phone to our EU server, then to Anthropic's API.
3. The model reads the image in a single pass and emits JSON matching our schema: merchant, date, currency, line items, tax, tip, total.
4. The response comes back and the expense form fills itself in.
5. You tap items against names to record who had what.
Total elapsed time: about 5 seconds for the model call, 30 seconds for the human tapping items against names. End to end, an itemised £140 dinner for six people is logged in well under a minute.
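The tapping step is the only part that needs a human; the arithmetic behind it is simple. Here's a sketch - types and helper are hypothetical, not EvenRound's internals - of turning those taps into per-person totals, splitting shared items evenly and spreading tax and tip proportionally:

```typescript
type LineItem = { id: string; description: string; price: number };
type Assignment = { itemId: string; memberIds: string[] };

// Each item's price splits evenly among the people who tapped it;
// tax and tip are then spread in proportion to each person's share.
function perPersonTotals(
  items: LineItem[],
  assignments: Assignment[],
  tax: number,
  tip: number,
): Map<string, number> {
  const itemsById = new Map(items.map((i) => [i.id, i] as const));
  const itemsSum = items.reduce((s, i) => s + i.price, 0);
  const totals = new Map<string, number>();

  for (const { itemId, memberIds } of assignments) {
    const item = itemsById.get(itemId);
    if (!item || memberIds.length === 0) continue;
    const share = item.price / memberIds.length;
    for (const id of memberIds) {
      totals.set(id, (totals.get(id) ?? 0) + share);
    }
  }

  // Distribute tax and tip proportionally, then round to pennies.
  const scale = itemsSum > 0 ? (itemsSum + tax + tip) / itemsSum : 1;
  for (const [id, amount] of totals) {
    totals.set(id, Math.round(amount * scale * 100) / 100);
  }
  return totals;
}
```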
We'd rather over-disclose than oversell. The model still struggles on the worst inputs - the crumpled, the sideways, the slightly wet, and the photographed-in-a-moving-car-after-three-drinks genre from earlier. Those land in the ~85% category, not the ~95% one.
For the worst-case receipts, you can always just type the total manually. The scanner is a speed-up, not a replacement.
We have more on the failure modes in when AI gets the receipt wrong: what to check.
Two questions people ask, both fair.
"Does the model train on my receipts?"No. We use Anthropic's API, which doesn't train on customer data. The same applies to OpenAI's API and Google's Gemini API tiers we'd use. The free consumer-facing chatbots have different policies; the API products do not train on customer inputs by default.
"Where do receipt photos go?"EU-region Supabase storage. They're associated with the group they belong to, governed by the same row-level security as the expense itself. When you delete a group, the receipts go too. We covered this in detail in where do receipt photos actually go? (an honest answer).
Each receipt scan costs us about £0.005-£0.01 in model tokens. It's the reason AI features tend to be paywalled in expense apps - someone has to pay for the inference.
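The back-of-envelope arithmetic, with assumed token counts and rates (check current provider pricing; these are illustrative, not measured):

```typescript
// Rough cost per scan. Every number here is an assumption for illustration.
const imageTokens = 1_500;     // a phone photo, downscaled for the model
const outputTokens = 300;      // the structured JSON response
const inputPricePerMTok = 3;   // USD per million input tokens (assumed)
const outputPricePerMTok = 15; // USD per million output tokens (assumed)

const usd =
  (imageTokens / 1e6) * inputPricePerMTok +
  (outputTokens / 1e6) * outputPricePerMTok;
// ≈ $0.009, roughly £0.007 - inside the £0.005-£0.01 range above.
console.log(usd.toFixed(4));
```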
Free tier on EvenRound gets 5 receipt scans per group per month, which covers most casual users. Plus is unlimited (within a fair-use ceiling) for £3.99/month if you want to scan everything.
We'd move the model call to the edge. Right now the receipt photo travels from your phone, to our EU server, to Anthropic's API, back to our server, back to you. That's a lot of transit. With a streaming response and the right edge runtime, the round-trip drops by maybe 30% - which on a 4-second call is over a second of perceived speed-up. It's on the roadmap.
Next time you split a restaurant bill, create a group, add the members, and tap "Scan receipt" on the new-expense form. Snap the bill on the table. The form fills in. Tap items against names. You're done before the card terminal has finished printing.
Free forever. No signup. Works in your browser in 30 seconds.