AI invoice scoring: what the model actually looks at
The A-to-E grade attached to every invoice combines OCR confidence, fraud signals, debtor credit quality and supplier history. Here's what goes into the number.
Why a graded score, not a simple "accept / reject"
On the Tauraco marketplace, every invoice receives a grade from A to E. This score drives two things:
- The rate offered to investors (from 80 bps for an A to 400+ bps for a D)
- Visibility on the marketplace (A's surface first, E's are never listed)
A binary "accept / reject" pushes you to either reject too many viable deals or mis-price risk. The A-to-E grade lets you fund more broadly without diluting portfolio quality.
The 4 dimensions of the model
1. OCR extraction quality (~15 % weight)
The Tauraco OCR extracts: invoice number, dates (issue/due), amounts (net/VAT/gross), party identifiers (SIRET / SIC / OHADA RC), bank accounts. A poorly scanned document, an algorithmically-generated PDF with inconsistencies, or an unexpected format pushes confidence down. Confidence < 75 % triggers automatic human review.
2. Proactive fraud signals (~30 % weight)
Several detectors run in parallel:
- Duplicate: same invoice already factored elsewhere (content hash
- fuzzy match on number and amounts)
- Date coherence: issue > due, due date in the past, abnormal gap
- Amount coherence: VAT not matching local rates, total ≠ net + VAT
- Supplier = buyer: classic fraud signal
- Fictitious counterparty: SIREN that doesn't exist or has been struck off
- Amount-dressing pattern: suspiciously round amounts, abnormal numeric run
Any alert with score ≥ 60 plus a "hard" flag (e.g. supplier = buyer) triggers automatic rejection before human review.
3. Debtor credit quality (~35 % weight)
The dominant factor in the final score. We combine:
- Public solvency data (Banque de France, Greffes, OHADA BIB)
- Payment history on Tauraco when available
- Sector and macro exposure
- Size (revenue, headcount)
- Country and sovereign risk
A listed blue-chip or recognised public operator gets a very high implicit credit score. An SMB in a struggling sector pulls the grade down.
4. Supplier history on the platform (~20 % weight)
Suppliers with a clean track record on Tauraco see their grade marginally improved. Those with an incident (late payment, dispute) see their next invoice drop one notch.
A score is reviewable: not a black box
Scoring isn't opaque. Each invoice ships with a breakdown of contributions: "grade B because OCR confidence 92 % + debtor grade A
- supplier on first deal". Operators can request a review with justification.
Observed metrics
- Default rate on A portfolio: 0.4 % over 14 months
- Default rate on B portfolio: 1.6 %
- Fraud-detection precision: 97 % (~3 % false positives)
- Average scoring latency: 42 seconds
Going deeper
- See a graded example on /market
- Read our detailed anti-fraud approach
- Understand OHADA compliance