A finance team automates 70% of its invoice processing on a Tuesday. By Friday, the single reviewer assigned to handle exceptions is staring at 1,200 unresolved cases, rubber-stamping approvals to keep pace. The automation percentage was correct. The human oversight layer was not. That distinction, between how much you automate and how many people you need watching what the automation does, is the gap that most AI implementations fall into.

Gartner's early 2026 data makes the scale of the problem concrete: close to 60% of finance teams are piloting or implementing AI, but only 7% of CFOs report a strong impact from the investment. AI now handles an estimated 47% of routine finance tasks (Innovature BPO), up from 40% a year earlier. Deloitte reports 63% of finance departments actively using AI solutions. The technology is plainly working. The returns, for most, are not.

The usual explanations — poor implementation, bad data, organisational resistance — account for part of the gap. But there is a structural problem beneath them: teams are calibrating the wrong variable. They ask what percentage of work should be automated. They should be asking how many humans they need to maintain quality at each automation level.

The ratio, not the percentage

Ledgeris's Automation Ceiling framework maps, for each back-office function, the point at which AI stops saving money and starts creating risk. The Human-AI Ratio is its operational companion: where the Ceiling sets the maximum automation level, the Ratio sets the minimum human involvement needed to hold quality at that level. These figures are drawn from Ledgeris's operational data across client engagements, and they shift as the AI learns a specific client's patterns. They are starting points, not universal constants.

For data entry and transaction processing, where the Ceiling sits at 85–90%, the starting ratio is roughly one human reviewer per 500–800 automated transactions daily. The reviewer samples 5–10% of processed items, checks that the AI's categorisation matches the source document, and investigates flagged anomalies. At this ratio, error rates on standardised inputs stay below 0.5%.

As the work grows more complex, the ratio tightens. Accounts payable and receivable (Ceiling: 70–75%) require roughly one human per 200–400 automated transactions. The human handles the 25–30% that involve mismatched purchase order numbers, duplicate invoices, vendor disputes, or unusual approval chains. Reconciliations (Ceiling: 80–85%) sit at roughly one per 300–500 automated matches, with the human investigating timing differences, legacy data artefacts, and intercompany entries that resist pattern matching.

Then the balance inverts. For financial reporting (Ceiling: 40–50%), AI generates draft reports from structured data, runs variance detection, and aggregates numbers. The analyst does the rest: the interpretive commentary, the narrative explaining why revenue rose 8% and what it signals for next quarter. That interpretive layer is irreducibly human. The ratio is closer to 1:1, with the AI functioning as a productivity tool rather than an autonomous processor. For compliance and audit preparation (Ceiling: 25–35%), where regulatory judgement, risk assessment, and signing authority belong to credentialed professionals, the ratio is roughly three to four humans per AI-assisted workflow. The AI assembles documents, tracks checklists, and validates data. The humans make the calls.

Two failure modes

The Gartner gap is, in Ledgeris's experience, largely a ratio problem, though not the only one. Two patterns recur.

Over-automation is the more obvious: pushing AI past the Ceiling for a given function and generating errors that cost more to fix than the automation saved. A compliance team that automates 60% of its work when the Ceiling sits at 30% will produce filings that invite regulatory scrutiny rather than reducing it.

Under-supervision is subtler. The automation percentage is correct, but the human review layer is understaffed. An AP team that automates 70% of invoice processing but assigns one reviewer to 5,000 monthly exceptions will find that reviewer overwhelmed and rubber-stamping approvals without investigation. Errors pass through unchallenged. The technology performed as designed. The operation did not.

Setting the ratio

Three inputs determine the right ratio for a given function. First, identify the Automation Ceiling: the maximum percentage of work that AI should handle. Second, calculate the exception volume, which is the number of transactions per month that will require human judgement at that automation level. For AP at 70% automation, exception volume is 30% of total transactions; for data entry at 85%, it is 15%. Third, staff the human layer to the exception volume, with headroom for spikes, rather than to the total transaction volume.

The ratio is not fixed. As the AI learns a client's specific vendor behaviours, recurring exceptions, and seasonal volume shifts, the exception rate declines. A team that starts at one human per 200 automated AP transactions in month one may reach one per 350 by month six. The adjustment should be data-driven and gradual. Assumed improvement is how the second failure mode begins.

What to ask your provider

When evaluating an outsourcing provider's AI capabilities, ask for the ratio, not the percentage. A provider who says they automate 80% of AP is describing the software. A provider who says their AP operation runs at a 1:300 human-to-transaction ratio on automated volume, with a dedicated exception team handling the remaining 20–25% at a 1:40 case-to-person ratio, is describing how the work actually gets done. The first answer is marketing. The second is an operation.

The distinction is easy to test, and, for exactly that reason, revealing.

Ledgeris calibrates the Human-AI Ratio for each client's function and transaction profile. Book a free Back-Office Audit at ledgeris.com/contact.

Featured

May 18, 2026

Why Your Next Finance and Accounting Hub Should Be in Lagos: The GBS Alternative

May 18, 2026

A CFO at a $35M logistics company told us last year that her outsourced AP team processed invoices accurately but could never tell her why a vendor's billing pattern had changed. The team handled her account alongside four others. They knew her chart of accounts well enough to post entries correctly, but not her business well enough to flag that a freight partner had quietly shifted from net-30 to net-15 terms. When the variance showed up at month-end, it looked like a timing mystery. It was a contract issue.

May 18, 2026

May 17, 2026

The Human-AI Ratio: Why the Automation Percentage Is the Wrong Question

May 17, 2026

A finance team automates 70% of its invoice processing on a Tuesday. By Friday, the single reviewer assigned to handle exceptions is staring at 1,200 unresolved cases, rubber-stamping approvals to keep pace. The automation percentage was correct. The human oversight layer was not. That distinction, between how much you automate and how many people you need watching what the automation does, is the gap that most AI implementations fall into.

May 17, 2026

Red Flags to Look Out For When Evaluating African BPO Providers

May 17, 2026

Outsourcing evaluations tend to fixate on what can be seen: generators, office layouts, monitors on desks. These are the things that photograph well for a due diligence report. The risks that actually derail engagements (an undocumented workflow, a supervision layer that collapses at scale, a compliance posture that exists only on paper) are invisible during a site visit and silent until they cause damage.

May 17, 2026

The Human-AI Ratio: Why the Automation Percentage Is the Wrong Question

The ratio, not the percentage

Two failure modes

Setting the ratio

What to ask your provider

Why Your Next Finance and Accounting Hub Should Be in Lagos: The GBS Alternative

Red Flags to Look Out For When Evaluating African BPO Providers

Ledgeris

Contact Us