Banking QA · Lesson 2

Payments Testing

A payment looks like one click. Underneath it is a sequence of steps across several systems, each of which can fail differently. This lesson teaches you to test that sequence end to end.

Banking QA Banking & Payments — Lesson 2 of 5 ~30 min read · ~70 min with exercises

1 The Hook

A fictional NZ retailer, Kauri & Co, took a $120 EFTPOS card payment at the counter on a Friday afternoon. The terminal beeped, the screen said approved, the customer walked out with their goods. The shop assumed it had its money.

On Monday the retailer’s bank statement did not show the $120. The payment had been authorised — the customer’s bank had confirmed the funds and placed a hold — but it had never been settled. A defect in the terminal’s end-of-day batch meant the day’s authorised transactions were never sent for settlement. The hold on the customer’s account quietly expired a few days later, and the money simply never moved. The shop had goods out the door and nothing in the bank.

To the cashier, the customer, and the shop owner, the payment had “gone through.” The approval beep was real. But approval is not money. The payment had completed the first half of its life — authorisation — and silently failed the second half — settlement. The two are separate steps that can succeed or fail independently, and most people, including many testers, treat them as one.

Here is the lesson hidden in that story. The team had tested that a card payment got approved. They had a green path that ended at the approval screen. No test followed the payment all the way to settled funds and reconciled them against what the terminal reported. The gap between “approved” and “settled and reconciled” is exactly where payments testing lives.

2 The Rule

An approved payment is not a completed payment. Authorisation reserves the money; settlement actually moves it; reconciliation proves it moved. Test all three. A payment is only correct when the funds have settled and the settled total reconciles to what was authorised — not when a screen says approved.

3 The Analogy

Analogy

Booking a campervan for a Great Walk over summer.

When you reserve the van, the company puts a hold on your card — they have checked the money is there and set it aside, but they have not taken it. That is authorisation. The actual charge happens later, when you pick the van up. If you never collect, the hold expires and no money changes hands. The reservation felt like paying, but nothing actually moved.

A card payment works the same way. The approval at the terminal is the hold on the van; settlement is the charge when you collect. Kauri & Co got the reservation but the charge never ran, so the hold expired and the money stayed put. A payments tester is the person who checks not just that the van was reserved, but that it was actually collected and paid for — and that the company’s books show the same thing the customer’s do.

4 The NZ Payment Rails

A “rail” is a path money travels along. NZ has several, and they behave differently enough that a tester must know which one a payment uses.

Card and EFTPOS. When a customer taps or inserts a card, the payment runs through a scheme — the EFTPOS network for domestic debit, or an international scheme for credit and contactless. The defining feature is the two-step nature: an authorisation in real time, then settlement later in a batch. The Kauri & Co case is a card-rail failure between those two steps.

Direct debit. Here the customer gives a biller standing authority to pull money from their account — a gym, a power company, an insurer. The biller initiates; the customer’s bank honours it against the authority on file. The risks are different: a debit pulled without valid authority, a debit for the wrong amount, or a debit the customer is entitled to dispute and have reversed.

Bill payment and bank transfer. A customer pushes money from their account to a payee — paying a bill, sending money to a friend. Traditionally these moved between NZ banks in scheduled interbank exchanges through the day, so a payment made late in the evening might not land in the payee’s account until the next business day. This is the “7-day” world many people still picture.

Real-time / faster payments. NZ is moving toward payments that clear in seconds, any time, including weekends. The principle a tester holds onto is that real-time changes the timing and irrevocability but not the obligations: the right amount must reach the right payee, and the movement must still reconcile. Real-time payments are often final the instant they land, which raises the stakes on getting the payee and amount right the first time, because there is no comfortable settlement window to catch an error in.

Pro tip: The first question to ask about any payment feature is “which rail, and is it two-step or single-step?” Card and EFTPOS are two-step (authorise then settle). A real-time transfer is closer to single-step and often irrevocable. The rail determines your whole test approach — what can fail, when, and whether it can be undone.

5 Authorisation versus Settlement

This distinction is the spine of card payments testing, so it is worth being precise.

Authorisation is the real-time check. The merchant’s system asks the customer’s bank: is this card valid, and are these funds available? The bank answers approve or decline, and on approval it places a hold — the customer’s available balance drops by the amount, but the money has not actually left. No funds have moved yet. The hold is a promise, with an expiry.

Settlement is the later movement of the actual money, usually in an end-of-day batch. The merchant submits the day’s authorised transactions, and funds move from the customers’ banks to the merchant’s bank. Only now has money truly changed hands.

Several things can go wrong in the gap, and each is a test:

Authorised, never settled: the Kauri & Co failure — the hold expires and the money never moves.
Settled amount differs from authorised amount: common and legitimate for tips or fuel, where the final charge differs from the hold — but it must be within the allowed rules and tracked.
Double settlement: the same authorisation is submitted twice, charging the customer twice.
Hold not released on decline or cancellation: the transaction is abandoned but the customer’s funds stay locked, so they cannot spend their own money.

For a tester, the rule is simple: never stop at the approval screen. Follow every payment to settled funds, and confirm the settled amount, the settlement timing, and that any hold was correctly released or converted.

6 The Payment Lifecycle and Chargebacks

A card payment has a longer life than authorise-then-settle. After settlement comes the possibility of a dispute.

A chargeback is a customer-initiated reversal of a settled card payment, raised through their bank — the goods never arrived, the charge was fraudulent, the amount was wrong. The money flows back from the merchant to the customer, and the merchant may be asked to provide evidence to contest it. For NZ debit and many everyday payments there are also dispute and reversal paths, and persistent unresolved disputes can end up at the Banking Ombudsman.

A refund is different and worth not confusing with a chargeback. A refund is initiated by the merchant — a willing return of money — and it is a fresh payment in the opposite direction, with its own authorisation and settlement. A chargeback is forced by the customer’s bank; a refund is given by the merchant. Both must reconcile, and a tester checks that a refund moves the right amount, that it cannot exceed the original payment, and that a refunded transaction cannot also be charged back for the same money.

So the full lifecycle a tester reasons about is: authorise → settle → reconcile → (possibly) refund or chargeback → reconcile again. Each transition is a place where money can be duplicated, lost, or moved by the wrong amount. The test job is to prove that at every stage the books on both sides still agree.

7 What to Test in a Payment

The practical checklist for a payments tester:

Authorisation paths: approve, decline (insufficient funds, invalid card, limit exceeded), and timeout. A timeout is not an approval — test that an unanswered authorisation is treated as not authorised.
Settlement completeness: every authorised transaction that should settle does settle, and the settled total equals the authorised total, accounting for legitimate adjustments.
Hold lifecycle: a hold is placed on authorisation, released on decline or cancellation, and converted — not duplicated — on settlement.
Idempotency: resubmitting the same payment (a retry after a dropped connection) must not charge the customer twice. This is critical for real-time rails where the client cannot tell if the first attempt landed.
Direct debit authority: a debit only succeeds against a valid authority, for an amount within its terms, and a disputed debit can be reversed.
Reconciliation: the merchant’s records, the scheme’s settlement report, and the bank’s ledger all agree, with breaks flagged.
Refunds and chargebacks: a refund cannot exceed the original; a transaction cannot be both refunded and charged back for the same amount.

8 Building Payment Test Cases

A strong payment test case states which rail it is on, follows the payment past approval, and asserts on settled funds and reconciliation — not just the approval response.

Here is a worked test case for the Kauri & Co card scenario, written to catch the exact bug that hurt them:

Test ID:            PAY-CARD-009

Rail:               Card / EFTPOS (two-step: authorise then settle)

Risk category:      Authorised-but-never-settled (lost merchant funds)

Pre-conditions:     Test card with sufficient funds; merchant settlement batch scheduled.

Action:             Take a $120.00 card payment; let the end-of-day batch run.

Expected result:    1) Authorisation approved; hold of $120.00 placed on the card.

                  2) After batch, $120.00 settles to the merchant account.

                  3) The hold is converted, not left to expire or duplicated.

Reconciliation:     Settled total equals authorised total for the batch; 0 authorised

                  transactions missing from settlement.

Evidence required:  Authorisation response; settlement batch report; merchant ledger

                  credit; authorised-vs-settled reconciliation table.

Traceability:       Risk R-03 (approved transactions not submitted for settlement).

Result:             [Pass / Fail]

Notice what makes this catch the Hook bug: the expected result has three numbered stages, not one, and stage 2 (settlement) is asserted explicitly. The reconciliation line demands that the settled total equals the authorised total with zero missing items — precisely the check that was absent at Kauri & Co. And the rail is named at the top, so the reviewer knows it is two-step.

9 Common Mistakes

🚫 Stopping the test at the approval screen

Why it happens: The approval beep feels like success and the demo ends there.
The fix: Approval is only a hold — the Kauri & Co trap. Follow every card payment through settlement to settled funds and reconcile the settled total against the authorised total. The dangerous bugs live after the approval screen.

🚫 Not testing idempotency on retries

Why it happens: The happy path sends one request and gets one response.
The fix: Networks drop. A client that retries a payment it is unsure landed can charge the customer twice. Test that resubmitting the same payment — same idempotency key — produces one charge, not two. This is essential on real-time rails where the result is final.

🚫 Treating a timeout as either a clear success or a clear failure

Why it happens: A timeout has no response, so it is tempting to assume one.
The fix: A timed-out authorisation is in an unknown state — it may or may not have landed at the bank. Test the reconciliation that resolves it: the payment is not treated as approved until confirmed, and the customer is never charged for an unconfirmed transaction.

🚫 Confusing refunds with chargebacks

Why it happens: Both move money back to the customer, so they look similar.
The fix: A refund is merchant-initiated and is a fresh payment in reverse; a chargeback is forced by the customer’s bank. They have different flows, evidence, and reconciliation. Test that a transaction cannot be both refunded and charged back for the same money, and that a refund cannot exceed the original.

10 Now You Try

Three graded exercises across the payment lifecycle. Write your answer, run it for AI feedback, then compare to the model answer.

🔍 Exercise 1 of 3 — Spot the Payment Risks

Read the description of a fictional online checkout for an NZ retailer below. Identify 3 payment risks that could lose money, double-charge a customer, or leave funds stuck, and name the part of the lifecycle each touches (authorisation, settlement, hold, idempotency, refund/chargeback).

Checkout: card payment for an online order
On “Pay now” the site sends a card authorisation and shows a success page as soon as it is approved. If the customer’s connection drops and they press “Pay now” again, the site sends a second authorisation. Settlement runs nightly, submitting all approved orders for the day. If an order is cancelled after approval but before settlement, the site marks it cancelled but does nothing to the authorisation hold. Refunds are processed by re-using the original authorisation reference, with no check on whether the order was already refunded.

List 3 payment risks and the lifecycle part each touches:

Show model answer

There are at least four real risks here; any three well-explained earns full marks.

1. No idempotency on retry — Pressing "Pay now" again sends a second authorisation, so a dropped connection can charge the customer twice. Lifecycle part: idempotency / authorisation. Impact: double charge. Fix: an idempotency key so a retry maps to the same single payment.

2. Hold not released on cancellation — An order cancelled after approval but before settlement leaves the authorisation hold in place, so the customer's funds stay locked until the hold expires on its own. Lifecycle part: hold. Impact: customer cannot spend their own money. Fix: release the hold on cancellation.

3. Refund with no already-refunded check — Re-using the original authorisation reference for refunds with no check means the same order can be refunded more than once. Lifecycle part: refund. Impact: the merchant pays out the same money twice. Fix: check refund status and that total refunded cannot exceed the original.

Bonus: success page on approval only — like Kauri & Co, the customer is told it succeeded at authorisation, before settlement is confirmed, so an authorised-but-never-settled order looks complete to everyone.

The trap: every one of these passes a "the payment was approved" test, because approval is the one thing that does work.

🔧 Exercise 2 of 3 — Fix the Test Case

The card payment test case below stops at approval and misses the real risks. Rewrite it to follow the payment to settled funds, with these fields: Test ID, Rail, Risk category, Pre-conditions, Action, Expected result, Reconciliation, Evidence required, Traceability. Use a fictional Harbour Bank-acquired retailer EFTPOS payment as the context.

Original (too shallow):
“Tap the card for $50. Check the terminal shows approved. Pass if it says approved.”

Rewrite as a full-lifecycle payment test case:

Show model answer

Test ID: PAY-EFT-015

Rail: Card / EFTPOS (two-step: authorise then settle)

Risk category: Authorised-but-never-settled / hold not converted

Pre-conditions: Test card with sufficient funds; Harbour Bank merchant account; settlement batch scheduled; card available balance captured as baseline.

Action: Tap the card for $50.00; allow the end-of-day settlement batch to run.

Expected result: 1) Authorisation approved and a $50.00 hold placed (available balance drops by $50.00, ledger balance unchanged). 2) After the batch, $50.00 settles to the merchant account and the customer's ledger balance drops by $50.00. 3) The hold is converted to a settled charge, not duplicated or left to expire.

Reconciliation: Settled total for the batch equals the authorised total; this transaction appears exactly once in the settlement report; 0 authorised items missing from settlement.

Evidence required: Authorisation response; before/after available and ledger balances on the card; settlement batch report line for this payment; merchant ledger credit; authorised-vs-settled reconciliation showing this item once.

Traceability: Risk register R-03 (approved transactions not settled) and R-05 (hold not converted).

What makes it strong: the rail is named, the expected result has distinct authorise/settle/hold stages instead of one "approved", it asserts on settled funds and the hold conversion, and reconciliation demands the settled total tie to the authorised total with the item appearing exactly once. The original asserted none of this.

🏗️ Exercise 3 of 3 — Build a Direct Debit Test Plan

Design a direct debit test plan of 5 test cases for a fictional NZ power company debiting a Pacific Bank customer’s account monthly. Each case needs at least: an ID, what it verifies, an acceptance criterion, and the evidence required. Cover valid authority, wrong amount, expired/cancelled authority, insufficient funds, and dispute/reversal.

Show model answer

DD-01 | Verifies: a debit succeeds only against a valid authority on file | Acceptance criteria: debit with a valid authority posts the correct amount; debit with no/invalid authority is rejected | Evidence required: authority record; posted debit; rejection log for the no-authority case

DD-02 | Verifies: a debit for the wrong amount is handled per the authority terms | Acceptance criteria: an amount outside the authority's agreed terms is rejected or flagged, not silently posted | Evidence required: authority amount terms; the attempted amount; the reject/flag outcome

DD-03 | Verifies: an expired or cancelled authority cannot be debited | Acceptance criteria: a debit against a cancelled/expired authority is rejected; 0 funds move | Evidence required: authority status (cancelled/expired) with date; rejection record; account balance unchanged

DD-04 | Verifies: insufficient funds is handled correctly | Acceptance criteria: the debit is declined (or handled per policy), no partial posting, customer balance not driven below limits incorrectly | Evidence required: balance before; decline response; balance after (unchanged)

DD-05 | Verifies: a disputed debit can be reversed | Acceptance criteria: a valid dispute reverses the debit, returning the exact amount to the customer; the reversal reconciles | Evidence required: dispute record; reversal posting equal to the original; before/after balances; reconciliation of the reversal

Strong plans: each case is specific, has a measurable criterion, names concrete evidence, and together they cover valid authority (DD-01), wrong amount (DD-02), expired/cancelled authority (DD-03), insufficient funds (DD-04), and dispute/reversal (DD-05). Weak plans say "check the direct debit works" five times — that is the difference being marked.

11 Self-Check

Click each question to reveal the answer.

Q1: Why is an approved card payment not a completed payment?

Because approval is only authorisation — a hold that reserves the funds. The money does not actually move until settlement, usually in a later batch. A payment can be authorised and never settle (the hold expires and the money stays put), so you must follow it to settled, reconciled funds before calling it complete.

Q2: What is the first question to ask about any payment feature?

Which rail is it on, and is it two-step or single-step? Card and EFTPOS are two-step — authorise then settle. A real-time transfer is closer to single-step and often irrevocable. The rail determines what can fail, when, and whether it can be undone, which shapes your whole test approach.

Q3: Why does idempotency matter on payments, and how do you test it?

Because networks drop, and a client that retries a payment it is unsure landed can charge the customer twice. You test it by resubmitting the same payment with the same idempotency key and asserting it produces exactly one charge, not two. It is most critical on real-time rails where the result is final and the client cannot tell if the first attempt succeeded.

Q4: What is the difference between a refund and a chargeback?

A refund is merchant-initiated — a willing return that is a fresh payment in the opposite direction with its own authorisation and settlement. A chargeback is forced by the customer’s bank, reversing a settled payment. Test that a transaction cannot be both refunded and charged back for the same money, and that a refund cannot exceed the original.

Q5: How should a timed-out authorisation be treated?

As an unknown state, not a success or a failure. The request may or may not have reached the bank. The payment must not be treated as approved until reconciliation confirms it, and the customer must never be charged for an unconfirmed transaction. Test the reconciliation path that resolves the timeout.

12 Interview Prep

Real questions asked in NZ QA interviews for payments roles. Read the model answers, then practise your own version.

“Walk me through how you would test a card payment end to end.”

I’d start by naming the rail — card/EFTPOS is two-step. Then I follow the whole lifecycle, not just the approval. First authorisation: approve, decline for insufficient funds and invalid card, and timeout. On approval I check a hold is placed and the available balance drops while the ledger balance does not. Then settlement: I let the batch run and confirm the funds actually move to the merchant, the settled amount matches, and the hold is converted, not duplicated or left to expire. Then reconciliation: settled total equals authorised total with nothing missing. Finally the dispute paths — refund and chargeback — making sure the same money cannot move twice. The whole point is never stopping at the approval screen.

“A customer says they were charged twice for one online order. What is your first hypothesis?”

My first hypothesis is a missing idempotency control. The customer’s connection probably dropped, they pressed pay again, and the system sent a second authorisation that also settled. I’d reproduce it by sending two identical requests without a shared idempotency key and confirm two charges result, then verify the fix — the same key should map to one payment. I’d also check the reconciliation: two settled items for one order should show up as a break the system flags, not something that silently goes through.

“How does testing a real-time payment differ from testing a traditional bank transfer?”

The obligations are the same — right amount, right payee, and it must reconcile — but the timing and irrevocability change everything about the risk. A traditional transfer settles in a scheduled exchange, so there is a window in which a mistake might be caught. A real-time payment is often final the moment it lands, any time of day including weekends. So I put more weight on getting the payee and amount right before sending, on idempotency because retries are dangerous when the result is final, and on the absence of a safety net — there is no comfortable settlement window to catch and reverse an error.

Senior engineer insight

The moment that changed how I approach payments testing was watching a settlement batch silently drop 11 transactions because the merchant terminal sent a duplicate batch reference — the gateway deduplicated and discarded them, the terminal reported all-clear, and neither side raised an alert. We only found it three days later in a reconciliation break. Since then I treat the settlement report as the source of truth, not the terminal response: every payment test I write has a reconciliation assertion that counts authorised items, settled items, and confirms the two numbers match.

Most common mistake: writing a test suite that achieves 100% coverage of the authorisation API and zero coverage of what happens between the end-of-day batch and the merchant's bank statement — which is exactly where the money either arrives or disappears.

From the field

A mid-size NZ retailer was migrating from a legacy Paymark integration to a Verifone cloud-hosted terminal estate. The team tested every card and contactless flow against the Paymark test environment and everything passed. What they did not test was the settlement file format: Paymark and the new Verifone gateway produced structurally different batch files, and the retailer's reconciliation script expected the old layout. For two weeks after go-live, the script silently mismatched transaction references — settled amounts were landing in the bank but the internal ledger was not marking them as settled, so the finance team was manually chasing “outstanding” payments that had actually cleared. The lesson that generalises: when you change a payment provider or gateway, the authorisation flow is usually the easy part — it is the settlement file format, the reconciliation script, and the PCI DSS scope boundary that need equal test coverage before you cut over.

Why teams fail here

They treat the approval response as the end of the test — the Kauri & Co trap — so settlement failures, batch drops, and hold-not-converted bugs ship to production undetected.
They skip idempotency testing because the happy path never retries, so the first time a customer's connection drops and they press Pay again, a double charge goes through unchallenged.
They test against a single payment rail in isolation and do not cover the timing and irrevocability differences of real-time payments — meaning their refund and dispute logic assumes a settlement window that no longer exists on fast-payment transactions.
They treat PCI DSS compliance as a checkbox handled by another team, so card data scoping creeps into test logs, screenshots, and mock data — creating a compliance breach in the test environment that mirrors production risk.

Key takeaway

In payments testing, the approval screen is the beginning of your job, not the end — every dollar that fails to settle, double-settles, or fails to reconcile was authorised successfully first.

← Core Banking & Ledger Testing Next: Open Banking & APIs →

Payments Testing

1 The Hook

2 The Rule

3 The Analogy

4 The NZ Payment Rails

5 Authorisation versus Settlement

6 The Payment Lifecycle and Chargebacks

7 What to Test in a Payment

8 Building Payment Test Cases

9 Common Mistakes

10 Now You Try

11 Self-Check

Related techniques

12 Interview Prep