Writing · architecture · planner-app

The four layers that made paid API calls safe to ship

Published 2026-04-10 Reading 6 min Tags architecture · spend · cloudflare

I spent three sessions planning the migration from Railway to Cloudflare Workers before I wrote a single line of the new spend module. Most of that time went to answering one question: how do I make it structurally impossible to send a paid SMS twice for the same reminder.

The problem

The planner app sends SMS reminders through Twilio on a schedule. Every SMS costs real money. Under the old Railway architecture the scheduler was a single long-running Python process; a bug in the loop would get noticed inside a minute because logs scrolled past in the terminal. Under Cloudflare, the scheduler runs as a Cron Trigger every minute, in a fresh Worker with no memory of previous invocations. A bug that retried the same send without protection could send the same reminder fifty times before I noticed.

I wrote down what "protected" had to mean, and it came out as four layers: a killswitch to stop everything, a budget cap per user, an idempotency guarantee so retries don't double-send, and only then the actual vendor call. Any one of those layers catches a different failure mode; skipping any one of them means a bug in the others becomes expensive.

Before I wrote a single line, I wanted the idempotency layer specifically right. I knew the naive "check if sent, then send" pattern had a race condition in it, but I hadn't built the mental model for what replaced it. Here's the conversation that closed the gap.

Variant D1 · with left border strip

Transcript · idempotency design 2026-04-09 · 4 turns

Dylan

The spend module needs idempotency or the whole thing is a liability. How do I make "send twice" structurally impossible rather than probabilistically unlikely?

Claude

Insert the row before calling Twilio. Put a UNIQUE constraint on (recipient, occurrence_id, date) and let the database reject the second write. On vendor failure after the insert, delete the row or mark it failed; never leave it in limbo.

Dylan

Does this need to be in a transaction?

Claude

No. The constraint does the work. A transaction would actually hurt: it holds the row lock across the Twilio call, which might take hundreds of milliseconds.

Transcript 1 · shown with variant D1

The insert-before-send pattern

The idempotency layer turned out to be the most interesting one. The naive approach is to check "have I sent this already" before calling Twilio, but that leaves a race window: two concurrent invocations both see no prior send, both insert the log after, both call Twilio. The fix is to invert the check: insert a row representing the intent to send first, with a UNIQUE constraint on (recipient, occurrence_id, date), then call Twilio. If two invocations race, exactly one succeeds at the INSERT; the other fails cleanly with a constraint violation and exits without sending.

The other three layers (killswitch, budget cap, vendor call) are easier to reason about individually but only work if they compose in the right order. Put the killswitch first, the budget cap second, the idempotency insert third, the vendor call fourth. Reverse any two and you get a bug that costs money instead of a bug that costs nothing.

Figure 1 · four-layer spend protection, insert before execute

What this bought me

Two weeks after the migration shipped, a bug in the reminder scheduler caused the same cron to fire twice in a 60-second window. Without the idempotency layer, that would have been two SMS sends for every active reminder in the system. With it, the second invocation hit the UNIQUE constraint, logged the rejection, and exited. Zero user-visible impact, zero duplicate sends, zero budget impact. I learned about the bug from an audit log query I ran the next day.

The query that surfaced it looked like this:

audit-query.sh · planner-app

$ wrangler d1 execute planner --remote --command \ "SELECT outcome, COUNT(*) FROM sms_log WHERE date = '2026-04-08'" ┌──────────────────────┬──────────┐ │ outcome │ COUNT(*) │ ├──────────────────────┼──────────┤ │ sent │ 47 │ │ rejected_duplicate │ 47 │ └──────────────────────┴──────────┘ [ok] 47 sends, 47 rejections caught by UNIQUE constraint [ok] budget impact: $0.00 · user impact: none

Figure 2 · audit query output · every send matched by a rejected duplicate

What I'd do differently

Principle 2 in my workspace architecture rules (defense in depth for spend protection) came out of this work. The thing I'd emphasise more, if I wrote it again: insert-before-send has to be the third layer at the earliest, never the last. If the UNIQUE constraint is downstream of the budget check and the budget check has a bug, the constraint still saves you. The other way around is not true.

A week after shipping the pattern, I checked in with Claude about whether the approach generalised beyond SMS:

Variant D2 · no left border strip

Transcript · pattern generalisation 2026-04-10 · 4 turns

Dylan

The cron double-fired yesterday. Idempotency layer caught it, zero user impact. I think this pattern generalises?

Claude

Yes. Any "send something external, exactly once" requirement reduces to: pre-insert with UNIQUE constraint, then call the vendor, then roll back the insert on failure. Same shape for SMS, email, webhook dispatch, and background job enqueueing. The key insight is that the constraint runs before the expensive call, not after.

Dylan

What about at-least-once vs exactly-once semantics?

Claude

This is exactly-once from the user's perspective. At-least-once semantics from the vendor are fine, because the retry will hit the same constraint and get rejected identically. The cost of one failed INSERT is trivial compared to the cost of a duplicate send.

Transcript 2 · shown with variant D2

The pattern is in the workspace architecture rules now. Every spend-touching module in this portfolio inherits it by default, and the principle number is cited in every relevant diary entry. The cost was three planning sessions and one small refactor; the return is a class of bug that can no longer happen.

Transcript variant D
in-article mock

The four layers that made paid API calls safe to ship

The problem

The insert-before-send pattern

What this bought me

What I'd do differently

Direct comparison

D1 · with left border strip

D2 · no left border strip

Transcript variant Din-article mock

The four layers that made paid API calls safe to ship

The problem

The insert-before-send pattern

What this bought me

What I'd do differently

Direct comparison

D1 · with left border strip

D2 · no left border strip

Transcript variant D
in-article mock