Case study · FinTech
Payments orchestration modernization
Client. Global payment processor (anonymized)
Problem
Peak traffic triggered retry storms; support could not trace a payment across gateways in under an hour. Change risk blocked country expansion.
Solution
Strangler migration to an event-backed orchestration core with deterministic replay, per-route SLOs, and runbooks tied to dashboards executives actually read.
Approach
Eighteen-month program in vertical slices: baseline tracing, replay harness, gateway abstraction, progressive country cutovers with automated canaries.
Tech stack
- Kubernetes
- PostgreSQL
- TypeScript
- Java
- Docker
- Azure
Results
- +4.1 pts authorization success on challenged routes
- −38% p99 routing latency at peak
- 12 min median time-to-triage for incidents