ArchitectureMarketplace

Survived 14x traffic on launch day.

Consumer platform pre-launch. We modelled the scaling cliff, sharded the hot table, and stood up read-replicas before TechCrunch published.

ClientConsumer platform

Duration6 weeks · 2 engineers

Regionus-east-1, us-west-2

CategoryArchitecture

14xPeak traffic above baseline

99.97%Availability through launch week

<200msp99 response time at peak

0Customer-facing incidents

THE PROBLEM

What we walked into.

Consumer platform had a TechCrunch feature scheduled and a pre-launch following of 80k waitlist signups. The current architecture was a single-region Postgres deployment that had handled their beta of 2k DAU comfortably. And absolutely no further.

Six weeks to launch. Two backend engineers. The CEO wanted answers, not slideware.

THE APPROACH

How we shipped it.

Load model with real production traffic

Built a load test that replayed two weeks of production traffic at increasing multiples. Identified the exact request volume where p99 latency crossed 1s. The scaling cliff. Then designed for 4x that.

Hot-table sharding behind a read API

The `activity` table was 80% of read load. Sharded by user ID across four logical shards behind a read API. Application code unchanged.

Read-replicas with geo-routing

Stood up read replicas in us-west-2 for the western US user base, routed via Route 53 latency-based records. Cut p99 for half the user base in half.

Pre-warmed cache and CDN

Pre-warmed CloudFront for all hot URLs the morning of the TechCrunch publish. ElastiCache pre-loaded with predicted hot keys.

STACK

AWS services in this engagement

EC2 / ECS

RDS (sharded)

ALB

CloudFront

Lambda

THE OUTCOME

What shipped.

TechCrunch traffic peaked at 14x baseline. Platform held at p99 <200ms throughout the launch week. No customer-facing incidents, no engineer woken up.

The architecture has since handled three further traffic spikes from press coverage without further engineering work.

MORE CASE STUDIES

Security

Tell us what you're trying to ship.

A 30-minute scoping call with the engineers who would do the work.

Book a 30-min call See more case studies