Automating Playable Ad Creative Variants with Serverless LLM Pipelines

The Problem

Playable ad campaigns require hundreds of creative variants. Each variant targets a different audience segment, platform placement, or A/B test cell. A single campaign launch can demand 50-200 unique creative combinations -- different hooks, call-to-action buttons, color schemes, and offer presentations. Doing this manually takes days. Designers write one variant at a time, QA reviews each one, and the bottleneck stalls campaign velocity.

For PlayableAd Studio, the challenge was acute. The platform generates MRAID-compliant playable ads for hyper-casual games. Clients needed to launch 10+ campaigns per week, each with 20+ creative variants. Manual generation was the #1 bottleneck reported in user surveys. A typical client with 5 game titles wanted 10 campaign refreshes per title per week, totaling 1,000+ creative variants monthly. At 30 minutes per manual variant, that's 500 hours of design work. Impossible to scale without automation.

The Solution

We built a **Serverless LLM Variant Pipeline** that runs on Cloudflare Workers. The pipeline takes a single base creative brief and generates 20-50 variants automatically. Each variant gets a unique headline, color palette, CTA text, and incentive offer -- all within the constraints of the playable ad's game mechanics and brand guidelines. The entire pipeline runs in under 3 minutes for a batch of 50 variants.

The pipeline exposes a simple REST API. A campaign manager submits a JSON brief once, and receives a manifest of downloadable MRAID files 3 minutes later. No waiting for designers, no back-and-forth revision cycles. The brief format supports explicit constraints per platform -- Meta placements require shorter headlines than TikTok, for example -- and the pipeline adjusts the generated variants accordingly.

Architecture

The pipeline has four stages:

Stage 1: Brief Parser

A Cloudflare Worker reads the campaign brief (target audience, game genre, platform, tone, color constraints, offer types) and extracts structured parameters. The brief format is a JSON document with 12 required and 8 optional fields. The parser validates all required fields and applies sensible defaults for optional ones. If the brief is missing critical information like platform placement, the parser returns a 400 error with specific guidance on what fields need attention. This prevents downstream variant generation from producing unusable creatives.

Stage 2: Variant Generator

Calls an LLM (OpenRouter API) with a prompt template to generate N variant descriptions. The prompt includes the structured parameters from Stage 1 plus examples of high-performing variants from the client's historical data. Each description includes headline (max 30 chars), subtitle (max 50 chars), CTA text (max 25 chars), visual style, and offer type. The generator uses a temperature of 0.9 to ensure diversity across the batch while staying within brand constraints.

```typescript

async function generateVariants(brief: Brief, count: number): Promise<Variant[]> {

const prompt = buildPrompt(brief, count);

const response = await fetch('https://openrouter.ai/api/v1/chat/completions', {

method: 'POST',

headers: { 'Authorization': `Bearer ${API_KEY}`, 'Content-Type': 'application/json' },

body: JSON.stringify({

model: 'openai/gpt-4o',

temperature: 0.9,

messages: [{ role: 'user', content: prompt }],

response_format: { type: 'json_object' }

})

});

const data = await response.json();

const variants: Variant[] = JSON.parse(data.choices[0].message.content).variants;

return variants.filter(v => validateConstraints(v));

}

```

Stage 3: Constraint Validator

Validates each variant against brand guidelines -- no forbidden words, CTA length limits, color accessibility WCAG contrast ratios. Invalid variants are regenerated with refined prompts that include the specific constraint that was violated. The validator also checks for duplicate variants: if two generated descriptions have cosine similarity above 0.85, one is discarded and regenerated. This ensures real diversity across the batch instead of surface-level number changes.

Stage 4: Asset Assembler

Merges variant parameters with the playable ad's game template to produce MRAID-compliant HTML files. Outputs are stored in R2 with a 24-hour cache TTL and returned as a manifest JSON. The manifest includes variant IDs, file download URLs, and metadata tags for ad network upload. The assembly step runs entirely on Workers -- no Lambda or EC2 instances needed.

Results

After deploying the pipeline for 3 pilot clients over 6 weeks:

- **Campaign velocity**: 4x increase in creative variants per week (from 15 to 60+)

- **Time to campaign launch**: Reduced from 2 days to 4 hours per launch (85% reduction)

- **A/B test throughput**: 8 concurrent A/B tests running vs 1-2 before the pipeline

- **Cost per variant**: $0.03 per variant (LLM API + Worker compute)

- **Client satisfaction**: 85% of auto-generated variants met or exceeded manual quality benchmarks

- **Variant diversity index**: 0.92 out of 1.0 for headline uniqueness across 50-variant batches

Key Takeaways

- LLM-powered creative generation eliminates the manual bottleneck, enabling 10x output at 1/10th the time

- Serverless architecture keeps costs near zero at rest -- no idle GPU instances to pay for

- Constraint validation is critical; invalid variants waste ad spend more than not generating them

- Structured output via response_format reduces parsing errors by 90% compared to raw text completion

- Pipeline retries achieve 99.2% batch completion rate with max 3 attempts per failed variant

- Diversity scoring prevents the LLM from generating near-identical variants under different numbers