Dedication

Dedicated to Laika & Tuffy

Front matter · What ships with this book

What ships with this book

Name: Design Systems That Build Themselves
Author: Ishdeep S Sahni

This is not a survey of the field. It is a working system — copy-paste-ready, production-grade, fully documented. Below is everything that lands in your hands the moment you open the kit.

Every artifact in this book exists as a downloadable file you can run today. The chapters explain why each piece exists and how to remix them; the kit is the runnable version. Open kit/README.html to start.

What this book is not: a magic generator. The system requires your inputs (three brand adjectives, optional brand colours, optional Figma file), your judgment at every checkpoint (archetype confirmation, BYOC palette review, drift triage), and an AI tool to drive (Claude in this book — see Ch17). What it does provide: a tested contract between human decisions and AI output, so the same inputs produce the same recognisable system every time.

The kit — every artifact, downloadable

5 archetype specs — Harbor, Sentinel, Current, Lattice, Meridian (.md + .html previews)
1 governance spec — universal quality gates that override every archetype
28 atomic prompts — copy-paste ready, every prompt with model + MCP + environment metadata; grouped into Generation, Validation, Path B workflow, Claude Design, Scenarios, and Code → Figma planning
10 prompt chains — multi-step orchestrations with checkpoints, plus the chain.md runner. Chain 9 (Path A end-to-end) and Chain 10 (Three-phase Path B) are the two main runs
kit.zip — one bundle, drop into your project root, run chain.md
Live HTML spec previews — 5 self-contained, theme-switchable archetype reference pages
Prompt Flow Map + Full Catalog — node diagram + step-list views of the full system
Figma MCP playbook — connection diagnostics, three-phase workflow, Code↔Figma round-trip

↓ Download kit.zip or browse the kit README

Five archetypes — pick the one that matches your product

Each archetype is a complete design language: tokens, components, layout vocabulary, banned patterns, voice rules, governance overrides. Click Spec for the rendered HTML preview; click .md for the raw paste-ready spec block.

Archetype	For	Defining trait	Files
Harbor	Editorial · marketing · brand	Atmospheric warmth, serif italic display, single-accent restraint, oklch shadows	Spec · .md
Sentinel	Enterprise · admin · workflow	Two-layer tokens, accent-CTA rule, tabular numerals, sans-serif workhorse	Spec · .md
Current	Mobile-first · consumer apps	44px touch targets, native font stack, safe-area insets, no hover-only logic	Spec · .md
Lattice	Data-dense terminals · trading desks · observability	Two-color-system architecture (chrome + data), monospace numerics, keyboard-first, flat elevation	Spec · .md
Meridian	Long-form reading · documentation · publications	72ch reading-width cap, 17px+ body, 1.7+ leading, sepia + dark + light, print stylesheet required	Spec · .md

Plus a sixth, universal: Governance — the spec that every archetype inherits. View HTML · Download .md.

How the system governs itself

Three feedback loops keep generated output aligned with the spec:

Loop	Catches	Cadence
Drift audit (Prompt 11 · Chain 4)	Hardcoded hex, off-grid spacing, invented tokens, semantic misuse	Pre-merge CI gate · quarterly full sweep
A11y audit (Prompt 12 · Chain 5)	WCAG 2.1 AA failures: contrast, focus, ARIA, keyboard, touch targets	Per-component before ship · post-refactor sanity
Figma write-back (Prompt 15 · Chain 8)	Code-Figma drift after token updates or BYOC palette changes	Sprint-end token sync · post-rebrand reconciliation

The audits do not require human review of every line — they read against the canonical tokens.css and the archetype's banned patterns, and surface only what diverged. The output is a severity-ranked report, not a wall of warnings.

Who this is for

You are	You get
Design system lead bringing AI integration into an established practice	A repeatable workflow that uses the official Figma MCP — not an open-source middleware experiment
Senior designer crossing into engineering via Dev Mode	Production-grade components on the first pass, not demos that need rework
Front-end engineer who owns the design system	Proven prompt chains for Figma → React (or Vue / Svelte), not experiments to be reinvented
Founder or PM without a design org	A coherent system that emerges from your brand inputs and evolves with the product

Ch01 makes the case for why this matters. The chapters from Ch02 onward are the system itself.

How to use this book

Three reading paths, depending on where you start:

Path A — Greenfield. No Figma file. Three brand adjectives in, a published design system out. Chapter 20 is your chapter — run Chain 9.
Path B — Standard. Existing Figma file. Use Chapters 15–19 for the Figma + MCP workflow. Run Chain 2 first.
Path C — Deep / audit. Existing work to reconcile against the spec. Run Chain 4 + Chain 5.

Or read in chapter order, top to bottom — that's the original sequence and it works for any path.

Front matter · About the author

About the author

This is not a survey of the field. It's a documented system — the one I run in production, weekly.

Ishdeep S Sahni

Product · Design · AI Workflows and Paradigms

Eighteen-plus years across design systems, cross-functional delivery, and the AI workflows that ship products. The patterns, prompts, and specs in this book are the ones I run in production — not a survey of the field, not a roundup of best practices. A documented playbook of what works, why it works, and what to swap when your situation differs.

The five archetypes (Harbor, Sentinel, Current, Lattice, Meridian) are my own framing, derived from years across editorial sites, enterprise platforms, mobile apps, analytics terminals, and documentation surfaces. The 28 atomic prompts and 10 chains are what I have shipped, retired, refined, and shipped again. The Figma × MCP × Claude integration is the workflow I switched to when it became clear that copy-pasting between tools was the bottleneck the rest of the industry was still treating as the workflow.

I'm reachable at mail@ishdeepsahni.com or on LinkedIn at linkedin.com/in/ishdeepssahni. If something in here breaks for your context, I want to hear about it — that's how the next version gets sharper.

Act I — Foundations · Chapter 00

The 2026 AI design landscape

Three paradigms exist right now. Until a spec binds them, all three drift. This chapter establishes the difference — and points at the substrate the rest of the book builds on.

Since Figma AI shipped at Config 2024 and Anthropic's vision-capable Claude landed the same month, three distinct AI tracks have emerged in the design and engineering workflow. From the outside they look similar — all three generate UI faster than a human can sketch it. From the inside they solve different problems entirely. Picking one without understanding the difference is like choosing a database before knowing whether you need a spreadsheet.

This chapter establishes the difference. The next chapter establishes why none of them produces a system on its own. The rest of the book is about the substrate — the spec — that turns any of the three paradigms into a system.

Three paradigms

Paradigm 1 · In-product

Figma Native AI

AI acceleration inside the Figma workspace. Rename layers, generate fills, FigJam diagrams, Figma Make. Stays within the design tool.

Solves: Repetitive in-Figma tasks: faster mockups, auto-named layers, diagram generation, prompt-to-frame inside Figma Make.
Boundary: No codebase awareness. Output is independent of the consuming product's tokens, components, or naming conventions.

Paradigm 2 · Generative

Claude Design

Claude generates rendered UI mockups, critiques designs across multiple screens simultaneously, and maintains full design system context across sessions via Projects. Speed from zero. No Figma precedent required.

Solves: First-screen velocity via rendered Artifacts. Visual exploration without an existing component. Cross-screen consistency audit via multi-image critique. Persistent design system context via Projects — spec loaded once, used across every session.
Production-grade when: Spec loaded into a Claude Project as persistent knowledge (Ch17). The spec is the contract that makes every Artifact, critique, and iteration recognisably yours — without repasting.

Paradigm 3 · Source-of-truth

Figma × MCP × Claude

Claude reads directly from a Figma file via the official Dev Mode MCP. Tokens, components, constraints become code references — not guesses.

Solves: Application-level consistency at scale. Every generated component inherits from the same canonical Figma artifact. The AI cannot invent a colour that does not exist in the file.
Boundary: Requires an established Figma file. From zero, Paradigm 2 is faster for the first screen.

The substrate that binds them

None of the three paradigms produces consistency on its own. The shared dependency is the spec — a structured, copy-pastable contract documenting an archetype's tokens, components, layouts, voice, banned patterns, and governance overrides. Two pages, paste-it-in, run anywhere.

Five archetypes exist — Harbor, Sentinel, Current, Lattice, Meridian (Ch9–11). Each has its own spec. Each spec runs about a thousand lines of structured text. The reader picks an archetype, copies its spec into a prompt, and gets back tokens, components, patterns, and a published spec page that match the spec exactly.

What the spec is, concretely

A spec is a fenced text block — copy-pastable, machine-consumable. It declares THEME, TOKEN NAMING, COLOR TOKENS, TYPOGRAPHY, DENSITY, ACCENT STRATEGY, ELEVATION, ICONOGRAPHY, SECTION TEMPLATES (or layout vocabulary per archetype), COMPONENT LIBRARY (in priority order), VOICE RULES, BANNED PATTERNS, WHEN TO PICK criteria, FAILURE MODES, GOVERNANCE OVERRIDES, and a closing PHILOSOPHY callout. Universal gates live in a separate governance.spec.md that pastes alongside the archetype spec.

Two pastes per prompt. That's the full workflow. The rest of the book is what the pastes contain, why those contents are the right contents, and how to extend or evolve them when your product needs something the canonical archetypes don't cover.

Three workflows, three paths

Because the spec is the substrate, three workflows compose around it. Each path uses a different starting point but lands at the same destination: production code that respects the spec's contract.

Path	Best for	Sequence
A · Quick	Greenfield. MVP validation. No Figma precedent. First-screen velocity matters more than artifact fidelity.	Description → Claude Project (spec as knowledge) → rendered Artifact → screenshot → Figma reference → tokens.css → components → spec page
B · Standard	Mature systems. Team handoff. Design review required before code ships. Most readers default here.	Figma file → MCP read → spec-driven prompts → tokens.css → production code
C · Deep	Quality gate before code ships. Auditing existing work against the spec. Compliance / governance contexts.	Figma → Claude critique against spec → revised Figma → MCP → code

Path B is what the bulk of this book teaches because it covers the largest reader audience. Paths A and C are equally complete — Ch20–22 covers each path's full pipeline. Use the path chooser at the top of every chapter to filter content for your path.

Run the system

Two artifacts the book points at — both already exist, both runnable today.

The Kit

Specs · prompts · chains

Every spec, every prompt, every chain in one bundle. Run chain.md for guided sequential execution. Or paste individual prompts ad-hoc.

Open the kit ↓ Download kit.zip

Prompt Flow Map + Full Prompt Catalog

Two views of the prompt system

A connected node diagram across Path A · B · C with chain.md front and centre — click any node to download its prompt. Plus a full step-list catalog for detailed per-path reading.

Open Prompt Flow Map Full Prompt Catalog

What this book does not cover

Out of scope

Pure Claude design workflows — Anthropic SDK reference. We use it as a paradigm, not as the subject.
Figma plugin development.
React Native or mobile-native code generation beyond what the Current archetype produces (Ch11).
Design ops at hundred-contributor scale — Ch24 governance touches this; not the primary subject.
Intro to prompting fundamentals — assumes prior LLM use.

You'll do best with this book if you know your way around Figma and have used an AI coding tool at least once. You don't need to be an engineer — but reading CSS and understanding what a component is should feel comfortable.

Act I — Foundations · Chapter 01

The AI design consistency crisis

AI can build a UI in minutes. It just builds a different UI every time.

Shipping speed is no longer the bottleneck — generation is effectively free. The new bottleneck is whether screen 47 belongs to the same product as screen 3. That gap, between "fast" and "cohesive", is the consistency crisis. This chapter pins down where the drift comes from, why prompting harder does not fix it, and the shape of the fix the rest of the book delivers.

The drift, in five layers

Same prompt, two days apart, returns two clean outputs that fail the side-by-side test. Each layer below would pass review on its own. All five together read as "different teams built these."

Layer	Run 1	Run 2	What the eye reads
Primary blue	`#2563EB`	`#3B82F6`	"Same brand?"
Corner radius	`8px`	`12px`	"Two button styles"
Body type	`14 / 20`	`16 / 24`	"Density mismatch"
Button padding	`10px 18px`	`12px 20px`	"Visually unbalanced"
Icon stroke	`1.5`	`2`	"Two icon libraries"

Why drift happens

Drift is not a prompting failure. It is the predictable behaviour of a generator with no visual memory of its prior runs. Four root causes, in the order they bite.

1

No visual memory across sessions

LLMs do not look at what they built yesterday before building today. Every generation starts from a blank canvas and reaches for "reasonable defaults" — defaults that vary by context, mood, and the specific examples that surfaced in training data.

2

"Reasonable" is not a value

Reasonable corner radius is anywhere between 4 and 16. Reasonable primary blue is a thousand hexes between #1E40AF and #60A5FA. Without an explicit number, the model picks a fresh point inside the reasonable range each run.

3

Pasting tokens narrows values, not application

A pasted tokens.css locks the right blue. It does not lock which element gets the blue, which alias hover state should use, or how icon-to-label spacing maps inside a specific component. Right values, wrong application — drift in disguise.

4

Only the artifact closes the loop

A Figma file read via MCP, or a fenced spec consumed inline, is what the AI must obey rather than improvise from. The artifact is canonical; the paste is a snapshot. Snapshots drift the moment the source updates.

Specification ≠ artifact

The two words look interchangeable. They are not. A pasted token file is a specification — a snapshot, open to interpretation. A Figma file read via MCP, or a fenced spec block consumed inline by the prompt, is the artifact — canonical, shared, current.

Pasted spec (text)

What it locks: token values. The right blue. The right radius. The right type ramp.

What it leaks: token application. Same blue, wrong element. Hover mapped to the wrong alias. Spacing chosen by analogy.

When to use: Path A, Paradigm 2 (Claude Design + Project). Greenfield. No precedent file. Load spec as Project knowledge — the spec becomes persistent context, not a paste. See Ch17 for Project setup.

Figma + MCP (artifact)

What it locks: values and component-to-token mappings. Hover state is whatever the Figma component says it is. Spacing is whatever the auto-layout encoded.

What it requires: a precedent file. From zero, the spec route ships first; the artifact route catches up by Ch15.

When to use: Path B / C, Paradigm 3. Mature systems. Team handoff. Design review before code ships.

The fix shape — five self-actions

"A design system that builds itself" is shorthand for five repeating loops. Each loop is run by the AI; each loop is gated by the spec; you decide six things and watch.

Loop	What it produces	You decide	AI does
Self-extracting	tokens.css from Figma (Path B/C) or from the confirmed archetype spec (Path A)	Source: Figma file, or three adjectives → confirmed archetype spec	Reads via MCP or generates from spec; verifies WCAG; emits dark-mode
Self-generating	Component code with all variants and states	Component scope (which list of components)	Produces every variant, state, token reference in place from the first pass
Self-auditing	Drift report + a11y report on every change	Severity threshold: fail / warn	Runs audits as prompt chains continuously, not as last-minute ship-blockers
Self-documenting	Storybook stories, changelog, quality-gate reports	Disclosure depth (internal vs published)	Generates docs alongside components, never retrofitted after launch
Self-governing	PR-time gates encoded as prompts	Which gates are blocking vs advisory	Runs the audit on every PR; surfaces drift before it compounds

Your role vs AI's role

Six decisions for you, in maybe thirty minutes of thinking. Hours of mechanical execution for the AI. The split is not negotiable — handing AI taste decisions produces template-AI ("Unlock the Power of…"); handing yourself mechanical decisions produces a six-week sprint that should have been an afternoon.

You — taste & judgment

Pick three brand adjectives.

Confirm the AI-suggested archetype, or override (Ch02).

Decide dark-mode strategy.

Pick a tech stack (React / Vue / web components).

Path A: run the chain. Path B / C: point at a Figma source.

Say go / no-go at ship.

AI — mechanical execution

Suggest an archetype from your three adjectives — first AI step in Path A, runs before token generation.

Extract tokens from Figma, or generate from the confirmed archetype spec.

Generate dark-mode overrides + WCAG verification.

Produce every component's code, every variant, every state.

Add a11y annotations; generate Storybook stories.

Run drift audits on every PR.

Maintain governance gates as prompt chains.

Who this book is for

Four readers, different starting points, the same destination. If you don't see yourself in one of these, the rest of the book will probably frustrate you — and that's fair.

Persona 01

Design system leads brought into AI integration

You've shipped systems before. Tokens, components, governance — second-language. Now Cursor is on someone's screen and you need a repeatable workflow with the official Figma MCP, not an open-source middleware experiment.

Persona 02

Senior designers crossing into engineering

You ship HTML / CSS in Figma Dev Mode. You've experimented with AI generation and hit inconsistency walls. You want production-grade components on the first pass, not demos.

Persona 03

Front-end engineers who own the design system

You inherit Figma files and turn them into React. Eighty percent of the work is mechanical and you sense AI should be doing it. You want proven prompt chains, not experiments.

Persona 04

Founders & PMs without a design org

You need a coherent system to exist, but six weeks of hand-crafting is not on the table. You want it to emerge from your brand inputs, then evolve with the product.

What this book gives you

Five concrete deliverables. Everything between Ch02 and Ch24 reduces to one of them.

1

Five archetypes, not one

Harbor (editorial), Sentinel (enterprise), plus three specialists — Current (mobile-first), Lattice (data-dense), Meridian (long-form reading). Picked at Ch02; built out at Ch09–11.

2

28 atomic prompts in the kit, plus dozens of teaching prompts inline

The kit ships 28 numbered, copy-paste prompt files — each labelled with its required MCP, recommended model, and execution environment. Tokens, components, Figma → Code, Code → Figma, accessibility, governance. Every kit prompt includes a meta-header (prerequisite check + branching questions) and meta-footer (auto-save, HTML preview offer, handoff to next step). BYOC colors and brand fonts are asked inline in Prompt 01 — no separate prompt to track. Throughout the chapters, an additional set of inline teaching prompts walk through specific decisions (archetype selection, hybrid-product strategy, drift triage). 50+ prompts in total when teaching examples are counted — but the kit's 28 are the canonical, runnable surface.

3

Ten prompt chains

Multi-step workflows with artifacts at every step. Chain 9 covers Path A end-to-end (adjectives → published spec page in an afternoon); Chain 10 covers Path B (existing Figma → production code via the Three-Phase loop).

4

The 12 anti-patterns AI builders drift toward

Banned aesthetics with concrete replacement patterns. The "Unlock the Power" cliché, the floating placeholder label, the cold rgba shadow on a warm palette.

5

Figma Dev Mode MCP mastery

How to use Figma's official MCP to extract tokens, generate components, and keep code and Figma in sync — read and write, both directions.

6

The Kit — every spec, every prompt, every chain, downloadable

One bundle you copy into your project. Run chain.md in Claude Code for guided sequential execution — prompts save their own outputs (output/tokens.css, output/foundation.css, etc.), ask BYOC and font questions before generating, and hand off to the next step automatically. Or paste individual prompts ad-hoc; each is self-contained. Open the kit →

7

Prompt Flow Map + Full Prompt Catalog

How every prompt composes across Path A · B · C. Prompt Flow Map → — connected node diagram; chain.md entry front and centre; click any node to download its file. Full Prompt Catalog → — detailed per-path step list with descriptions for every prompt.

Chapter 24 closes the loop with the five chains running end-to-end. Everything between this chapter and that one is the vocabulary, the reference systems, and the individual prompts that compose into those chains. The Kit and the Flow Map sit alongside the book as the runnable surfaces.

What this saves — order-of-magnitude estimate

Numbers below are reference ranges from running this workflow on greenfield and existing-system projects. Your numbers will differ — log your before / after as you adopt and replace the placeholders.

Activity	Hand-crafted (typical)	Spec + kit (this book)	Yours
Token foundation (3 layers, light + dark)	2–3 days	20–40 min (Prompts 02 + 02b)	__
Core component pass (Button, Input, Badge, Card, Modal)	1–2 weeks	2–3 hours (Prompt 03 + Workflow C critique)	__
Spec page (rendered, copy-paste ready)	3–5 days	30–60 min (Prompt 08)	__
First archetype to first ship-ready set	4–6 weeks	1 afternoon (Chain 9 end-to-end)	__
Drift audit on a PR	30–90 min manual	2–5 min (Prompt 11 in CI)	__
Headcount to maintain a production system	3–5 (lead, designer, 2 engineers, doc owner)	1 lead + 1 engineer (part-time on system)	__

Caveat: numbers are total-project speed-ups, not strict apples-to-apples per-task ratios. The kit removes work entirely (e.g., Layer 3 token derivation by hand vs Prompt 10) more than it makes existing tasks faster.

What's next

The first fork in the road is the one that compounds the hardest if you take it wrong: which archetype of design system are you actually building? Picking the wrong archetype costs more than not picking — every downstream decision inherits the mismatch. Chapter 02 walks the five-way decision tree and lands you on one of Harbor, Sentinel, Current, Lattice, or Meridian.

Act I — Foundations · Chapter 02

Five archetypes of design systems

Most "design system" advice treats the term as one thing. It is not — and the choices the five archetypes make are often the opposite of each other.

This chapter does three things, in order. It introduces the five archetypes I work with — Harbor, Sentinel, Current, Lattice, Meridian — and links to the live spec for each. It dismantles the popular framing that "three brand adjectives" deterministically produce a design system, and replaces it with an honest narrowing-then-evaluating loop. And it lands the reader on a single archetype via a three-question decision tree, which becomes the primary input to every prompt for the rest of the book.

The five — at a glance

Each card opens a live working preview — what running Prompts 1, 45–47, and 49 actually produces when fed that archetype. The paste-able spec text — the fenced block you copy into Prompt 1 — is embedded as a <spec-card> with a COPY button in Ch03 (alongside the prompt that consumes it) and again in each archetype's deep-dive chapter (Ch09–11). The preview shows you what to expect; the spec-card gives you what to paste.

Archetype 01

Harbor

Editorial & marketing. Dark-first, single-hue, fluid serif display, semantic-only token naming, oklch warm shadows.

Open the spec

Archetype 02

Sentinel

Enterprise & product. Light-first, compact, modular type scale, primitive + semantic alias architecture, multi-ramp palette.

Open the spec

Archetype 03

Current

Mobile-first consumer. Touch-first 44px targets, thumb-zone layouts, system fonts, bold multi-accent palette, gesture patterns.

Open the spec

Archetype 04

Lattice ^*

Data-dense analytics. 12px base, monospace numerics, tabular rhythm, multi-pane chrome, keyboard-first, two-system tokens (chrome + data).

Open the spec

Archetype 05

Meridian

Long-form reading. Body-text-as-hero (17–19px, 1.75 leading), reading-width caps (~72ch), minimal chrome, semantic + reading-role tokens.

Open the spec

^* Lattice, the archetype name, is unrelated to Lattice the HR product — used here as a categorical label for data-dense analytics surfaces.

Harbor and Sentinel together cover roughly 60–70% of real products in my experience. Current, Lattice, and Meridian are the specialist edge cases. The shared dependency, regardless of which you pick, is the universal governance spec — non-negotiable gates that paste alongside the archetype-specific spec.

Twelve dimensions, opposite choices

The reason archetype choice compounds is that they make opposite calls on almost every dimension. Forcing one archetype's choices onto another's use case produces a system that fights itself — a Sentinel-density dashboard rendered in Harbor's fluid serif, or a Current mobile flow rendered with Lattice's 12px tables.

Dimension	Harbor	Sentinel	Current	Lattice	Meridian
Theme	Dark-first	Light-first	Both first-class	Dark-first	Light + sepia + dark
Token naming	Semantic only	Primitive + semantic	Semantic + brand ramp	2-system (chrome + data)	Semantic + reading roles
Body type	Fluid clamp() serif italic	Fixed 14px sans	17px native sans	12px sans + mono numbers	17–19px serif, 1.75 leading
Density	Generous (48–128px)	Compact (4px base)	Generous (touch zones)	Maximum (2px base)	Reading-comfortable, 72ch cap
Accent strategy	Single hue	Multi-ramp + semantic	Multi-accent bold	2 systems, kept separate	Minimal (link only)
Elevation	Background shift + warm shadow	Flat + hairline borders	Soft mobile shadows	Hairline grid + 1px rules	Flat, almost shadow-less
Card radius	16px (soft)	4px (crisp)	12–16px (touch-friendly)	2–4px (data-dense)	0–4px (utilitarian)
CTA hierarchy	One primary per section	Primary / secondary / tertiary / destructive	One primary, full-width, thumb-reachable	Compact tabular actions, keyboard-first	Inline links primarily
Iconography	Phosphor	Lucide	Phosphor + native iOS / Material	Tabler	Heroicons
Voice	Editorial, aspirational	Clear, helpful, respectful	Direct, action-led	Terse, technical, abbreviation-heavy	Calm, authoritative, slow
Severity-via-border	Banned	Used (domain-correct)	Used sparingly	Used heavily for status chips	Banned
Typical session length	2–5 min	20–60 min	1–10 min, many returns	4–8 hrs (terminal)	15–45 min reading

Adjectives: narrowing, not determining

The most cited shortcut for picking among the five is "describe your brand in three words and the AI will infer the archetype." That shortcut works — but the popular framing is wrong about what it does. Adjectives do not determine a design system. They narrow the space the AI generates inside.

Reading	What it claims	Reality
Popular framing	Three adjectives in → deterministic design system out	Same adjectives, different runs, materially different palettes
Honest framing	Three adjectives narrow the option space; AI fills in the specifics within that space; you evaluate and either commit or tweak the adjectives	Reproducible at the category level (warm vs cool, generous vs compact), interpretive at the specifics level (which warm, which generous)

Adjective dictionary

A reference for what each adjective pre-commits before you even press run. Mix and match three from the list — the AI's interpretation will land in roughly the territory described.

Adjective	Hue family	Saturation	Density / contrast
`warm`	Reds, oranges, ambers, ochres	Balanced to muted	Generous spacing, soft contrast
`cool`	Blues, teals, slate	Balanced	Compact rhythm, crisp contrast
`editorial`	Muted neutrals + single accent	Restrained	Generous (Harbor or Meridian density)
`precise`	Cool or neutral	Restrained	Compact (Sentinel) or maximum (Lattice)
`restrained`	Greys, single hue allowed	Muted	Single accent, hairline elevation
`confident`	Saturated, anchored	Vivid on accent only	Bold weight distribution, no pastels
`calm`	Muted neutrals + cool accent	Muted	Flat surfaces, minimal motion
`vibrant`	Multi-hue, saturated	Vivid	Multi-accent (Current territory)
`quiet`	Neutrals, single muted accent	Muted	Generous, body-as-hero (Meridian)
`sharp`	Cool or neutral	Vivid on accent only	Crisp 4px radii, no soft surfaces
`spacious`	Any (neutral-leaning)	Restrained	Maximum whitespace, large type
`technical`	Cool, blue-leaning	Restrained on chrome, vivid on data	Maximum (Lattice — 12px base)

This dictionary is a starting position, not a contract. The AI may interpret warm as #C9A961 on one run and #D4A853 on the next; both are warm. The category is reproducible; the specific hex is not. Generate once, evaluate, lock the output as your canonical tokens.css, never re-run from scratch.

The archetype-from-adjectives prompt

Cascade step 2 in the workflow below runs this prompt. Three adjectives in; one archetype recommendation out, with explicit per-dimension reasoning so you can confirm or override before downstream prompts inherit the choice. The prompt is interpretive, not deterministic — its job is to surface a defensible recommendation from a narrowed option space, not to produce the same answer twice. If the recommendation feels wrong, sharper adjectives change the result more reliably than re-running the prompt.

Archetype from adjectives

PROMPT — ARCHETYPE FROM ADJECTIVES

YOUR ROLE
You map three brand adjectives to one of five archetypes — Harbor,
Sentinel, Current, Lattice, Meridian — and surface your reasoning per
design dimension so the user can confirm or override. You do NOT generate
any tokens, components, or specs in this step. Your only output is an
archetype recommendation with explicit per-dimension reasoning and a
brief case for the runner-up.

STEP 0 — RECEIVE INPUTS
  THREE BRAND ADJECTIVES: [adjective 1], [adjective 2], [adjective 3]
  PRODUCT TYPE (optional): [e.g. "marketing site", "data terminal", "mobile app"]
  USE CASE NOTES (optional): [e.g. "users sit for 4-hour sessions", "thumb-zone primary"]

If fewer than three adjectives are supplied, stop and ask. If the
adjectives are obvious filler ("modern", "clean", "intuitive", "innovative",
"powerful"), apply the elimination test in STEP 1 and ask for sharper
words before proceeding.

STEP 1 — APPLY THE ELIMINATION TEST
For each adjective, ask: could a competitor design system also claim this
adjective without lying? If yes, flag it as low-signal.

If two of three adjectives flag as low-signal, stop and request sharper
adjectives. Do not proceed to a recommendation — the input doesn't narrow
the option space enough to justify one.

If exactly one adjective is low-signal, note it and proceed; the other two
will carry the inference.

STEP 2 — INFER PER-DIMENSION TENDENCIES
Output ONE sentence per dimension, naming the value the adjectives
narrow toward and the adjective(s) driving that inference:

  - Theme: dark-first | light-first | both first-class | reading-comfortable
  - Density: generous | compact | maximum | reading-comfortable (Meridian's 72ch cap)
  - Type personality: serif-display + sans-body | sans-throughout | native-system | sans + mono-numerics
  - Accent strategy: single-hue | multi-ramp + semantic | bold-multi-accent | minimal | two-systems (chrome + data)
  - Elevation: warm-shadows | flat + hairlines | soft mobile | grid-based | almost-flat
  - Iconography: Phosphor | Lucide | Phosphor + native | Tabler | Heroicons
  - Implied session length: short brand visit | desktop work session | mobile bursts | long terminal | reading session

STEP 3 — SCORE AGAINST THE FIVE ARCHETYPES
Score each archetype against the inferred tendencies. Reference the
archetype profiles below:

  - HARBOR — dark-first, generous, serif-display + sans-body, single-hue,
    warm-shadows, Phosphor, short brand visit. Editorial / marketing.
  - SENTINEL — light-first, compact, sans-throughout, multi-ramp + semantic,
    flat + hairlines, Lucide, desktop work session. Enterprise / product.
  - CURRENT — both first-class, generous (touch zones), native-system,
    bold-multi-accent, soft mobile, Phosphor + native, mobile bursts. Mobile.
  - LATTICE — dark-first, maximum, sans + mono-numerics, two-systems,
    grid-based, Tabler, long terminal. Data-dense.
  - MERIDIAN — reading-comfortable, reading-comfortable density, serif-body
    + sans-display, minimal, almost-flat, Heroicons, reading session. Long-form.

Pick the closest match. If a second archetype scores within one or two
dimensions of the closest, name it as a secondary fit and explain what
would tip the choice (e.g., "Sentinel if dashboard-heavy; Lattice if data
density matters more than approachability").

STEP 4 — DELIVER THE RECOMMENDATION
Output the recommendation in EXACTLY this shape:

  ADJECTIVES: [restated, exactly as provided]
  ELIMINATION TEST: pass | flagged: [adjective] is low-signal because [reason]

  PER-DIMENSION INFERENCE:
  - Theme: [value] (driven by [adjective])
  - Density: [value] (driven by [adjective])
  - Type personality: [value] (driven by [adjective])
  - Accent strategy: [value] (driven by [adjective])
  - Elevation: [value] (driven by [adjective])
  - Iconography: [value]
  - Implied session length: [value]

  RECOMMENDED ARCHETYPE: [Harbor | Sentinel | Current | Lattice | Meridian]
  ONE-SENTENCE REASONING: [why this archetype, not another]
  SECONDARY FIT (if applicable): [other archetype] — [one-sentence tipping condition]

  NEXT STEP: confirm or override the recommendation. Then proceed to
  Prompt 1 (or Prompt 1b for BYOC) in Ch03 with the chosen archetype's
  spec from kit/specs/.

CONSTRAINTS
- Do NOT generate any tokens, components, or specs in this step. Output
  the recommendation block above and stop.
- Do NOT pick an archetype outside the five named.
- If you genuinely cannot fit the adjectives to any archetype (e.g., they
  describe a hybrid use case like "marketing site + data dashboard"), say
  so explicitly and propose a layered approach: name the primary archetype
  for the dominant surface and the secondary archetype for the other
  surface (Ch11 covers hybrid patterns).
- Do NOT paraphrase the adjectives in STEP 4's "ADJECTIVES" line. Restate
  them verbatim so the user can verify what you parsed.
- If the user's product type or use case notes contradict the adjective
  inference (e.g. adjectives lean Harbor but product is "internal admin
  tool"), surface the contradiction explicitly and recommend the archetype
  that fits the use case, not the adjectives.

Use with: three brand adjectives · Claude Opus · Claude.ai or Claude Code · runs before Prompt 1

The cascade — generation with checkpoints

The popular cascade diagram shows a clean linear pipeline: adjectives in, design system out. The honest cascade has explicit human gates between each stage so the interpretive nature of generation is visible — and correctable. In Chain Mode (kit/chain.md), the orchestrator collects steps 1–3 as upfront Q&A before any generation starts — archetype, BYOC colors, and brand fonts are all decided before the first prompt runs. In ad-hoc mode, Prompt 1's meta-header asks the same questions the moment you paste it.

Input · you

1

Three adjectives — your only design input

Pick words that have opinions. Apply the elimination test. If a competitor could claim it, swap it.

— Vector 1 · adjectives narrow palette, saturation, and density.

2

AI suggests an archetype → checkpoint: confirm or override

The archetype-from-adjectives prompt maps your adjectives to one of Harbor / Sentinel / Current / Lattice / Meridian and states its reasoning in one sentence per dimension. You confirm, override, or restart with sharper adjectives. This is the load-bearing decision — every downstream prompt inherits it.

3

Paste the two specs (and route the substitutions)

Open the chosen archetype's spec page (the cards above). Paste the governance spec first, then the archetype spec, into Prompt 01 in Ch03. If you're substituting brand colors, answer "Yes" to Prompt 01's BYOC question — the prompt activates ramp derivation and WCAG validation inline, no separate prompt needed. If you're substituting brand fonts, note them when Prompt 01 asks — they carry forward to Prompt 02 via output/chain-state.md, where the type-scale shape and role rules stay locked.

— Vector 2 · answer "Yes" to BYOC in Prompt 01. Vector 3 · supply brand fonts at Prompt 01 Q2 (applied in Prompt 02).

Generation · AI

4

AI generates tokens.css → checkpoint: review the palette + WCAG report

Prompt 1 (default) or Prompt 1b (BYOC) emits ~150–200 CSS custom properties — surfaces, text, accent, semantic colors, spacing, radii, motion — with light + dark variants and verified WCAG contrasts. Prompt 2 then layers the typography scale on top, applying your brand fonts if supplied. The route matters: Prompt 1's palette is interpreted from your three adjectives; Prompt 1b's palette is derived from your hex seeds, with WCAG validated against the archetype's surface and text roles (and flagged back if your seeds fail contrast). Same archetype shape, two different palette origins. Wrong direction at this gate? Tweak adjectives or seeds, re-run Stage 2.

— Vectors 1, 2, 3 take effect here: palette, saturation, and type face all crystallise.

5

AI generates components, patterns, and the spec page → checkpoint: visual review

Prompts 03–05 produce foundation, composite, and pattern components. Prompt 08 assembles a self-contained spec HTML page (the same shape as the spec links above). Visual mismatch at this gate? You're tweaking components, not regenerating tokens.

— Vector 4 · component scope (Prompts 03–05). Vector 5 · domain components (Prompt 07).

Lock · you

6

Lock the output as your canonical files → checkpoint: ship

Commit tokens.css, component.css, and the spec page as the source of truth. Never re-run the cascade expecting to recreate prior output. Use Prompt 11 (drift audit) for ongoing verification instead.

— Vector 6 · Layer 3 extraction runs against your locked CSS, lifting repeated values into new tokens.

Picking the archetype — a three-question tree

If the adjectives feel forced, the decision tree gets you to one of the five faster than any taste exercise. Three questions, eliminations along the way.

Q1 Is the user on mobile, sitting at a desktop, or working a long terminal session?

Mobile (touch-first, short sessions) → Current Terminal (4–8 hr sessions, data-dense) → Lattice Desktop, mixed sessions → next question

Q2 Is the primary task reading, using a product, or browsing brand surfaces?

Reading long-form (docs, articles) → Meridian Using a product (app, dashboard, workflow) → Sentinel Browsing brand surfaces (marketing, portfolio, editorial) → Harbor

Q3 Did you land on more than one? Or did none feel right?

More than one → you're a hybrid (Ch11 covers layered archetypes — Harbor for the marketing site, Sentinel for the product app) None right → revisit Q1; if still none, the framework may need extending (Ch11 covers "when the five aren't enough")

Archetype and palette are separable

A common pushback at this point: "What if I have my own brand colors but I don't want to invent the structure?" The structural decisions (density, type scale, naming, elevation, motion) and the cosmetic decisions (which hues, which saturation level) are independent. The book treats them as one decision by default — but Ch03 ships a Bring-Your-Own-Colors variant, Prompt 1b, that lets you keep the archetype and substitute your palette.

What's locked vs what's yours

The natural follow-up: "if Sentinel's spec is fixed, won't every product that picks Sentinel ship the same UI?" No — and not by accident. The archetype locks structural decisions; six named seams let you personalize within that structure. Two readers, both Sentinel, ship products that look visibly different. That's the design — Sentinel is the bones, not the skin.

Vector	What you bring	What it changes	Which prompt
1 · Adjectives	Three brand adjectives	Hue family, saturation range, density bracket — all narrowed within the archetype's structural rules	Prompt 1 (Step 0 inference) or the archetype-from-adjectives prompt
2 · Brand colors (BYOC)	Primary hex + optional accent hex as ramp seeds	Specific palette values; archetype's ramp shape and WCAG contracts unchanged	Prompt 1b — BYOC variant
3 · Brand fonts	Display + body font family (and import URL if web-fonts), or a system stack	Specific typeface(s) — archetype's type-scale shape and role rules are preserved (fluid clamp() vs fixed modular; serif-display vs sans-throughout; mono-numerics requirement, etc.)	Prompt 2 — typography scale, with font-family overrides
4 · Component scope	List of components you actually ship; which variants and states matter	Which components get built; everything stays archetype-correct	Prompts 45 (foundation) / 46 (composite) / 47 (patterns)
5 · Domain components	Two-sentence description of product-specific components (e.g. "policy editor with rule grid and approval timeline")	Net-new components unique to your product, generated under archetype rules	Prompt 07 — domain component generation
6 · Layer 3 token extraction	(No new input — runs against generated CSS)	Discovered component-level tokens unique to your codebase, lifted from values you repeat 3+ times	Layer 3 extraction prompt

Vectors 1–3 substitute values within the archetype's structural rules. Vector 4 picks scope. Vectors 5–6 add net-new tokens and components that are unique to your product. None of them rewrite the archetype; all of them produce a system that is recognisably yours.

Worth flagging: substitution has limits. Within the archetype's role rules, you have wide latitude — Inter, GT America, IBM Plex, Söhne all work as Sentinel's body sans. Across the role rules, substitution breaks the archetype — swapping Sentinel's body sans for a serif breaks Sentinel's reading rhythm; swapping Harbor's display serif italic for a geometric sans breaks Harbor's editorial signature. The same is true for colors: Sentinel handles a wide palette, but Harbor's single-hue rule means BYOC accepts one primary, not five.

Failure modes — forcing the wrong archetype

Every archetype I've identified has a predictable failure mode when applied to the wrong use case. The symptom always shows up at the system level, not the screen level — so spotting these early is the difference between a one-prompt fix (re-pick the archetype) and a two-week refactor.

Wrong archetype on…	Symptom	Switch to
Harbor on a dashboard	Tables collapse — generous spacing leaves no room for rows. Five rows visible where fifty are needed.	Sentinel or Lattice
Sentinel responsive-down to mobile	Buttons feel imprecise on phones, taps miss, hover-only affordances vanish.	Current (44px targets enforced)
Harbor on long-form docs	Serif italic display feels showy mid-article; readers complain the page "feels marketing-y."	Meridian (utilitarian headings)
Lattice on a marketing site	12px tabular rhythm feels hostile; brand surfaces read as product UI.	Harbor
Current on an analytics terminal	Bold accents and large touch targets eat the data density required for 8-hour sessions.	Lattice

What's next

The archetype is now your primary input. Adjectives are a first-pass aid for picking it, not a separate decision. The next chapter takes the chosen archetype's spec and runs it through Prompt 1 (or Prompt 1b for BYOC) to produce tokens.css — the token architecture that every component in Ch04 onwards will inherit from.

Act II — Token Architecture · Chapter 03

Token architecture

Tokens are the contract every component must obey. Architecture is what makes that contract survive rebrands, theme switches, and product evolution.

A token is a named value — --color-primary, --space-4, --text-base. A flat token list is a glossary. A token architecture is what lets you re-skin the system without touching component code, ship a dark theme without re-authoring 200 properties, and add Layer 3 component-tokens later without rewriting the foundation. This chapter establishes the architecture, ships the three prompts that produce it from your archetype spec, and tells you what to expect on the other side of running them.

Three layers — primitives, semantic aliases, components

Every token in this book lives at one of three layers. The layer it sits at determines who consumes it, when it changes, and what the blast-radius of changing it looks like.

Layer	What it is	Example	When it changes	Who consumes it
1 · Primitives	Raw values: hex, rem, px. The palette in its rawest form.	`--primary-700: #1B2A4A` `--space-4: 1rem`	Rebrand, theme switch.	Layer 2 only — never referenced by components directly.
2 · Semantic aliases	Role-based references that point at primitives via `var()`.	`--color-primary: var(--primary-700)` `--space-card-padding: var(--space-4)`	Role shift (a token's meaning moves), not a value change.	Components, patterns, page CSS.
3 · Component tokens	Optional escape hatches when a single component legitimately needs a value the global semantic doesn't provide.	`--button-primary-bg: var(--color-primary)` `--card-radius: 16px`	Constantly during early authoring; rarely once stable.	Just that component.

Layer 3 is the layer most teams add too early. Don't author component tokens upfront. Let them emerge from real component CSS once you've shipped a few iterations — the Layer 3 extraction prompt scans your locked CSS for values repeated three or more times and proposes which ones deserve a token. Until then, components reference Layer 2 directly.

Why the layering matters

Three scenarios illustrate the cost of conflating layers:

Rebrand. Your primary changes from indigo to teal. With layered tokens, you change one primitive (--primary-700) and every component inherits. With flat tokens, you grep your codebase for #1B2A4A and pray you find every reference.
Theme switch. Light-to-dark swap. With layered tokens, the [data-theme="dark"] override re-points primitives only — semantic aliases stay untouched, components stay untouched. With flat tokens, every component needs its own dark-mode rule.
Role shift. Your "primary" was the brand color; a redesign promotes "accent" to be the brand color and demotes "primary" to a structural neutral. With layered tokens, you re-point semantic aliases — primitives stay; components stay. With flat tokens, every component that referenced the old primary needs editing.

The wrong-versus-right pattern in CSS:

Wrong vs right · component → token reference

/* WRONG — component references a primitive directly. Tightly coupled. */
.button-primary {
  background: var(--primary-700);
}

/* RIGHT — component references a semantic alias. Decoupled. */
.button-primary {
  background: var(--color-primary);
}

/* The semantic alias resolves to a primitive once, centrally, in :root */
:root {
  --color-primary: var(--primary-700);
}

[data-theme="dark"] {
  --primary-700: #7AA8E0;  /* override at the primitive layer only */
}

Where the specs come from

Every prompt below opens with two pastes: governance.spec.md first (universal — same text for every archetype, every prompt that consumes a spec) and your chosen archetype spec second. Both are embedded inline below as COPY-able blocks. The archetype block carries a dropdown — pick Harbor, Sentinel, Current, Lattice, or Meridian and the spec updates in place; hit COPY to grab the full text. The Ch02 archetype-at-a-glance cards are the browsing surface; this is the running surface.

Pick your archetype's spec

Use the dropdown to switch archetype. Spec updates inline; COPY grabs whichever is currently shown. Paste second, after governance.

Archetype spec

Loading archetype spec…

Use with: Prompts 1 / 1b / 45 / 46 / 47 / 49 / 51 — paste SECOND, after governance

Governance — paste this FIRST, every time

The governance spec is universal. Copy the block below and paste it FIRST into Prompts 1, 1b, 49, 51 (and any other prompt that lists governance.spec.md in its inputs). Then paste your chosen archetype spec second. Two pastes, every prompt.

Governance · Universal Spec · v1.0

═══════════════════════════════════════════════════════════════
GOVERNANCE · UNIVERSAL · v1.0
Applies to ALL archetypes — paste before archetype spec
═══════════════════════════════════════════════════════════════

───────────────────────────────────────────────────────────────
ACCESSIBILITY GATES (WCAG 2.1)
───────────────────────────────────────────────────────────────
• Contrast ratios:
    - Body text:        ≥ 4.5:1 (AA), aim for AAA on primary body
    - Large text (18pt+ or 14pt bold): ≥ 3:1 (AA)
    - Non-text UI (icons, focus rings, chart elements): ≥ 3:1
    - Decorative-only elements: exempt
• Every spec page MUST include an Accessibility Verification
  table computed from the actual token values (FG / BG / Ratio /
  Status pill: AAA / AA / FAIL). Do not omit. Do not estimate.
• Failing combinations must be flagged with "FAIL — do not use"
  and an alternative pairing suggested.

───────────────────────────────────────────────────────────────
KEYBOARD NAVIGATION
───────────────────────────────────────────────────────────────
• Every interactive element reachable via Tab in DOM order
• Focus indicator visible: 2px outline, offset 2px,
  --color-focus-ring (defined per archetype)
• Standard key bindings honored:
    Enter / Space  → activate buttons, toggles, links
    Esc            → close dialogs, drawers, popovers, menus
    Arrow keys     → navigate within tabs, radio groups, menus,
                     and listboxes
    Home / End     → jump to first / last item in lists
    Tab / Shift+Tab → forward / back through focusable elements
• Focus trapping: required for modal dialogs, drawers, command
  palettes. Esc returns focus to trigger.
• Skip-to-content link required at top of every page.

───────────────────────────────────────────────────────────────
SCREEN-READER REQUIREMENTS
───────────────────────────────────────────────────────────────
• All icon-only buttons must have aria-label
• All form fields must have programmatic labels (label[for] or
  aria-labelledby — never placeholder-as-label)
• Live regions for toasts, async results, validation errors
  (aria-live="polite" default; "assertive" only for errors
  blocking task completion)
• Test matrix: VoiceOver (Safari/macOS), NVDA (Firefox/Win),
  TalkBack (Chrome/Android) for any component shipped to mobile
• Decorative SVGs: aria-hidden="true". Meaningful SVGs:
  role="img" + <title>.

───────────────────────────────────────────────────────────────
MOTION & ANIMATION
───────────────────────────────────────────────────────────────
• prefers-reduced-motion: reduce → fallback to fade-only or
  no animation. Never just shorten duration; remove transform
  animation entirely.
• No autoplay video or animated backgrounds without user
  consent.
• Vestibular safety: avoid parallax > 20% offset, scroll-jacking,
  spinning loaders > 2s without text label.

───────────────────────────────────────────────────────────────
TOKEN HYGIENE
───────────────────────────────────────────────────────────────
• Zero raw hex in component CSS. Every color, every size, every
  shadow goes through var(--token-name).
• Components reference SEMANTIC tokens, never primitives.
  Bad:  color: var(--color-blue-500);
  Good: color: var(--color-link);
• Both light and dark mode tokens defined for every semantic
  token, even if a mode is unshipped.
• Tokens declared in a single :root + [data-theme="dark"]
  override block. No tokens defined inside component scope.

───────────────────────────────────────────────────────────────
COMPONENT STATE COVERAGE
───────────────────────────────────────────────────────────────
Every interactive component must implement these states
(visually distinct, not just functionally present):
    default
    hover           (pointer devices only — :hover)
    focus-visible   (keyboard users — :focus-visible, NOT :focus)
    active          (pressed — :active)
    disabled        ([disabled] or aria-disabled="true")
    loading         (where applicable — async actions)
    error           (where applicable — validation/failure)
    success         (where applicable — confirmation states)

Form fields additionally implement:
    empty / filled / required / read-only

───────────────────────────────────────────────────────────────
ERROR-MESSAGE FORMULA (UNIVERSAL)
───────────────────────────────────────────────────────────────
Every error message follows What → Why → How:
    What: [what happened, plain language]
    Why:  [the cause]
    How:  [recovery action — concrete, doable]

  Bad:  "Invalid input."
        "Something went wrong. Try again."
  Good: "Email format isn't recognized. Try name@example.com."
        "Upload failed. The file is over 10MB.
         Try compressing it."

Banned: stack traces in user-facing copy, error codes without
explanation, generic "Oops!" / "Whoops!" messaging.

───────────────────────────────────────────────────────────────
COPY HYGIENE (UNIVERSAL)
───────────────────────────────────────────────────────────────
• Sentence case for headings, buttons, labels (not Title Case,
  not ALL CAPS — except for an "overline" eyebrow style)
• No emoji in UI copy. SVG icons only. (Emoji acceptable in
  user-generated content rendering.)
• Contractions allowed (we'll, don't, can't) — softer voice
• Verb-first CTAs ("Save changes", not "Changes can be saved")
• Banned filler: "Please", "Kindly", "Just", "Simply"
• Banned marketing fluff in product UI: "Let's embark on…",
  "Unlock the power of…", "Revolutionize your workflow"

───────────────────────────────────────────────────────────────
ICONOGRAPHY (UNIVERSAL FLOOR)
───────────────────────────────────────────────────────────────
• SVG only. Inline preferred for color inheritance.
• fill="currentColor" or stroke="currentColor" — never hex
• aria-hidden="true" for decorative; <title> for meaningful
• Consistent grid (24px standard; archetype may override)
• Consistent stroke width within a family
• Never mix icon families on the same surface
• Banned: emoji as icon, raster (.png) icons, animated GIF icons

───────────────────────────────────────────────────────────────
VERSIONING
───────────────────────────────────────────────────────────────
SemVer (MAJOR.MINOR.PATCH):
    MAJOR — breaking token rename, removed component, breaking
            API change
    MINOR — new component, new variant, new token, deprecation
            notice
    PATCH — bug fix, doc update, non-visual refactor

Deprecation policy: deprecated tokens/components ship for
2 MINOR versions before removal. Deprecation warnings logged
in console at build time.

───────────────────────────────────────────────────────────────
CHANGELOG TEMPLATE
───────────────────────────────────────────────────────────────
## [VERSION] — YYYY-MM-DD

### Added
- [new components, tokens, variants]

### Changed
- [modifications to existing surfaces]

### Deprecated
- [old → new migration path, removal version]

### Removed
- [breaking removals — MAJOR only]

### Fixed
- [bug fixes]

### A11y
- [accessibility improvements — call out separately]

───────────────────────────────────────────────────────────────
QUALITY-GATE CHECKLIST (PR / RELEASE BLOCKING)
───────────────────────────────────────────────────────────────
☐ Contrast ratios verified (AA minimum, AAA for primary body)
☐ Keyboard navigable (Tab / Space / Enter / Esc / Arrows)
☐ Screen-reader tested (VoiceOver + NVDA minimum)
☐ prefers-reduced-motion honored
☐ Both light + dark mode tokens defined
☐ Storybook (or equivalent) story per variant + state
☐ No raw hex in components
☐ No emoji in UI
☐ Component implements default / hover / focus-visible /
  active / disabled
☐ Error messages follow What/Why/How
☐ Visual-regression snapshot committed
☐ Changelog entry under correct semver bump

───────────────────────────────────────────────────────────────
PERFORMANCE BUDGETS (universal floor)
───────────────────────────────────────────────────────────────
Performance is a design system concern. Slow systems erode every
other quality choice. Universal targets per page (every archetype):

First Contentful Paint (FCP):     < 1.5s on 4G mobile
Largest Contentful Paint (LCP):   < 2.5s on 4G mobile
Cumulative Layout Shift (CLS):    < 0.1
Time to Interactive (TTI):        < 3.5s on 4G mobile
Bundle (JS, gzipped):             < 100KB initial route
Bundle (CSS, gzipped):            < 30KB
Web font payload:                 < 100KB total (subset + woff2)
Largest image:                    < 200KB (WebP/AVIF preferred)

Per-archetype tightening (overrides):
  Current   — initial JS < 70KB (mobile networks)
  Lattice   — initial JS < 50KB; data-grid virtualization mandatory
  Meridian  — web font < 60KB (subset to article glyph set);
              first paint must show body text within 1s
  Harbor    — hero image < 150KB; font subset to display set
  Sentinel  — initial JS < 100KB (workhorse default)

Quality-gate addition: every release runs Lighthouse + WebPageTest
budget check. Regressions fail CI.

───────────────────────────────────────────────────────────────
INTERNATIONALIZATION (i18n)
───────────────────────────────────────────────────────────────
Every component MUST work in:

Direction:
  • LTR (default — English, most European)
  • RTL (Arabic, Hebrew, Persian, Urdu)
  Required: every component declares logical CSS properties
  (margin-inline-start instead of margin-left) so RTL flips
  correctly without per-language overrides.

Character ranges:
  • Latin (basic + extended)
  • CJK (Chinese, Japanese, Korean) — taller line-heights
    needed (1.6+ minimum); fonts may need to swap to
    region-appropriate (Noto Sans CJK)
  • Cyrillic, Greek, Devanagari per product reach

Locale-sensitive tokens:
  • Date / time formats — use Intl.DateTimeFormat, never
    hardcode "MM/DD/YYYY"
  • Number formats — Intl.NumberFormat handles
    locale separators, currency, units
  • Sort orders — Intl.Collator for string comparison
  • Pluralization — Intl.PluralRules + ICU MessageFormat

Banned i18n antipatterns:
  ✗ Hardcoded English strings in components (use i18n keys)
  ✗ Concatenated strings ("You have " + count + " items") —
    use ICU plurals
  ✗ Fixed-width buttons that truncate German (avg 30%+ longer)
  ✗ Direction-aware icons (chevrons, arrows) without RTL flip
  ✗ Assuming Latin characters in regex / validation
  ✗ ASCII-only password rules

───────────────────────────────────────────────────────────────
FORM VALIDATION STRATEGY (universal pattern)
───────────────────────────────────────────────────────────────
Validation behavior is a tonal choice — get it wrong and the
form feels nagging or surprises the user at submit time.

WHEN to validate:

  ✓ On blur (after user leaves the field)        — DEFAULT
  ✓ On submit (final check, always runs)         — REQUIRED
  ✓ On change (real-time), but ONLY for:
    - Password strength meters (visible feedback as user types)
    - Username/handle availability (debounced API check)
    - Match-confirmation fields (password confirm matches)

  ✗ On every keystroke for general fields (nagging, breaks
    flow — user is mid-thought)

WHAT to validate:
  • Required field empty
  • Format mismatch (email, URL, phone, date)
  • Range / length violations
  • Server-side rules (uniqueness, conflict, capacity)

HOW to display:
  • Error message replaces helper text below input
  • Field gets --color-error border + 3px focus ring on focus
  • Error message follows What/Why/How formula
  • Error icon to the left of message (color + icon redundant
    encoding — colorblind a11y)
  • aria-invalid="true" + aria-describedby pointing to error msg
  • Submit button stays enabled (per WCAG SC 3.3.4); never
    disable submit because that hides the form's validation
    state from the user

Per-archetype overrides:
  Sentinel  — inline validation on blur, parenthetical
              "(required)" not asterisk-only
  Current   — keyboard type matches input semantic
              (inputMode="numeric" / "email" / "tel")
  Meridian  — forms are rare; when present, generous spacing
              (--space-4 between fields), clear labels
  Lattice   — number-field formatting (mono + tabular figures);
              tab key advances within table-grid forms
  Harbor    — labels above (NEVER floating); error tone
              consistent with editorial voice (no exclamation)

───────────────────────────────────────────────────────────────
LOADING-STATE STRATEGY (universal decision tree)
───────────────────────────────────────────────────────────────
Choose the right loading affordance — wrong choice feels broken.

Operation duration → preferred loading state:

  < 100ms     → no loading state (instant)
                User won't notice; showing a spinner adds latency
                perception, not removes it.

  100–300ms   → optimistic UI (immediate state change, server
                confirms in background; rollback on error)
                Right for: like buttons, toggle switches, inline
                edits.

  300ms–1s    → button → loading state on the action
                Button shows spinner inline + disables; rest of
                page stays interactive.

  1s–5s       → skeleton placeholders (NOT spinner)
                Show shape of incoming content; reduces perceived
                wait. Skeleton MUST match content shape (don't
                use generic rectangles).

  5s–30s      → progress indicator with text label
                "Analyzing screenshot — 68% (about 4 seconds left)"
                Shows progress + ETA + ability to cancel.

  > 30s       → background processing + notification on completion
                Don't make the user wait. Submit, navigate away,
                notify (toast / email / push) when done.

Per-archetype rules (tightening):
  Lattice   — prefer NO loading state for sub-100ms ops; never
              hide chrome behind a loader (keep keyboard
              shortcuts + nav functional)
  Current   — pull-to-refresh haptic on threshold; skeleton
              required for any list waiting > 500ms
  Meridian  — skeletons banned (motion ban); use opacity-static
              placeholders OR no loader at all (most docs render
              statically anyway)
  Harbor    — skeletons subtle (--color-surface-2 base, opacity
              pulse), never shimmer (too feature-y)
  Sentinel  — text label required next to spinner if wait > 1s

Banned loading antipatterns (universal):
  ✗ Spinner with no text after 1s (ambiguous — stuck or working?)
  ✗ Full-screen blocker for partial-page loads
  ✗ Skeleton shimmer in Meridian / editorial contexts
  ✗ Hide nav / sidebar / chrome behind a loading state
  ✗ Disabling Submit button "to prevent double-click" — use
    debounce + button loading state instead

───────────────────────────────────────────────────────────────
HYBRID PRODUCTS (when one archetype isn't enough)
───────────────────────────────────────────────────────────────
Most real products are hybrids. Common patterns:

  Marketing site + Product app
    → Harbor for the marketing surface (homepage, pricing,
      case studies); Sentinel for the in-app product
    → Examples: Linear, Notion, Stripe, Vercel
    → Strategy: separate codebases or distinct route trees;
      Harbor + Sentinel each ship their own tokens.css with
      shared brand primitives (same primary hex) but
      independent semantic layers, type scales, density.

  Web app + Mobile app
    → Sentinel for desktop web; Current for native mobile
    → Examples: Slack, Figma, Notion, Asana
    → Strategy: shared brand color anchor; otherwise distinct
      systems. Don't try to "responsive" Sentinel down to mobile
      — touch ergonomics demand Current's rules (44px targets,
      bottom-anchored nav, sheets-not-modals).

  Product app + Data terminal view
    → Sentinel for the workflow shell; Lattice for the
      data-explorer / monitoring screens
    → Examples: Linear's command bar, Datadog inside their
      broader app, Snowflake's worksheet inside admin chrome
    → Strategy: keep Lattice scoped to specific routes;
      transition is explicit (visual rule: navigating into the
      Lattice route changes chrome density + introduces the
      data-system color palette)

  Marketing site + Documentation
    → Harbor for marketing; Meridian for docs subdomain
    → Examples: Stripe, Tailwind, Vercel — marketing on root,
      docs on /docs or docs.* subdomain
    → Strategy: shared header brand, distinct layouts. Docs
      reading experience must NOT inherit Harbor's atmospheric
      hero patterns.

  Product app + In-app docs / help center
    → Sentinel shell + Meridian content panes for docs
    → Strategy: docs render as a Meridian-styled panel inside
      Sentinel's chrome (right drawer, modal, dedicated route)

KEEP TOKEN FILES SEPARATE (the universal hybrid rule):
  Don't force one tokens.css to cover both archetypes. Each
  archetype owns its own:
    - Semantic layer (Layer 2 token names per archetype)
    - Typography scale (Harbor clamp vs Sentinel fixed px etc.)
    - Density scale
    - Component priority order
    - Banned patterns

  Share ONLY:
    - Brand primary hex (the anchor color)
    - Brand voice principles (broad guidelines, not specific
      voice rules — those vary per archetype)
    - Universal governance gates (this file)

The most expensive mistake in hybrid design: forcing one
archetype's choices onto another archetype's use case. A 14px
base font is correct for Lattice and catastrophic for Meridian.
Pick the right archetype per surface, then build within it.

───────────────────────────────────────────────────────────────
ARCHETYPE OVERRIDES — AGGREGATED SUMMARY
───────────────────────────────────────────────────────────────
Each archetype inherits this governance spec in full and adds
its own overrides. Below is the canonical aggregated summary
(updated whenever any archetype's overrides change).

Harbor — 9 overrides
  Theme: dark-first; semantic-only tokens; fluid clamp typography;
  warm oklch shadows; single accent enforced; serif italic
  display reserved for editorial emphasis; floating label class
  banned; indigo/purple banned; avatar initials default; one
  primary CTA per section; hero variants limited to template 01.

Sentinel — 10 overrides
  Two-layer token architecture; accent-CTA rule (no white-on-
  orange in chrome); tabular numerals on numeric columns; fixed
  px modular typography; sans-serif only; cold rgba shadows;
  dark-mode tokens defined; WCAG verification table required
  in spec page; color-only encoding banned; "(required)"
  parenthetical (not asterisk-only).

Current — 11 overrides
  44px minimum touch targets enforced; body font ≥ 16px on
  inputs; native font stack default; safe-area insets honored;
  hover-only interactions banned; hamburger nav banned as
  primary; tab bar capped at 5 items; haptics paired with
  visible state; modal-only patterns flagged for review;
  inputMode matches input semantic; both light + dark modes
  required.

Lattice — 10 overrides
  Two-color-system architecture (chrome + data separate);
  monospace + tabular numerals on all numeric columns; every
  interactive element has documented keyboard shortcut;
  animation on data updates banned; shadows on default surfaces
  banned; confirmation dialogs banned for reversible actions;
  pagination banned for data tables (must virtualize); color-
  only encoding banned; body type ≥ 11px in chrome; pane
  resizability required.

Meridian — 11 overrides
  Reading-width hard-capped at 72ch on body; body type ≥ 17px;
  body line-height ≥ 1.7; animation on body content banned;
  sepia + dark mode tokens defined; print stylesheet required;
  all headings (h2+) auto-id'd; external links flagged with
  icon; code blocks have copy button; ⌘/Ctrl+K search wired;
  marketing hype words blocked in body.

───────────────────────────────────────────────────────────────
SPEC-PAGE OUTPUT REQUIREMENTS (Prompt 08)
───────────────────────────────────────────────────────────────
Every generated spec page MUST contain these sections, even if
the archetype spec doesn't enumerate them explicitly. Prompt 08
derives missing details from archetype rules.

Required sections (in this order):
  §1.1  Color system + Accessibility verification table
  §1.2  Typography (scale, weights, line-heights)
  §1.3  Spacing scale
  §1.4  Elevation + shadows (with use case per token)
  §1.5  Shape & borders (radii with use guidance)
  §1.6  Motion (durations + easings + reduced-motion fallback)
  §1.7  Iconography (family, grid, stroke, sample grid)
  §2    Voice & content (Do/Don't pairs)
  §3    Component library (foundation → composite → domain)
  §4    Patterns (form layout, empty state, + archetype-specific)
  §5    Accessibility (WCAG conformance, keyboard map, SR notes)
  §6    Versioning + Governance (this checklist + changelog
        template)
  Footer: version · generation date · mode · WCAG target ·
          archetype name

═══════════════════════════════════════════════════════════════
END GOVERNANCE · v1.0
═══════════════════════════════════════════════════════════════

Use with: Prompts 1 / 1b / 45 / 46 / 47 / 49 / 51 — paste FIRST, every time

The two prompts

Two prompts produce the token architecture. Both are in kit/prompts/ and work in any Claude session. In Chain Mode (paste kit/chain.md into Claude Code), they run sequentially with auto-save, Q&A branching, and handoff — no manual copy-paste between steps.

Prompt	What it does	Inputs	Output
Prompt 1	Generates the full token set. Asks about BYOC colors and brand fonts before generating — routes automatically based on your answers. Saves to `output/tokens.css` and announces Prompt 2 is ready.	governance.spec.md + archetype.spec.md + (optional) brand hex seeds	output/tokens.css — color, spacing, radii, motion tokens (no typography)
Prompt 2	Layers the type scale on top. Reads `output/tokens.css` and any brand fonts noted in Prompt 1. Saves the updated token file and announces Prompt 03 is ready.	output/tokens.css + archetype TYPOGRAPHY block + (optional) brand font families	output/tokens.css — updated with full typography scale

Prompt 1 — Token generation (with BYOC built in)

Reads the two specs, asks about brand colors and fonts before generating anything, emits output/tokens.css with verified WCAG contrasts, applies governance overrides, and tells you exactly what to do next. Use Claude Opus — the interpretive judgment matters more here than for any other prompt in the book.

The card below shows the core generation instructions — the prompt body that Claude executes. When you run this from the kit (kit/prompts/01-tokens-from-spec.md), the prompt opens with a meta-header that detects your environment (Claude Code vs Claude.ai), checks for existing output files, and asks the two branching questions before a single line of CSS is generated. It closes with a meta-footer that saves output/tokens.css, offers an HTML preview, and announces Prompt 2 is ready. The instructions below are what runs between those two wrappers.

Prompt 1 — Token generation from archetype spec

PROMPT 1 — TOKEN GENERATION FROM ARCHETYPE SPEC

YOUR ROLE
You are a design-system token generator. You translate two pasted specs
into a production-grade tokens.css with verified WCAG contrasts and full
light + dark coverage.

STEP 0 — RECEIVE INPUTS
Confirm both pastes before generating any CSS.

  PASTE 1 — governance.spec.md: [paste here]
  PASTE 2 — [archetype].spec.md: [paste here]

If either paste is missing or incomplete, stop and ask for it.

STEP 1 — VALIDATE SPEC STRUCTURE
The archetype spec must contain: THEME, TOKEN NAMING, COLOR TOKENS,
TYPOGRAPHY, DENSITY, ACCENT STRATEGY, ELEVATION, RADII, MOTION,
ICONOGRAPHY, GOVERNANCE OVERRIDES.

The governance spec must contain: WCAG MINIMUMS, BANNED PATTERNS,
GLOBAL CONSTRAINTS.

If any required section is missing, name it and stop.

STEP 2 — EMIT THE :root BLOCK
Generate :root with:
  - Surface tokens per archetype's THEME (--color-bg, --color-surface, etc.)
  - Border tokens (--color-divider, --color-border, --color-focus-ring)
  - Text tokens (--color-text, --color-text-muted, --color-text-faint)
  - Accent tokens per ACCENT STRATEGY (single-hue OR multi-ramp per spec)
  - Semantic trio (--color-success, --color-warning, --color-error)
  - Spacing scale per DENSITY (4px, 2px, or 8px base)
  - Radii per RADII block
  - Motion per MOTION block (with reduced-motion fallback)
  - Icon sizing scale

Apply archetype rules:
  - TOKEN NAMING: semantic-only OR primitive+semantic, per spec
  - THEME: dark-first OR light-first, per spec
  - DENSITY: spacing base per spec

TYPOGRAPHY: generate ZERO font tokens here. --font-display, --font-body,
--font-mono, --text-*, --leading-*, --tracking-*, --weight-* are all
Prompt 2's job. If the archetype spec's TYPOGRAPHY section is present,
acknowledge receipt — then skip it. Prompt 2 will consume it.

PATH B TYPOGRAPHY DETECTION (MCP context only):
If running with Figma Dev Mode MCP access, after reading Variable
collections also scan Figma Text Styles — typography lives there,
not in Variables. Do NOT emit font tokens. Instead append to
output/chain-state.md:

  [Figma typography detected]
  Text Styles found: [yes | no]
  Styles: [name · font-family · size · weight · line-height per style]
  Note for Prompt 2: read these from Figma via MCP instead of
  asking the user to specify fonts manually.

If no Text Styles are found, note:
  [Figma typography detected]
  Text Styles found: no — Prompt 2 will ask user for font preferences.

STEP 3 — EMIT THE OPPOSITE-THEME OVERRIDE BLOCK
Generate the opposite-theme override block (e.g., [data-theme="light"]
if archetype is dark-first). Override only what changes — typically
surfaces, text, divider, border, focus-ring. Keep accent and semantic
constant unless governance spec requires otherwise.

STEP 4 — VERIFY WCAG CONTRAST
For every text-on-surface pair, compute contrast and output a markdown
table:

  | Foreground | Background | Ratio | WCAG |
  |---|---|---|---|
  | --color-text | --color-bg | 14.2:1 | AAA |
  ...

Flag any ratio below the governance spec's WCAG MINIMUMS section.

STEP 5 — APPLY GOVERNANCE OVERRIDES
Read GOVERNANCE OVERRIDES in the archetype spec. Apply archetype-specific
gates: Harbor's warm oklch shadows, Current's 44px touch-target floor,
Sentinel's sentence-case-only labels, etc.

STEP 6 — DELIVER THE FINAL FILE
Output the complete tokens.css as a single fenced code block, preceded
by a 2-line summary (token count, WCAG ratios passed/flagged). No prose
before or after the code block.

CONSTRAINTS
- No hardcoded color values outside the :root block.
- No magic numbers in spacing or radii — all spatial values reference tokens.
- All motion respects prefers-reduced-motion.
- Honor the BANNED PATTERNS section absolutely.
- No emoji, no Comic Sans, no Papyrus, no archetype-banned typefaces.

Use with: governance.spec.md + archetype.spec.md · Claude Opus · Claude.ai or Claude Code

BYOC — Bring Your Own Colors

BYOC is built into Prompt 1 — there is no separate paste step. When you run Prompt 1 from the kit, it asks: "Do you have brand colors to honour?" — answer yes, supply your primary hex (and optionally accent), and Prompt 1 activates the BYOC path inline. The full derivation logic (oklch ramp interpolation, WCAG validation, per-archetype TOKEN NAMING) runs inside Prompt 1 without switching files. A standalone 01b-byoc.md file ships in the kit as reference documentation for the BYOC logic itself, not as a step you run separately.

Prompt 2 — Typography scale (with brand fonts if noted)

Reads output/tokens.css from Prompt 1, layers the type scale on top, and applies any brand fonts you specified in Prompt 1's meta-header. Optionally substitutes brand fonts while preserving role rules — Inter ↔ GT America in Sentinel works; serif body in Sentinel breaks the archetype and gets flagged. Saves the updated output/tokens.css and announces Prompt 03 is ready.

Prompt 2 — Typography scale + brand fonts

PROMPT 2 — TYPOGRAPHY SCALE (with optional brand-font substitution)

YOUR ROLE
You generate the typography token block on top of an existing tokens.css.
You match the type-scale shape (fluid clamp() vs fixed modular) to the
archetype's TYPOGRAPHY rules. You optionally substitute brand fonts
while preserving role rules.

STEP 0 — RECEIVE INPUTS
  PASTE 1 — existing tokens.css from Prompt 1 / 1b: [paste here]
  PASTE 2 — [archetype].spec.md (TYPOGRAPHY section): [paste here]
  YOUR BRAND DISPLAY FONT (optional): [font name + import URL]
  YOUR BRAND BODY FONT (optional): [font name + import URL]
  YOUR BRAND MONO FONT (optional): [font name + import URL]

Brand fonts are optional. Without them, use the archetype's default font
assignments.

STEP 1 — VALIDATE FONT ROLE COMPATIBILITY
Check brand fonts against archetype TYPOGRAPHY rules:
  - Harbor: display = serif (italic-capable for editorial flourish), body = sans
  - Sentinel: display + body = sans-serif (same family allowed), mono optional
  - Current: display + body = native or sans, mono optional
  - Lattice: display + body = sans, mono = required (tabular numerics)
  - Meridian: body = serif preferred (sans allowed for technical), display = sans

If a supplied brand font violates the archetype's role (e.g., a serif for
Sentinel's body), FLAG IT BACK and propose either keeping the archetype
default OR overriding (which breaks the archetype). Do not silently
substitute incompatible fonts.

STEP 2 — EMIT FONT FAMILY TOKENS
Generate:
  --font-display: [brand display | archetype default], [fallback stack];
  --font-body:    [brand body    | archetype default], [fallback stack];
  --font-mono:    [brand mono    | archetype default], ui-monospace, monospace;

Include @import url() at the top of the file for any web-fonts. Use
font-display: swap for all custom fonts.

STEP 3 — EMIT TYPE SCALE
Per archetype:
  - Fluid clamp() (Harbor / Current / Meridian display): clamp(min, base + vw, max).
  - Fixed modular (Sentinel / Lattice): 1.125 (Sentinel) or 1.067 (Lattice)
    modular ratio with a 14px or 12px base.
  - Reading-comfortable (Meridian body): 17–19px body with 1.75 leading.

Token names:
  --text-xs, --text-sm, --text-base, --text-lg, --text-xl, --text-2xl, --text-hero
  --leading-tight, --leading-snug, --leading-normal, --leading-relaxed
  --tracking-tight, --tracking-normal, --tracking-wide
  --weight-regular, --weight-medium, --weight-semibold, --weight-bold

STEP 4 — VERIFY READING ERGONOMICS
For body text in particular:
  - Body font size at viewport >= 1024px: 14px floor for Sentinel/Lattice,
    16px floor otherwise.
  - Reading-width cap (max-width on body p): 80ch for Meridian, 70ch for Harbor,
    no cap for Sentinel/Lattice/Current.
  - Reduced-motion respected for any text-related transitions (e.g., fade-in).

STEP 5 — DELIVER THE FINAL UPDATED tokens.css
Output the complete updated tokens.css as a single fenced code block.
No prose before or after.

CONSTRAINTS
- No font fallback shorter than 3 levels deep (custom -> category -> system).
- No text-transform: uppercase on body text — only on UI labels and eyebrows.
- All sizes use the type-scale tokens. No magic px values in component CSS.
- Honor TYPOGRAPHY's BANNED FONTS list from the archetype spec.

Runs after: Prompt 1 or Prompt 1b · Claude Opus · Claude.ai or Claude Code

What you get back

A run of Prompt 1 + Prompt 2 against governance.spec.md and harbor.spec.md, with no brand fonts supplied, returns ~150 lines of tokens.css. The shape:

tokens.css · sample output (excerpt) · Harbor archetype

/* Generated from harbor.spec.md + governance.spec.md
   54 tokens · WCAG: 12 ratios verified, 0 flagged
*/
:root {
  /* Surfaces — dark-first per Harbor */
  --color-bg:           #141312;
  --color-surface:      #1A1918;
  --color-surface-2:    #1E1D1B;

  /* Text — 4 levels */
  --color-text:         #F5F4F0;
  --color-text-muted:   rgba(245,244,240,0.70);

  /* Accent — single hue per Harbor's ACCENT STRATEGY */
  --color-primary:      #66D9C2;
  --color-primary-hover: #7EE5D0;
  --color-primary-text:  #141312;

  /* Typography — fluid clamp() per Harbor */
  --font-display: "Instrument Serif", Georgia, serif;
  --font-body:    "Inter", system-ui, sans-serif;
  --text-base: clamp(1rem, 0.95rem + 0.25vw, 1.125rem);
}

[data-theme="light"] {
  --color-bg:           #FAFAF7;
  --color-surface:      #F4F1EA;
  --color-text:         #1A1918;
  /* Override surfaces and text only. Accent and semantic stay constant. */
}

The full file covers about 50–60 tokens for Harbor (semantic-only naming keeps the count compact). Sentinel, with its primitive + semantic two-layer architecture, returns roughly 90–110 tokens for the same spec depth. The token count varies by archetype — what stays constant is the structure: :root primary block + opposite-theme override + WCAG verification.

Layer 3 emerges later, not now

Resist the urge to author component tokens upfront. Layer 3 is best left empty until you've shipped the foundation, composite, and pattern components and seen which values actually repeat. The Layer 3 extraction prompt scans your locked CSS for values used three or more times across components and proposes which ones deserve a component-level token.

That prompt lives in the kit at kit/prompts/10-layer-3-extraction.md and runs as the final step of Chain 1 (Token cascade) or Chain 9 (Greenfield end-to-end). Authoring guidance lands in a later chapter; for now, treat Layer 3 as a "later, not now" concern. Components in Ch04 onwards reference Layer 2 directly.

What's next

Tokens are a contract. The next chapter, Colors & WCAG, unpacks the most fragile part of that contract — color choices that survive both human aesthetic preference and machine-verified contrast. We'll go deeper on the WCAG verification step Prompt 1 emits, document the failure modes that show up most often when readers run their own brand colors through Prompt 1b, and walk the oklch interpolation that derives ramps from a single seed.

Act II — Token Architecture · Chapter 04

Colors & WCAG

Every prompt in this book emits a WCAG contrast table next to its tokens. This chapter unpacks why that table is the single most consequential output — and what the AI is checking when it builds it.

Color is the fastest-failing part of any AI-generated design system. The failure mode is always the same: the brand color looks beautiful in isolation, fails contrast against the surface roles the archetype demands, and ships anyway because nobody read the contrast table the prompt emitted. This chapter is about reading that table — what the numbers mean, what the AI does when a number fails, and how the oklch ramp interpolation in Prompt 1b derives a 9-shade palette from a single hex seed without inventing colors that fail the system.

WCAG fundamentals — the four numbers that matter

WCAG 2.1 specifies four contrast thresholds. Every token pair the AI emits is checked against the right one for its role. Memorize these — they're the only numbers that decide whether your palette ships.

Element	WCAG AA (minimum)	WCAG AAA (preferred)	Where it applies
Body text (under 18pt or under 14pt bold)	4.5:1	7:1	Most paragraph text, table cells, form labels
Large text (18pt+ or 14pt bold)	3:1	4.5:1	Headings, hero copy, eyebrow labels
Non-text UI (icons, focus rings, charts, borders)	3:1	—	Interactive borders, focus rings, icon strokes, chart elements
Decorative-only elements	Exempt	Exempt	Pure visual flourish carrying no information

The single most useful number is 4.5:1 — body text against its surface. Most palettes fail here, and almost always at the same place: the brand accent. A saturated mid-tone hex sits visually around #888 in oklch lightness, which means it scores ~3:1 against white and ~3:1 against black. Both fail body text. The rule that drops out of this is the Accent-CTA rule (covered later in this chapter).

Five archetypes, five color strategies

Each archetype encodes its color strategy in its spec. The strategies are deliberately different — what works for editorial reading would suffocate a data terminal.

Archetype	Strategy	Token naming	Accent count	Defining rule
Harbor	Single-hue accent on warm dark surfaces	Semantic only	1	Warm oklch shadows derived from primary
Sentinel	Multi-ramp + semantic aliases	Primitive + semantic two-layer	5+ ramps (primary, accent, link, success, warning, error, neutral)	Accent-CTA rule (orange bg + dark text)
Current	Bold multi-accent, both modes first-class	Semantic + small brand ramp (3-step)	3 (primary, secondary, tertiary)	Both light + dark required, no "v2 dark mode"
Lattice	Two systems: chrome (restrained) + data (purpose-fit)	Chrome semantic + data palettes (diverging / sequential / categorical)	1 chrome accent + 8-color D3 categorical	Chrome and data palettes never share a hex
Meridian	Minimal — accent reserved for links	Semantic + reading-role aliases (callout / code / footnote)	1 (link only)	Three themes (light + sepia + dark) all defined

The strategy is encoded in each archetype's ACCENT STRATEGY and COLOR TOKENS sections. When you ran the dropdown selector in Ch03, you copied the full text of one of these strategies into Prompt 1 — that's how the AI knows whether to emit a 9-step navy ramp (Sentinel) or a single teal hue with four states (Harbor).

Ramp generation — oklch interpolation from a seed

When you supply a brand color via Prompt 1b, the AI doesn't pick the ramp shades — it derives them using oklch interpolation. The seed is the input; the 9-shade ramp is the output. This is the math that makes BYOC palettes feel coherent rather than handpicked.

Why oklch, not HSL or RGB

HSL and RGB are perceptually uneven — a 10% lightness shift in HSL feels different at different hues (yellow gets washed out, blue gets pitch black). Oklch (Oklab Lightness-Chroma-Hue) is perceptually uniform: a 10% L shift looks like a 10% lift to the eye regardless of hue. That makes interpolation predictable.

tokens.css · oklch ramp derived from seed

/* Brand seed: #FF5733 (terracotta) */
:root {
  /* Derived ramp — oklch lightness varies 95% → 15%, hue + chroma stay anchored */
  --primary-50:  oklch(0.95 0.04 29);  /* very light tint */
  --primary-100: oklch(0.88 0.08 29);
  --primary-300: oklch(0.72 0.16 29);
  --primary-500: oklch(0.62 0.20 29);  /* the seed itself ≈ #FF5733 */
  --primary-700: oklch(0.45 0.18 29);
  --primary-900: oklch(0.25 0.12 29);  /* deepest shade */
}

/* Hue (third value: 29°) stays constant — every shade reads as the same brand color.
   Lightness scales linearly. Chroma falls off at the extremes (very light + very dark
   shades have less saturation room — physics of color, not a stylistic choice). */

The two non-obvious moves: chroma falls off at the lightness extremes (a 95% L shade can't hold the same chroma as a 60% L shade — the gamut runs out), and hue stays anchored across the whole ramp (no rainbow drift; every shade reads as terracotta). Prompt 1b applies both rules automatically. If you have a pre-tuned ramp from a designer, paste it as-is and skip the derivation.

The Accent-CTA rule — Sentinel's keystone

The single most common WCAG failure in AI-generated palettes is the Accent-CTA rule. It comes up so often, and breaks so many ramps, that Sentinel's spec encodes it as a hard gate.

Prompt 1b validates the brand seed against this rule explicitly. If your supplied primary fails 4.5:1 against both white and black surfaces, the prompt flags it back with a proposed darker or lighter derived shade rather than silently producing low-contrast tokens. You confirm or override; the prompt does not "fix" without telling you.

Triple encoding — color is never the only signal

About 8% of men and 0.5% of women have some form of color blindness. The implication for design systems: color alone never carries meaning. Every place a color encodes a signal, the system must also encode that signal in icon and text. Three redundant channels, every time.

Wrong	Right	Why
Red row (no icon, no text)	Red row + alert-triangle icon + "Failed" text	Red-green colorblind users see the row as gray; need icon + text to know it failed
Status pill — green dot only	Green dot + "Active" label	Same logic — dot color alone is invisible to ~1 in 12 users
Chart line (red vs green)	Chart line + dashed pattern + tooltip with series name	Categorical encoding fails for colorblind users; pattern + label survive
Form error — red border only	Red border + error icon + error text below + `aria-invalid`	Color + visual icon + text + screen-reader hook = 4 channels

The governance spec encodes this as a banned pattern: color-only encoding. Lattice strict-enforces it (every status indicator in a data terminal must combine color + icon + text). Sentinel and Current both inherit it. Harbor and Meridian use semantic color so sparingly that the rule rarely binds, but it still applies — a red-bordered callout still needs the icon and the title.

Dark mode as token swap

Per Ch03's three-layer architecture, dark mode is a primitive-only override — semantic aliases stay constant; components stay constant. This is the simplest possible dark-mode pattern, and it's the one Prompt 1 emits by default.

tokens.css · dark-mode override (primitives only)

:root {
  /* Primitives — light mode */
  --neutral-50:  #FAFAF9;
  --neutral-900: #1C1917;
  --primary-700: #1B2A4A;

  /* Semantic aliases — these never change between modes */
  --surface-page: var(--neutral-50);
  --text-primary: var(--neutral-900);
  --color-primary: var(--primary-700);
}

[data-theme="dark"] {
  /* Re-point primitives — semantic aliases pick up the new values automatically */
  --neutral-50:  #1C1917;  /* surface flips dark */
  --neutral-900: #FAFAF9;  /* text flips light */
  --primary-700: #7AA8E0;  /* primary lifted for dark contrast */
}

The accent typically does NOT swap — its hue stays the same; only its lightness lifts to maintain contrast on the new surface. Sentinel and Lattice keep their accent constant. Harbor's teal stays teal in both modes. Meridian's link blue lifts slightly for dark mode (#0066CC → #66B3FF) — the same hue, just brighter to clear the dark surface.

Failure modes — what Prompt 1b flags back

The most common BYOC outcomes when readers run their own brand colors through Prompt 1b. Each is a real failure I've seen ship — and each is what the prompt's contrast validation step is checking for.

Brand seed	Symptom	What Prompt 1b does
Mid-tone saturated (e.g., teal `#1ABC9C`, lime `#A3E635`)	Fails 4.5:1 against both white and black surfaces — body text on it is illegible either way	Flags back. Proposes darker derived shade (oklch L 35–40%) for primary text role + lighter for surface tints. Asks for confirmation.
Pastel (low chroma, high lightness — e.g., dusty rose `#F8C8DC`)	Fails 3:1 against white. Even non-text UI fails (focus ring invisible).	Flags back. Proposes pastel as `--surface-tint` background only, with a derived deeper shade for the primary text role.
Pure black `#000000` on a Harbor archetype	Violates Harbor's "warm off-black, never pure black" rule (banned per spec)	Flags as archetype violation. Proposes warm off-black `#141312` per Harbor's THEME section. Explains the warmth rule.
Indigo or purple (e.g., `#6366F1`) on Sentinel	Sentinel bans indigo/purple as cliched "AI startup" signal	Flags as archetype violation. Asks if you want to override the ban (which weakens Sentinel's tonal claim) or pick a different brand color.
Two or more accent hues on Harbor	Harbor's single-hue rule violated	Flags as archetype violation. Asks if you want Sentinel (multi-ramp) or Current (multi-accent) instead.
Light-mode-only palette on Current	Current requires both modes first-class	Flags missing dark-mode tokens. Auto-derives dark variant via oklch flip + lift, asks for confirmation.

In every case, the prompt names the failure, proposes a derived or alternative value that does pass, and asks for confirmation. It does not silently substitute. The reason is simple: silent substitution erodes trust. The reader needs to know exactly what changed, why, and approve the change — that's what makes the system auditable.

The contrast table is the deliverable

Every spec page generated by Prompt 08 contains an accessibility verification table — every text-on-surface pair, every accent-on-surface pair, every focus-ring-on-surface pair, computed from the actual token values. If you take one thing from this chapter, take this: the contrast table is the audit trail. It's what proves the palette ships. Stripping it from the spec page is a governance failure, not a stylistic choice.

What's next

Tokens defined and verified is half the work. The other half is making sure designers and engineers are looking at the same source of truth. Chapter 5 — Figma Variables covers how the tokens you just generated land in Figma as Variable collections, how Figma's Dev Mode MCP reads them back into prompts, and the read/write round trip that keeps both ends in sync.

Act II — Token Architecture · Chapter 05

Figma Variables

CSS custom properties and Figma Variables solve the same problem. They are not the same thing — and conflating them is the fastest way to drift between what designers see and what ships.

The three-layer architecture from Ch03 exists in two worlds: the code side (tokens.css) and the Figma side (Variable collections). This chapter maps the architecture onto Figma, shows how the Dev Mode MCP reads from Figma's Variables into Claude-driven prompts, and documents the three sync strategies that keep both worlds from drifting apart.

Variables vs CSS custom properties — the differences that matter

Dimension	Figma Variables	CSS custom properties
Lives in	Figma file (cloud)	Your codebase (`tokens.css`)
Read by	Figma, Figma plugins, Variables REST API, Dev Mode MCP	Browser, Claude via MCP, any CSS processor
Modes (theme switching)	Native — one frame toggle flips every alias simultaneously	Manual overrides (`[data-theme="dark"]`) applied at runtime
Aliasing	Native Variable references (design-time, visible in Figma)	`var(--token-name)` (runtime, computed by browser)
Sync	Does not auto-export to CSS — requires a sync step	Does not auto-import from Figma — requires a sync step
Source of truth	Design-first teams (designers own the primitives)	Code-first teams — this book's default

Three collections, three jobs

The primitive → semantic → component architecture from Ch03 maps directly to three Figma Variable collections. Each collection has one job and one set of rules about what it may contain.

1

Primitives — raw values only

Contains all raw hex values, spacing numbers, radii. No aliases allowed — every variable in this collection resolves to a literal value. Has exactly one mode: Default (the single source of raw truth). Naming pattern: color/primary/700, space/4, radius/md.

2

Semantics — aliases only, with modes

Contains no raw values — every variable aliases a Primitive. Has two (or more) modes: Light and Dark (Meridian adds Sepia). Switching the mode on a Figma frame swaps every semantic alias simultaneously — one click for the whole page. Naming pattern: text/primary, surface/page, accent/default.

3

Components — optional, scoped to component frames

Contains component-specific overrides — aliases pointing at Semantic tokens. Only add this collection when a component legitimately needs a value that the global semantic doesn't provide (e.g., a data-viz palette, a code-editor theme). For most teams, stopping at Layer 2 and having components reference semantics directly is the right call. Naming pattern: button/bg, button/text, badge/bg-success.

Modes — the theme mechanism

Figma Variable modes are the native equivalent of a CSS [data-theme="dark"] block. The key rule is the same in both systems: primitives never change between modes — only semantic aliases change, pointing at different primitives.

Figma Variables · mode resolution

/* The same Variable resolves differently per mode */

/* Light mode (default) */
text/primary   →  {Primitives/color/navy/700}   =  #1B2A4A
surface/page   →  {Primitives/color/neutral/50}  =  #FAFAF9
accent/default →  {Primitives/color/accent/500}  =  #FF9900

/* Dark mode (switch the Semantics collection mode on a Figma frame) */
text/primary   →  {Primitives/color/neutral/50}  =  #FAFAF9
surface/page   →  {Primitives/color/neutral/950} =  #0A0E17
accent/default →  {Primitives/color/accent/400}  =  #FFB84D

/* Primitive values never change between modes.
   Only the semantic aliases change — pointing to different primitives.
   Components see the same Variable names in both modes; Figma resolves them. */

How the MCP reads Figma Variables (Path B · Standard)

In Path B (and C), Claude reads directly from the Figma file via the Dev Mode MCP instead of interpreting from specs. The MCP doesn't receive a JSON export — it queries the live Figma file's Variable collections at prompt time, resolving each alias to its mode-specific value.

Step	What happens	What Claude sees
1 · MCP connect	Claude connects to the Figma file via Dev Mode MCP (file key in the prompt)	Access to the file's Variable collections, component properties, layout specs
2 · Variable read	Claude reads the three collections (Primitives, Semantics, Components)	Fully resolved token values per mode — not a static snapshot
3 · Spec validation	Claude cross-checks the read values against the pasted archetype spec (Ch03 two-paste workflow)	Discrepancies between Figma and spec flagged before any CSS is emitted
4 · tokens.css emit	Claude emits tokens.css derived from the live Figma source — the artifact, not the specification	Output inherits what was actually decided in Figma, not what the spec says the default should be

This is the artifact vs specification distinction from Ch01 playing out in practice. Path A readers run Prompt 1 (spec + adjectives → tokens.css). Path B readers run Prompt 1 in MCP mode (live Figma Variables → tokens.css). Same prompt, different source. Same tokens.css output, different provenance.

What MCP covers — and what it doesn't

MCP handles the Figma → tokens.css direction on demand — you run a prompt, Claude reads the live Variables, emits CSS. That covers one direction of one workflow, triggered by a human. Two things MCP does not cover:

Gap	What it means	How it gets closed
Write direction	When an engineer updates `tokens.css` without touching Figma, Figma drifts behind code. MCP is read-only — it can't push tokens back to Figma.	Chain 8 write-back (Ch16) — a separate prompt that pushes the token file back to Figma as Variable collections. Still human-triggered.
Automated continuous sync	MCP runs when you run a prompt. A team shipping 10 PRs a day needs sync to happen on every commit, on every Figma save — without waiting for a human to invoke Claude.	REST API, Tokens Studio, or manual parity (see below) — these run in CI, on schedule, or on save, without AI in the loop.

When you need sync outside the AI workflow

For automated continuous sync — running in CI, on commit, or on Figma file save without a human triggering Claude — three options ordered by control vs. setup effort.

Option	How it works	Best for
Figma Variables REST API — manual or scripted export	Call Figma's Variables REST API to export as JSON; pipe through Style Dictionary to produce CSS. Full control over naming transformation.	Engineering-led teams who want Variables in Figma but own the CSS output in a build step. Most control, most setup.
Tokens Studio plugin — bidirectional sync	Plugin in Figma syncs Variables ↔ a JSON token file in your repo via GitHub/GitLab. Changes in either direction propagate on the next sync run.	Design-led teams who want the Figma file to be the source of truth. Easiest setup; some naming-convention constraints.
Manual parity — two files, one convention	Figma Variables and CSS custom properties are maintained independently but follow the same naming convention. No tool — a contract between team members.	Small teams in early stages where tooling overhead isn't justified. Requires discipline. Breaks at scale without enforcement.

Source-of-truth decision

You need to decide once, write it down, and not revisit it for at least six months. Both directions work. The only failure mode is undocumented drift between both.

Design-first — Variables own primitives

Figma Variables hold the raw hex and spacing values. Code consumes the exported JSON (via REST API or Tokens Studio). Designers change a primitive in Figma; the sync pipeline propagates it to code on the next run.

Pick this when: your design team is larger than engineering, designers iterate on tokens frequently, or brand colors change on a short cycle.

Code-first — tokens.css owns primitives (this book)

CSS custom properties in tokens.css are the raw source. Figma Variables alias them, or are populated from a token export. The MCP reads from Figma, but the token file drives both directions. Prompt 15 (Chain 8) pushes a new tokens.css back to Figma as Variable collections.

Pick this when: engineers outnumber designers, tokens rarely change between sprints, or your team already has a Style Dictionary build step.

The naming convention is the contract

Whether you sync via REST API, Tokens Studio, or manually — the naming convention is what makes the sync reliable. Both sides must follow the same rule or the mapping breaks.

Naming convention — Figma → CSS mapping rule

/* The rule: replace "/" with "-", prefix with "--" */

/* Figma Variable name  →  CSS custom property */
color/primary/700  →  --color-primary-700
space/4            →  --space-4
text/primary       →  --text-primary
surface/page       →  --surface-page

/* Document this rule in your governance spec.
   Every prompt that reads from Figma uses this mapping.
   Every prompt that emits CSS uses this mapping.
   The naming convention IS the contract. */

What's next

Variables and tokens are color-and-spacing. They say nothing yet about how the text in those surfaces is sized and shaped. Chapter 6 — Typography covers the type scale per archetype — fluid clamp() vs fixed modular, why the distinction matters, how Prompt 2 generates the scale, and what "brand font substitution" means structurally vs what it breaks.

Act II — Token Architecture · Chapter 06

Typography — fluid vs fixed

Harbor uses fluid clamp() scales with serif display italics. Sentinel uses fixed modular scales with sans-only. Lattice caps at 22px. Meridian enforces 17–19px body with 1.75 leading. The scale is not a preference — it is part of the archetype contract.

Prompt 2 (shipped in Ch03 as a paste-ready card) generates the type scale from the archetype spec and any brand font overrides you supply. This chapter explains why the archetype dictates the scale shape, what happens when you substitute brand fonts, and the three failure modes that make typography the most common place where AI-generated systems quietly break.

Fluid vs fixed — five archetypes, two approaches

Archetype	Scale type	Base size	Headline cap	Key constraint
Harbor	Fluid clamp() — all sizes	16–18px (viewport-responsive)	clamp(2.5rem, 1rem + 6vw, 6rem)	Serif italic display reserved for editorial emphasis; ALL CAPS banned on headings
Sentinel	Fixed modular — 1.125 ratio	14px (fixed)	34px — dashboards never need more	Sans-only; serif declaration fails lint; tabular numerals required on numeric columns
Current	Fixed px — iOS/Android scale	17px body (iOS default)	40px (onboarding / hero, sparingly)	Body text ≥ 16px on inputs — iOS Safari triggers zoom below this; system fonts default
Lattice	Fixed dense — 2px-base increments	12px body (terminal convention)	22px (workspace title maximum)	Monospace for all numeric values (tabular figures); sub-13px allowed only for axis labels
Meridian	Fluid clamp() — body emphasis	17–19px (reading-optimised)	32–44px (h1, utilitarian not decorative)	Body line-height ≥ 1.75; reading-width hard-capped at 68ch; serif body is the deliberate default

Why fluid for Harbor and Meridian

Harbor and Meridian both have reading as the primary activity. Fluid clamp() scales let body text expand on large screens (breathing room improves reading) and contract on small screens (no truncation) — in a single declaration, with no media queries and no breakpoints to maintain.

tokens.css · Harbor fluid scale (excerpt)

/* Three values: min viewport · preferred (viewport-unit) · max viewport */
--text-hero:  clamp(2.5rem,  1rem + 6vw,  6rem);   /* 40px → 96px */
--text-2xl:   clamp(2rem,    1.2rem + 2.5vw, 3.5rem);  /* 32px → 56px */
--text-base:  clamp(1rem,    0.95rem + 0.25vw, 1.125rem); /* 16px → 18px */
--text-xs:    clamp(0.75rem, 0.7rem + 0.25vw,  0.875rem);/* 12px → 14px */

/* All nine sizes declared once. No breakpoints. No @media. */

Why fixed for Sentinel and Lattice

Sentinel and Lattice are task-focused, not reading-focused. Their primary constraint is pixel-perfect alignment at a specific density. A Sentinel dashboard needs table cells to align across columns; a Lattice terminal needs monospace numbers to stack. Fluid scaling would shift these alignments between viewports. Fixed values are the correct tool.

tokens.css · Sentinel fixed modular scale (1.125 ratio, 14px base)

--text-caption:  11px;   /* metadata, timestamps */
--text-body-sm:  13px;   /* dense UI, helper text */
--text-body:     14px;   /* DEFAULT — every table cell, every form field */
--text-h4:       22px;   /* card title, dialog title */
--text-h1:       34px;   /* page title — maximum in Sentinel */

/* font-feature-settings: "tnum" 1, "lnum" 1 — tabular numerals, required
   on any column of numbers. Variable-width numerals break column alignment. */

Brand font substitution — what works and what breaks

Prompt 2 accepts optional brand font names. The archetype's type-scale shape (fluid vs fixed, ratio, base size, leading) is preserved regardless of what font you supply. Only the family name substitutes. But the substitution has limits — some swaps are safe, others break the archetype.

Archetype	Safe substitution	Breaks the archetype	Why
Harbor	Any italic-capable serif for display (Playfair, Freight, Canela)	Geometric sans as display family	Harbor's editorial signature is serif italic display. Swapping to sans drops the one signal that distinguishes it from a product UI.
Sentinel	Any humanist or geometric sans (GT America, IBM Plex Sans, Söhne)	Serif body or display	Sentinel is sans-only by spec — a serif declaration fails lint. The signal is clean density, not decoration.
Current	Native system stack preferred; any clean sans ≤ 100KB subset	Custom serif; web font >100KB unsubsetted	Custom fonts add load weight on mobile. iOS renders SF Pro; Android renders Roboto. Deviating without a performance budget breaks the native feel.
Lattice	Any legible sans at 12px; any tabular monospace for numbers	Display serif; non-tabular monospace	Lattice requires tabular figures (`tnum`) on every numeric column. A mono font without tabular features collapses column alignment.
Meridian	Any readable serif for body (Source Serif, Lora, Charter); any clean sans as alternative body	Purely decorative display serif with poor text rendering at body sizes	Meridian's body font renders at 17–19px for sustained reading. A font that looks beautiful at display size but has poor hinting at 18px creates eye strain over long sessions.

Prompt 2 validates the supplied font against the archetype's role rules. If you supply a geometric sans as Harbor's display font, the prompt flags it and asks whether you want to override the archetype's editorial signature or pick a different font. It does not silently comply.

Reading-width caps and leading — where typography meets layout

Line length and line height are typographic decisions that live in tokens. They determine reading comfort more directly than font size.

Archetype	Body leading	Reading-width cap	Notes
Harbor	1.55 (normal) / 1.75 (lede)	68ch paragraphs, 56ch leads, 52ch pull quotes	Marketing pages can be wider; body text caps at 68ch for scan ergonomics
Sentinel	1.50	None — tables fill viewport width	Prose in settings panels or modals: 480–640px cap. Data surfaces: no cap.
Current	1.45	100vw default; 600px on tablet	Mobile reads in glances; tighter leading than desktop archetypes
Lattice	1.30 table / 1.45 prose	Data surfaces: fill. Prose in error modals / docs: 640px.	Data zone rows use 1.0 leading (no extra space) at maximum density
Meridian	1.75 — enforced as a governance gate	68ch hard cap on body; 80ch allowed for code/tables	Eye fatigue past 80ch is documented research. Past 68ch for sustained prose is the Meridian failure mode. Lint flags violations.

Three failure modes

These are the three typography mistakes AI-generated systems make most often. All three appear in the WCAG verification step of Prompt 1 — if you read the output, these failures announce themselves.

1

iOS Safari zoom — body text under 16px on inputs

iOS Safari auto-zooms when a form input has font-size below 16px. It does this to make the text readable. The user sees the page jump. The fix: set --text-body to 16px minimum on any archetype targeting mobile. Current enforces this at 17px. Sentinel's 14px default is correct for desktop-only — the failure happens when Sentinel is "responsively scaled down" to mobile without Current's override.

2

Tabular numerals missing on data columns

A table column showing prices: $1,234.50 and $987.60. Without font-feature-settings: "tnum" 1, proportional-width digits make the decimal points misalign. It looks like a bug; it's a font feature. Sentinel and Lattice both require this as a governance gate. Prompt 2 emits the feature setting automatically when the archetype is Sentinel or Lattice — but only if the supplied brand font supports tabular figures. If it doesn't, the prompt flags back.

3

Meridian body under 17px — "the invisible reading tax"

A Meridian doc site where someone overrode body to 15px to "fit more content." The page looks denser and professional. After 20 minutes of reading, users report eye strain. After two weeks, engagement drops. The cause is invisible: sustained reading at sub-17px imposes a cognitive load that accumulates over time. The 17px floor is not a preference — it's an ergonomic threshold established by decades of print and digital typography research. Meridian's governance gate lint-fails any --text-body declaration below 17px. Do not override it.

Prompt 2 — where to find it

Prompt 2 (typography scale with optional brand font substitution) ships as a paste-ready card in Ch03's archetype spec section — open the kit in the sidebar, or run it directly from kit/prompts/02-typography-scale.md. The verbatim prompt body validates font role compatibility against the archetype, emits the type scale, and includes a reading-ergonomics check before delivering the updated output/tokens.css. In Chain Mode, brand fonts you noted in Prompt 1 carry forward automatically — Prompt 2 confirms them before applying.

What's next

Colour and type are the first two token categories. Chapter 7 — Spacing & Elevation covers the third: how the spacing scale is structured per archetype (4px base vs 2px base vs platform units), how elevation tokens (shadows, hairlines, surface tints) encode the visual depth model, and the specific combinations that each archetype mandates or bans.

Act II — Token Architecture · Chapter 07

Spacing & elevation

The spacing scale determines how dense or generous a product feels. The elevation model determines how depth is expressed — through background tints, hairlines, or shadows. Both are encoded per archetype, and the archetype's choices are opposite each other.

After tokens.css has colors and typography, the remaining decisions are spatial. This chapter covers the spacing base, naming conventions, radii, and the three elevation strategies — and explains why Sentinel and Lattice are flat where Harbor and Current use shadows, and why that isn't a stylistic preference but an architectural constraint.

Spacing — two bases, five archetypes

The spacing scale is anchored to a base unit. Every step in the scale is a multiple of that base. The choice of base is the archetype's density signal.

Archetype	Base	Naming convention	Typical range	Why
Harbor	4px	Integer (--space-1 = 4px, --space-2 = 8px)	4–160px	Editorial needs generous spacing; round integers feel deliberate. Insertions between steps are rare enough not to matter.
Sentinel	4px	Percentile (--space-25 = 2px, --space-100 = 8px)	2–80px	Product systems evolve constantly. Percentile naming lets you insert --space-75 between --space-50 and --space-100 without renaming anything.
Current	4px	Integer (--space-1 = 4px)	4–64px	Mobile screens are small but not cramped. Generous spacing makes tap-targets unambiguous. Insertions are infrequent.
Lattice	2px	Percentile (--space-25 = 2px)	2–32px	Data terminals need sub-4px increments for row height tuning and cell padding. 2px base + percentile naming gives maximum granularity.
Meridian	4px	Integer (--space-1 = 4px)	4–80px	Reading rhythm is driven by paragraph spacing, not pixel-level density. Standard integer scale is sufficient.

Why percentile naming wins at scale

Integer naming (--space-1 through --space-40) feels natural until you need a value between two existing steps. Adding --space-3.5 is awkward; renaming --space-4 onwards is a breaking change. Percentile naming avoids this: --space-50 = 4px, --space-100 = 8px, --space-75 = 6px — inserted later with zero renaming. Sentinel and Lattice use percentile naming precisely because their component libraries grow over time.

tokens.css · Sentinel percentile spacing vs Harbor integer spacing

/* Harbor — integer naming. Simple, correct for editorial. */
--space-1:  0.25rem;  /*  4px */
--space-2:  0.5rem;   /*  8px */
--space-4:  1rem;     /* 16px — skip 3 because integers are sequential */
--space-6:  1.5rem;   /* 24px */

/* Sentinel — percentile naming. Safe to extend. */
--space-25:   2px;     /* tight icon-text pair */
--space-50:   4px;     /* inline gap */
--space-100:  8px;     /* DEFAULT gap */
--space-200:  16px;    /* card padding */
/* Later, 6px is needed. Insert --space-75 with zero migration. */

Radii — personality encoded in corners

Radii are the most visible spacing decision. Sharp corners (0–4px) signal precision and density. Generous curves (12–24px) signal friendliness and consumer-app character. The archetype's radii spec encodes this personality deliberately — the AI must not override it by defaulting to "modern-looking" rounded corners.

Archetype	Card radius	Button radius	Input radius	Banned
Harbor	12px (--radius-lg)	8px (--radius-md)	4px (--radius-sm)	Pill-rounded everything; >32px on cards
Sentinel	4px (--radius-sm)	8px (--radius-lg — Amazon-style)	4px (--radius-sm)	Super-rounded cards; pill buttons (too playful for enterprise)
Current	16px (--radius-lg)	12px (--radius-md) or pill	12px (--radius-md)	Sharp corners (0) on touchable elements — feels desktop-y on mobile
Lattice	0–4px (--radius-none / --radius-sm)	4px (--radius-sm)	4px (--radius-sm)	Pill anywhere; >4px on data containers — breaks tabular grid perception
Meridian	0 (--radius-none)	4px (--radius-sm)	4px (--radius-sm)	>8px on body content; rounded code blocks ("looks like a sticker")

Elevation — three strategies

How does a surface signal that it's "above" another? Three strategies exist, and each archetype picks exactly one. Using the wrong one breaks the archetype's visual logic.

Strategy	Mechanism	Archetypes	Token shape
Background tinting	Progressively lighter (light mode) or darker (dark mode) surface tints signal elevation. No shadows on default surfaces.	Harbor (primary), Meridian	--color-surface, --color-surface-2, --color-surface-offset, --color-surface-dynamic (4 steps)
Hairline borders	1px borders communicate structure. Shadows reserved only for floating elements (dropdowns, modals, tooltips). No shadow on default surfaces.	Sentinel (primary), Lattice	--border-subtle, --border-default, --border-strong + --shadow-1 through --shadow-16 for floating only
Soft shadows	Mobile-native soft shadows on cards and sheets. Not warm oklch (that's Harbor) — neutral rgba shadows matching each platform's visual language.	Current	--shadow-sm through --shadow-fab (5 tiers)

Harbor's warm shadow rule — oklch

Cold black shadows (rgba(0,0,0,0.12)) look sterile against Harbor's warm palette. Harbor derives shadow color from the primary hue using oklch's "from" syntax — the shadow shares the brand's warmth without requiring a separate color decision.

tokens.css · Harbor warm shadows vs Sentinel cold shadows

/* Harbor — warm shadow derived from primary hue via oklch */
--shadow-sm:  0 1px 2px   oklch(from var(--color-primary) l c h / 0.08);
--shadow-md:  0 4px 16px  oklch(from var(--color-primary) l c h / 0.12);
--shadow-lg:  0 12px 40px oklch(from var(--color-primary) l c h / 0.18);
/* The `from` syntax reads primary's lightness, chroma, hue —
   modifies only alpha. Shadow inherits warm tint, no new color needed. */

/* Sentinel — cold neutral shadows, floating elements only */
--shadow-0:   none;  /* + 1px --border-default on default surfaces */
--shadow-2:   0 2px 4px   rgba(0,0,0,0.08);  /* dropdown */
--shadow-8:   0 8px 16px  rgba(0,0,0,0.12);  /* dialog, modal */
/* NEVER on default card surfaces — border only */

The forbidden combinations

Three combinations that appear reasonable but break the archetype's visual logic. Prompt 01 applies governance overrides that flag these at generation time.

Combination	Why it breaks	Correct approach
Strong border + strong shadow on the same element	Double visual weight competes with content; the element becomes the focus, not the data inside it	Pick one: border for anchored surfaces, shadow for floating ones. Never both.
Warm oklch shadows on Sentinel	Warm shadows signal Harbor's editorial warmth. On a Sentinel dashboard, they read as tonal inconsistency.	Sentinel uses cold neutral `rgba(0,0,0,*)` shadows exclusively.
Shadows on default Lattice surfaces (panes, rows, cells)	Any shadow on a data surface competes with the data for visual weight. Every pixel of visual weight has an information cost in a terminal.	Lattice uses zero shadows on default surfaces. Shadows only for transient overlays (popovers, command palette).

What this means for Prompt 01

Spacing, radii, and elevation are all in tokens.css — emitted by Prompt 01 (and 01 in Path B). The archetype spec's DENSITY, RADII, and ELEVATION sections drive these decisions. When you paste the spec into Prompt 01, the AI reads those sections and generates the correct scale + shadow model for that archetype automatically. You do not make these choices manually — the spec does.

The governance gate checks: no raw px values in components (must use spacing tokens), no banned radius violations (no pill-rounded Sentinel buttons), no cross-archetype shadow mixing. These run as part of STEP 5 (governance overrides) in Prompt 01.

What's next

The token architecture is now complete: colors (Ch04), typography (Ch06), and spacing/elevation (Ch07). Chapter 08 — Semantic Tokens covers the layer above all of this — how semantic aliases stitch the raw values into a coherent role vocabulary that components can consume without knowing which primitive sits underneath.

Act II — Token Architecture · Chapter 08

Semantic tokens

Primitives are raw values. Semantic tokens are role-based names. The difference is what makes a design system rebrandable in one file edit — and what makes AI-generated components coherent across a hundred screens.

You've seen the two-layer architecture in Ch03. This chapter unpacks the semantic layer in practice — what roles a design system actually needs, how the four universal base semantics extend into domain-specific vocabularies, and why naming tokens by meaning rather than appearance is the single most consequential architectural decision you'll make.

Primitives vs semantics — the practical difference

Type	Example	What it encodes	Who changes it
Primitive	`--primary-700: #1B2A4A`	A raw value — a specific hex, rem, or number	Rebrand, theme switch (dark mode flips primitives)
Semantic alias	`--color-primary: var(--primary-700)`	A role — what the color IS FOR (not what it looks like)	Role shift — the meaning of "primary" changes

Components reference semantics, never primitives. That's the rule. When dark mode activates, only primitives flip — semantic aliases keep pointing at the right primitives and components inherit the new values without a single component-level edit.

The four universal base semantics

Every archetype's governance spec requires a minimum semantic trio (success / warning / error). Most also require info. These four names are universal — every team, every product, every archetype uses them. The specific hex behind each varies per archetype (Harbor's success green is warmer than Sentinel's WCAG-verified green) but the role name never changes.

Semantic	What it means	Used for	Harbor default	Sentinel default
`--color-success`	Completed, healthy, positive	Success states, confirmation messages, progress complete	`#7BC97A` (warm green)	`#00A878` (cooler, AAA-verified)
`--color-warning`	Attention needed, degraded, risk	Warning badges, degraded state, non-blocking errors	`#E8B24D` (warm amber)	`#D97706`
`--color-error`	Failure, destructive, blocked	Form errors, destructive actions, outage states	`#E07B6D` (warm red)	`#DC3545`
`--color-info`	Neutral hint, supplemental context	Info banners, tooltips, neutral notifications	Not used (Harbor restrains palette to 3)	`#0066CC` via `--color-link-500`

Domain extension — extending without forking

The four base semantics cover most products. Specialised domains (monitoring, analytics, review tools, financial products) need more granular vocabulary. The pattern is always the same: extend by pointing at base semantics or primitives — never by introducing new raw hex values.

tokens.css · domain extension (Sentinel monitoring product)

/* Base semantic layer — governance.spec.md requirement */
--color-success: var(--color-success-500);
--color-warning: var(--color-warning-500);
--color-error:   var(--color-error-500);

/* Domain extension — severity scale for a monitoring product */
--severity-critical: var(--color-error-700);   /* darker = more severe */
--severity-high:     var(--color-error);      /* aliases base semantic */
--severity-medium:   var(--color-warning);    /* aliases base semantic */
--severity-low:      var(--color-warning-300); /* lighter primitive */
--severity-info:     var(--color-info);       /* aliases base semantic */

/* Now the AI building alert badges uses --severity-critical, not a raw hex.
   Rebrand → primitive updates → severity-critical follows automatically.
   The domain extension chain means no maintenance debt. */

Semantic role vocabulary — per archetype

Each archetype has its own semantic role vocabulary beyond the four base semantics. The names reflect the archetype's use-case — what kinds of surfaces and text roles the system needs.

Role category	Harbor	Sentinel	Lattice
Surfaces	--color-bg, --color-surface, --color-surface-2, --color-surface-offset, --color-surface-dynamic	--surface-page, --surface-card, --surface-sunken, --surface-sidebar, --surface-header	--chrome-bg, --chrome-bg-elevated, --chrome-bg-input, --chrome-bg-row-hover, --chrome-bg-row-selected
Text	--color-text, --color-text-muted, --color-text-faint, --color-text-inverse	--text-primary, --text-secondary, --text-tertiary, --text-disabled, --text-link	--chrome-text, --chrome-text-muted, --chrome-text-faint, --chrome-text-disabled
Borders	--color-divider (hairline), --color-border (component outline)	--border-subtle, --border-default, --border-strong, --border-focus	--chrome-border-subtle, --chrome-border-default, --chrome-border-strong
Accent	--color-primary, -hover, -active, -subtle, -text (4 states, 1 hue)	--color-primary-700 (bg); --color-accent-500 (CTA bg only)	--chrome-accent, --chrome-accent-subtle (SEPARATE from data palettes)

Prompt 01 emits the semantic layer as part of its :root block. When you paste the archetype spec, the DENSITY, THEME, and TOKEN NAMING sections tell the AI exactly which role names to use. This is why swapping between Harbor and Sentinel produces tokens.css files with completely different semantic vocabulary — not just different hex values, but different architectural shapes.

What's next

The token architecture is complete. Chapters 9–11 are the payoff: each archetype documented as a full reference system — its token decisions explained, its component priorities laid out, its anti-patterns catalogued. Chapter 9 — Harbor is first: the editorial archetype in full.

Act III — Reference Systems · Chapter 09

Harbor — the editorial archetype

Dark-first, single-hue, fluid serif display. The complete reference system for editorial sites, marketing pages, portfolios, and services firms. Structure felt, not seen.

Live Harbor spec preview

Dimension	Harbor decision	Why
Theme	Dark-first	Editorial reading is longer and more atmospheric than product UI. Warm dark surfaces reduce visual fatigue.
Token naming	Semantic only — no numeric ramps	Editorial systems use ~15 surface roles total. Numeric ramps add complexity without payoff.
Typography	Fluid clamp() for all sizes; serif italic for display emphasis	Reads at any width without media queries. Serif italic is the curated-content signal.
Accent	Single hue, 4 states	A single accent is the strongest editorial signal — the page reads as a composition, not a control panel.
Elevation	Background tinting; warm oklch shadows for floating elements only	Cold rgba shadows look sterile against warm palette.
Density	Generous (4px base, 14-step integer scale)	Magazine rhythm — whitespace is the design.
Iconography	Phosphor (light/regular weight)	The only widely-available family with multiple weight variants. Has editorial warmth Lucide lacks.

The spec — copy and paste

The Harbor spec is what you paste into Prompt 01 (second paste, after governance). Open the Ch03 dropdown, select Harbor, and hit COPY — or grab it from kit/specs/harbor.spec.md. The spec drives every structural decision in the token file the prompt generates.

→ Go to Ch03 archetype dropdown to copy the Harbor spec

Six section templates

Harbor's section vocabulary is finite and deliberate. Any marketing page is an assembly of these six — the AI assembles, never invents a new section type. This is what prevents the drift toward generic SaaS layouts.

Template	Structure	Use for	Key constraints
01 · Hero Editorial	h1 (clamp 2.5–6rem) + lede (max 56ch) + single CTA	Opening sections, page hero	One CTA. No carousel. No multi-CTA.
02 · Value Rows	Eyebrow + 3–5 numbered rows (display serif numbers + heading + 1–2 sentence body)	Features, benefits, how-it-works	Never the 3-col "icon above title" grid. Numbers over icons.
03 · Asymmetric Split	40/60 or 60/40 image+text	Product + narrative juxtapositions	Never 50/50 — too symmetric for editorial. No image carousels.
04 · Metrics Block	3 oversized numbers (--text-hero) + small labels	Proof points, impact stats	Max 3 numbers. No "+1,247% hyperbole". --color-surface-dynamic background.
05 · Case Study	Image + bold pull quote (italic serif) + outcome metric	Social proof, client stories	No headshot circles. No star ratings. No testimonial slider.
06 · CTA Block	Single sentence + single button	Conversion moments, between sections	One primary CTA. No arrows on submit. Never "Get Started".

Anti-pattern catalog

Harbor bans the patterns AI builders drift toward by default. These are the specific clichés that make generated marketing pages look like templates rather than designed products. Prompt 01's governance-override step enforces these as hard gates.

Banned

Why

Use instead

Gradient circle icons

Signals "generic SaaS" instantly — the most overused pattern in AI-generated UIs

Single-color Phosphor line icons on neutral backgrounds, inheriting currentColor

3-column "icon above title" feature grid

Most cliched layout on the web. Harbor invented the Value Rows template to replace it

Numbered Value Rows (template 02) or Asymmetric Split (template 03)

"Unlock the Power of…" / "Revolutionize…" CTAs

Meaningless hyperbole that every SaaS uses. Erodes trust

Specific claims with numbers ("Cut expense reports from 45 min to 3")

Arrows on every CTA button

Visual noise that weakens the navigation-arrow signal when you actually need it

Arrows only on navigation links, never on form submits or primary actions

Floating labels (placeholder-as-label)

Hides labels on focus, breaks autofill, fails a11y — Harbor spec bans this

Labels above inputs, always visible. Hint text below if needed.

Indigo / purple in palette

The cliched "AI startup" signal. Every generated system goes here unprompted

Warm accent hue per spec. Default teal. Single hue enforced.

Strong border + strong shadow on same element

Double visual weight competes with the content inside the element

One or the other. Shadow for floating (modal, popover). Border for anchored.

Stock photography of smiling people

Signals template origin immediately — kills editorial authenticity

Product screenshots, abstract imagery, or no imagery at all

What Prompts 03–08 produce for Harbor

Prompt	Harbor-specific output
Prompt 03 · Foundation	Buttons (teal bg + dark text primary; outline secondary; ghost tertiary). Fields with labels above. Avatar component defaults to initials — stock photo variant banned. Tag/badge in pill shape.
Prompt 04 · Composite	Pull quote (italic serif, left rule, 52ch, no quote marks). Dialog with serif italic title. Drawer (right edge, 400px). Process diagram (numbered with display serif numerals).
Prompt 05 · Patterns	Form layout (single column, labels above, no multi-column). Article header (eyebrow + h1 with italic emphasis + lede). Service grid (2-col, not the banned 3-col). Pricing block (max 3 tiers, background tint for featured).
Prompt 08 · Spec page	Live-rendered spec page — every component in dark and light mode, WCAG contrast table, governance section. The shape is what build/preview/specs/harbor.html demonstrates.

When to pick Harbor vs other archetypes — see Ch02's failure modes table and the Ch02 decision tree. The rendered spec page is always the fastest way to verify whether the generated system matches Harbor's character before shipping components.

Act III — Reference Systems · Chapter 10

Sentinel — the enterprise archetype

Light-first, compact, primitive + semantic two-layer architecture. The complete reference system for dashboards, monitoring tools, admin products, and B2B SaaS. Clear, helpful, respectful of expertise.

Live Sentinel spec preview

Dimension	Sentinel decision	Why
Theme	Light-first	Enterprise products run in office lighting for 8+ hour sessions. Light mode is the default expectation.
Token naming	Primitive ramps (Layer 1) + semantic aliases (Layer 2)	9-step granularity for status badges, charts, table tints, hover/active/selected states. Components only touch Layer 2.
Typography	Fixed modular (1.125 ratio, 14px base), sans-only	Dashboards need pixel-perfect table alignment. Serif decoration competes with information density.
Accent	Multi-ramp + semantic; the Accent-CTA rule is foundational	Enterprise products encode meaning in color. 9-step ramps give precise hover/active/selected states.
Elevation	Flat with hairline borders; cold neutral rgba shadows for floating only	Borders communicate structure precisely. A shadow always means "this is floating above the page."
Density	Compact (4px base, percentile naming --space-25 to --space-1000)	Product UI is task-oriented. Whitespace costs density. Percentile naming allows future insertions without renaming.
Iconography	Lucide (2px stroke, 24px grid)	The modern enterprise / SaaS convention. Neutral and functional. 2px stroke matches Sentinel's compact density.

The spec — copy and paste

The Sentinel spec is what you paste into Prompt 01 (second paste, after governance). Open the Ch03 dropdown, select Sentinel, and hit COPY — or grab it directly from kit/specs/sentinel.spec.md. The two-layer token architecture, the Accent-CTA rule, and every structural decision in the generated token file come from this spec.

→ Go to Ch03 archetype dropdown to copy the Sentinel spec

The Accent-CTA rule — Sentinel's keystone

Sentinel's anchor orange (#FF9900) scores 2.30:1 against white — a WCAG FAIL for body text. Against dark text (#1C1917) it scores 7.0:1 — AAA. The rule that drops out: --color-accent-500 is a background color only. Never a text color on a light surface. Primary CTAs are always orange-bg + dark-text. White-text-on-orange is banned everywhere in the system.

Prompt 01 applies this as a governance gate. If you supply a BYOC seed that fails this same test, the prompt flags it before emitting a single token. This is the single most important WCAG behavior the prompt enforces for Sentinel.

The data table — Sentinel's signature component

Sentinel's most architecturally distinct component is the data table. It encodes everything the archetype stands for: tabular numerals, compact row height, keyboard navigation, column resize, column reorder, bulk-action bar, virtualized rendering for 1000+ rows. Prompt 03 generates this as part of the foundation set. The table inherits all Sentinel token rules — fixed 14px type, percentile spacing, hairline border, semantic color for status columns.

Feature	Implementation	Archetype rule
Number columns	`font-feature-settings: "tnum" 1` + right-align	Tabular numerals non-negotiable — column alignment collapses without them
Row height	--space-150 (12px) vertical padding	Compact enough for 20–30 visible rows; generous enough for keyboard users
Status columns	Badge (--text-caption) with color + icon + text	Triple encoding — color alone fails colorblind a11y
Selection	Checkbox per row + Shift-click range + Cmd/Ctrl-click multi	Keyboard-first interaction model for Sentinel users
Bulk actions	Header morphs to action bar when rows selected	Pattern from Linear, Notion, GitHub — Sentinel users recognize it
Empty state	In-table empty message (not replacing the chrome)	Sentinel never full-screen-blocks for empty data

Anti-pattern catalog

Banned

Why

Use instead

White text on --color-accent-500 (orange)

2.30:1 — WCAG FAIL. The Accent-CTA rule is foundational to Sentinel.

Dark text on orange bg (7.0:1 AAA). Accent is always background-only.

Fluid clamp() typography

Dashboards require pixel-perfect table alignment. Fluid sizing breaks column consistency.

Fixed px modular scale (1.125 ratio, 14px base).

Serif typography

Decoration competes with information density. Sentinel is data-forward, not editorial.

Sans-only (Inter or equivalent humanist sans). Any serif declaration fails lint.

Zebra-striped table rows

Visual noise without information. Old-pattern that adds cognitive load.

Hover state + subtle 1px row dividers. Selection via checkbox, not color.

Components referencing Layer 1 primitives directly

Breaks the two-layer architecture. Rebrand requires touching every component.

Components consume Layer 2 semantic aliases only. Never --color-primary-700 directly.

Warm oklch shadows

Harbor's signature. Looks tonally inconsistent on a light-surface enterprise product.

Cold neutral rgba shadows, floating elements only. Default surfaces use hairline borders.

What Prompts 03–08 produce for Sentinel

Prompt	Sentinel-specific output
Prompt 03 · Foundation	Button (orange bg + dark text for primary — accent-CTA rule). Field with labels above + "(required)" inline (not asterisk-only). Checkbox/radio/switch at 16px (larger than default). Badge with square --radius-xs (not pill).
Prompt 04 · Composite	Data table — the signature component — with tabular numerals, keyboard navigation, virtualization, bulk-action bar. Command palette (⌘K). Date picker with keyboard navigation.
Prompt 05 · Patterns	Search + filter pattern (applied filters always visible as chips, never hidden). Wizard / Stepper (sticky progress header + sticky footer). Detail header (breadcrumb + entity title + status badge + metadata row).
Prompt 08 · Spec page	WCAG verification table required — Sentinel governance enforces it. Every text/background pairing enumerated. build/preview/specs/sentinel.html demonstrates the full output.

Act III — Reference Systems · Chapter 11

The specialist archetypes

Harbor and Sentinel cover 60–70% of products. Current (mobile-first), Lattice (data-dense), and Meridian (long-form reading) cover the cases where neither generalist archetype fits — and where forcing one onto the wrong use case costs more than not picking.

The specialist archetypes share the same generation pipeline as Harbor and Sentinel. Paste governance + archetype spec into Prompt 01, run Prompts 02–08 in sequence. What changes is the spec — and the spec changes everything.

Current — mobile-first consumer

The specs — copy and paste

Each specialist archetype has its own spec in the Ch03 dropdown and in the kit. Copy yours before running Prompt 01.

Archetype	Copy from	Kit file
Current	Ch03 dropdown → select Current → COPY	`kit/specs/current.spec.md`
Lattice	Ch03 dropdown → select Lattice → COPY	`kit/specs/lattice.spec.md`
Meridian	Ch03 dropdown → select Meridian → COPY	`kit/specs/meridian.spec.md`

Current is the only archetype where the primary input device is a thumb, not a cursor. Every architectural decision follows from that constraint.

Constraint	Current's response	What breaks if you skip it
Touch targets	44px minimum hit area on every interactive element (enforced as governance gate)	Sub-44px = accessibility failure. iOS reports it. Users file bugs.
iOS Safari font zoom	All input font-size ≥ 16px — hard floor	Below 16px, iOS auto-zooms the viewport on field focus. Page jumps. Users abandon.
Haptic feedback	Every button press, toggle flip, swipe-action commit triggers a haptic (paired with visible state change)	Haptic alone is an a11y fail. Haptic without visible state is equally wrong.
Bottom-anchored navigation	Tab bar (4–5 destinations, max) at the bottom. Hamburger nav banned as primary.	Navigation hidden in a hamburger → engagement drops measurably. Primary nav must be one tap from any screen.
Bottom sheets over modals	Use bottom sheets for contextual options, filters, quick forms — not center-screen modals	Modals on mobile are hard to dismiss one-handed. Sheets swipe-down natively.

The Current spec also defines six screen layouts (Tab Bar Shell, Stack Navigation, Bottom Sheet, Full-Screen Takeover, Onboarding Wizard, Empty State) as the only layouts Prompt 05 assembles into. The AI is constrained to these six — it cannot invent a seventh. This is the same vocabulary constraint Harbor applies to section templates.

Live Current spec preview

Lattice — data-dense terminal

Lattice is the inversion of Meridian. Where Meridian maximizes comprehension, Lattice maximizes density. Every pixel earns its place. The density doctrine is the architectural keystone — every decision passes through it.

Constraint	Lattice's response	What breaks if you skip it
Two-color-system rule	Chrome palette (restrained grays) and data palette (diverging / sequential / categorical) are completely separate. They never share a hex value.	Chrome using red → "red row" loses its signal. Data color competes with chrome color. Operators make mistakes.
Monospace numerics	All numeric columns use `--font-number` (monospace) with `font-feature-settings: "tnum" 1`	Proportional fonts → decimal points misalign in columns. Looks like a bug. Operators notice in seconds.
Keyboard-first discipline	Every interactive action has a documented keyboard shortcut. Every tooltip shows the shortcut. Command palette (⌘K) is the primary input method.	Terminal operators spend 8 hours/day in the product. Actions without shortcuts feel broken.
Zero animation on data updates	Cell flash, row pulse, update animations banned. State changes are instant.	Motion fatigue is real over 8-hour sessions. A cell that pulses on every update becomes unbearable by hour two.
Row height user-configurable	--row-height-compact (24px) / --row-height-standard (32px) / --row-height-comfy (40px)	Operators have preferences. Fixed row height makes the product feel like a design decision was imposed on them.

Live Lattice spec preview

Meridian — long-form reading

Meridian is the only archetype where body text is the hero. Everything else exists to serve reading — and reading has well-documented ergonomic thresholds that this archetype enforces as governance gates, not preferences.

Constraint	Meridian's response	What breaks if you skip it
Reading-width hard cap	Body content hard-capped at 68ch. Tables and code blocks allowed up to 80ch since they're scanned, not read prose.	Past 80ch, eye fatigue accumulates. Users start skimming. Engagement drops over sessions longer than 15 minutes.
Body size floor	--text-body = 17–19px (clamp). Governance gate lint-fails any --text-body declared below 17px.	16px is fine for product UI. 16px for sustained reading imposes a cognitive load that compounds over time.
Line-height floor	--leading-body = 1.75. Governance gate lint-fails tighter values.	Tighter leading → lines run together → reading pace slows → comprehension drops.
Three themes required	Light + sepia + dark. All three defined. BYOC validation rejects light-only palettes.	Readers who marathon-read switch to sepia or dark to reduce eye strain. No sepia = product loses these users.
No animation on body content	Scroll-triggered reveals, fade-in paragraphs, skeleton shimmer all banned in body content.	The eye follows motion involuntarily. Animation in reading content breaks reading flow every time it fires.

Live Meridian spec preview

Hybrid products — when one archetype isn't enough

Most real products have surfaces that span two archetypes. The governance spec documents the five canonical hybrid patterns:

A note on the product names below: these are illustrative references to publicly-recognisable products that exemplify each hybrid pattern, not endorsements or partnerships. The named products almost certainly do not describe themselves using this book's archetype vocabulary; the mapping is observational, applied here to make the patterns concrete. Trademarks are property of their respective owners.

Combination	Real-world examples	Strategy
Harbor + Sentinel (marketing site + product app)	Linear, Notion, Stripe, Vercel	Separate tokens.css per surface. Shared brand primitive hex; independent semantic layers and type scales.
Sentinel + Current (web app + mobile app)	Slack, Figma, Asana	Shared brand color anchor. Otherwise distinct systems. Don't try to responsive-down Sentinel to mobile.
Sentinel + Lattice (product shell + data terminal)	Datadog, Linear command bar, Snowflake	Keep Lattice scoped to specific routes. Navigating into a Lattice route changes chrome density visually.
Harbor + Meridian (marketing + docs)	Stripe Docs, Tailwind, Vercel Docs	Shared header brand. Docs reading experience must NOT inherit Harbor's atmospheric hero patterns.
Sentinel + Meridian (product app + in-app docs)	Notion help, GitHub Docs	Meridian-styled content pane renders inside Sentinel's chrome (right drawer, modal, dedicated route).

The universal hybrid rule: keep token files separate. Each archetype owns its own semantic layer, typography scale, density scale, component priority order, and banned patterns. What you share across archetypes is exactly: brand primary hex, high-level voice principles, and universal governance gates.

When no archetype fits cleanly

Some products genuinely don't match any single archetype — a creative tool (Figma, Photoshop), a real-time collaborative canvas, a 3D viewer, a video timeline editor. Don't force a fit. Instead, follow this decision sequence:

Identify the closest neighbour. Which archetype's governance gates overlap most with your needs (touch targets, density, reading width, animation budget)? Inherit that archetype's gates as your baseline.
Document what you're overriding. Open a new file your-archetype.spec.md in kit/specs/. Inherit Sentinel (or closest match) and explicitly list each gate you're changing and why. The list is your archetype's identity.
Add at most one new layout type. If your product needs a canvas, timeline, or 3D viewport, add it as a sixth Prompt-05 layout — not a replacement. The constraint vocabulary still wins.
Run the kit anyway. Prompts 02–08 still produce the right outputs as long as the spec is concrete. The kit doesn't know whether you're Sentinel or a custom archetype — only what your spec says.

Anti-pattern: writing a brand-new archetype from scratch. The five archetypes encode hundreds of accumulated decisions. Inheriting from the nearest one and overriding 5–10 gates is faster, lower-risk, and easier to maintain than greenfield.

What Prompts 03–08 produce for specialist archetypes

Prompt	Current output	Lattice output	Meridian output
Prompt 03	Button (44px min, pressed state replaces hover, haptic-light on press). Tab bar (bottom-anchored, 4–5 destinations). FAB (56px, fixed bottom-right).	Icon button (every one with keyboard shortcut in tooltip). Field (mono for number inputs). Status dot (8px — Lattice innovation).	Article (structural primitive). Heading (auto-id + anchor). Body paragraph (68ch cap). Footnote reference (superscript).
Prompt 04	Bottom sheet (peek/half/full snaps, swipe-down dismiss). Action sheet (iOS-style with Cancel). Pull-to-refresh (haptic-light on threshold).	Pane (structural primitive, resizable). Splitter (resize handle, keyboard resize). Toolbar (with keyboard hints). Filter bar (chips always visible, never hidden).	Sidebar TOC (scroll-spy). Callout box (note/tip/warning/danger). Code block (syntax highlight + copy button). Pull quote (serif italic, 52ch).
Prompt 05	Tab bar shell (full layout, safe-area aware). Onboarding pager (3–4 screens, skip from any step). Search experience (clear-X when filled).	Data grid (virtualized, all Lattice column features). Sparkline (inline mini-chart in table cells). Command palette (⌘K — PRIMARY input method).	Article layout (D1). Doc layout (D2 — left sidebar + right TOC). Reader mode (D4 — distraction-free). Print stylesheet (D5 — required for Meridian).
Prompt 08	Spec page with both light + dark mode rendered (required). Touch target visualization. preview/specs/current.html	Spec page dark-first. Two-color-system demonstrated. Keyboard shortcut table. preview/specs/lattice.html	Spec page with light + sepia + dark themes rendered. Reading-width demonstrated. preview/specs/meridian.html

Act IV — Components & Patterns · Chapter 12

Component specification

When you spec a component, you are not describing pixels. You are writing an API contract. The AI builds from the contract, not from what the component looks like.

Prompts 03–05 generate components from the archetype spec and tokens.css. The archetype spec tells them which components to build; the tokens tell them which values to use. This chapter covers what makes a component spec complete enough for the AI to get it right the first time — and what Figma Component Properties add when you're on Path B.

Five elements of a complete component spec

A component spec is complete when it answers these five questions. Leave any one out and the AI fills the gap from training-data defaults — which means the statistical average of the web, not your archetype.

#	Element	What it captures	Button example
1	Variants	The different roles the component plays	primary / secondary / ghost / destructive
2	Sizes	Dimensional variants (height, padding, type size)	sm (h:32px) / md (h:40px) / lg (h:48px)
3	States	Interactive and async states — all must be visually distinct	default / hover / focus-visible / active / disabled / loading
4	Tokens consumed	Which semantic tokens the component references — never primitives	--color-primary (bg) / --color-primary-text (fg) / --radius-md / --space-3 inner / --space-4 outer
5	Content rules	What goes inside and what doesn't — label only, icon + label, icon only; max/min label length; what's banned	Label required. Icon optional (instance swap). Arrow banned on form submit. Max label 32 chars. Never ALL CAPS on Harbor.

The archetype spec's component library section (which Prompt 03 consumes) pre-fills all five elements per component per archetype. You don't author the spec from scratch — you paste it. The AI executes from the spec's contract.

Figma Component Properties — the API contract for Path B

On Path B, Claude reads component specs directly from Figma via MCP. Figma Component Properties are the structured format that makes this work — they encode the logical structure of a component (what props it has, what types they are) separate from the visual renders. The AI reads properties directly rather than inferring structure from screenshots.

Four property types, each with a direct TypeScript equivalent:

Figma type	What it stores	TypeScript equivalent
Boolean	Toggle on/off — icon visible, loading state, destructive mode, required indicator	`prop?: boolean` with a sensible default
Text	User-authored content — label, placeholder, helper text, error message	`prop: string` or `prop?: string`
Instance Swap	Swappable sub-component — icon slot, avatar, prefix element, trailing indicator	`prop?: ReactNode` or a typed icon union
Variant	Named enumeration — variant, size, color role, position, theme	String union: `"primary" \| "ghost" \| "destructive"`

Button worked example — properties → TypeScript → Storybook

Six properties. One Figma component. Three outputs (TypeScript interface, Storybook argTypes, CSS implementation) all derived from the same source without manual transcription.

Button · Component Properties → TypeScript interface

/* Properties defined in Figma:
   variant      Variant  "primary | ghost | destructive | accent"
   size         Variant  "sm | md | lg"
   label        Text     "Save changes"
   icon         Instance Swap  → Icon component
   iconPosition Variant  "left | right"
   loading      Boolean  false                                 */

interface ButtonProps {
  /** Visual role — maps to Figma variant "variant" */
  variant?: "primary" | "ghost" | "destructive" | "accent";
  /** Touch target size — maps to Figma variant "size" */
  size?: "sm" | "md" | "lg";
  /** Label — maps to Figma text property "label" */
  children: React.ReactNode;
  /** Icon slot — maps to Figma instance swap "icon" */
  icon?: React.ReactNode;
  /** Icon placement — maps to Figma variant "iconPosition" */
  iconPosition?: "left" | "right";
  /** Loading state — maps to Figma boolean "loading" */
  loading?: boolean;
  onClick?: (e: React.MouseEvent) => void;
}

Component build order

Prompts 03–05 run in a strict order. Each tier composes from the tier below — never the reverse. A composite component (Modal) must not be built before the foundation components it needs (Button, Icon) exist.

03

Foundation — atomic, no dependencies on other components

Button · Input · Badge · Icon · Avatar · Checkbox · Radio · Switch · Tag · Chip. Each standalone. Each consumes only tokens, never other components.

04

Composite — composed from foundation components

Card · Form (field groups) · Modal · Drawer · Tabs · Dropdown · Toast · Tooltip · Alert · Loading states. Each explicitly declares which foundation components it composes.

05

Patterns — archetype-specific page-level compositions

Data table · Form layout · Empty state · Search + filter · Dashboard widgets · Archetype-specific templates (Harbor section templates, Sentinel List + Detail, etc.). Each is unique to one or more archetypes.

What's next

Chapter 13 — Layout Patterns covers how components assemble into layouts — the responsive rules, the container-width tokens, and why each archetype has its own finite layout vocabulary the AI must respect.

Once components are generated (Prompts 03–07 via Chain 3), Prompt 14 drives Chain 6 — Storybook Generation to produce living documentation: one *.stories.tsx per component with every variant, every state, and a11y annotations wired in. The component spec from this chapter becomes the Storybook story tree without a manual transcription step.

Act IV — Components & Patterns · Chapter 13

Layout patterns

Don't let the AI invent layouts. Give it a finite vocabulary — then it assembles rather than creates, and the result reads as a designed system, not a generated template.

Every archetype's spec defines a closed set of layout shapes. Harbor has 6 section templates; Sentinel has 6 application layouts; Current has 6 screen types; Lattice has 6 workspace configurations; Meridian has 6 document layouts. The AI picks from this set. It never invents a seventh. This is how you get design-system-coherent layouts rather than layout drift.

Container widths and breakpoints as tokens

Every layout decision traces back to two types of tokens: content widths and breakpoints. When these are encoded as tokens rather than inline values, the AI uses the same number everywhere — and changing the system means changing one token, not grepping 200 CSS files.

tokens.css · container widths + breakpoints

/* Content widths — per archetype (example: Harbor) */
--content-narrow:   640px;   /* reading column, pull quotes */
--content-default:  960px;   /* articles, value rows */
--content-wide:     1200px;  /* case studies, hero compositions */
--content-max:      1440px;  /* full-bleed compositions */

/* Breakpoints — shared across archetypes (Harbor/Sentinel) */
--bp-mobile:        390px;   /* Current's primary target */
--bp-tablet:        600px;
--bp-desktop:       900px;
--bp-wide:          1200px;

/* If you ever change a breakpoint, one token edit updates every component.
   If the AI uses 898px in one component and 900px in another, those are
   two different breakpoints — and that is how design systems die. */

Layout vocabulary — five archetypes, five vocabularies

Each archetype's layout vocabulary reflects its primary use case. Harbor sections are about editorial rhythm. Sentinel layouts are about application chrome. Current screens are about mobile navigation patterns. Lattice workspaces are about pane configurations. Meridian documents are about reading experiences.

Archetype	Layout set	Count	Composition rule
Harbor	Section templates — Hero Editorial / Value Rows / Asymmetric Split / Metrics Block / Case Study / CTA Block	6	Any marketing page is an assembly of these six. AI assembles, never invents.
Sentinel	Application layouts — App Frame / Sidebar List+Detail / Full-Width Dashboard / Split Panel / Focus Mode / Empty App State	6	Every product page is one of these six. Chrome is fixed; content fills inside.
Current	Screen types — Tab Bar Shell / Stack Navigation / Bottom Sheet / Full-Screen Takeover / Onboarding Wizard / Empty State	6	Every screen is one of these six. Users already know how they work; not re-learning chrome per app.
Lattice	Workspace configurations — Multi-Pane / Full-Table / Chart Grid / Split Editor / Streaming Log / Modal/Settings	6	Operators learn the chrome once. Lattice never builds custom pane arrangements.
Meridian	Document layouts — Article / Doc (KB) / Landing/Index / Reader Mode / Print-Friendly / Search Results	6	Every page is one of these six. Navigation is conventional — readers must not re-learn how to navigate.

Prompt 05 (Pattern components) generates archetype-aware page patterns from this vocabulary. When you paste the Sentinel spec, Prompt 05 generates the Dashboard layout, the List + Detail layout, and the Focus Mode layout — because the spec encodes those as the expected patterns. You don't specify them explicitly. The vocabulary is in the spec.

Responsive: column collapse rules per archetype

Archetype	Primary breakpoint	Desktop layout	Mobile layout	What changes
Harbor	900px	Multi-column sections, generous spacing	Single column; section padding halves	Asymmetric splits → single column. Metrics stay 3-up.
Sentinel	900px	App frame + sidebar + main	Sidebar collapses to bottom nav or drawer	Left sidebar becomes overlay or tab bar. Data tables get horizontal scroll.
Current	N/A (mobile-first)	Single-column with tablet cap at 600px	Full-width, safe-area aware	Current is designed mobile-first — desktop is a graceful expansion, not a priority.
Lattice	1200px (wide-monitor primary)	Multi-pane resizable workspaces	Single pane; horizontal scroll on tables	Mobile is acceptable but less rich. Lattice is desktop-optimised by design.
Meridian	1200px (sidebar TOC)	Article + right sidebar TOC	Article only, TOC collapses to "On this page" expandable	Reading column stays at 68ch (already narrow). Sidebar TOC becomes in-page.

What's next

Chapter 14 — Design Patterns covers the smaller-scale patterns: how components combine into cards, forms, feedback states, and empty states — and why defining patterns by composition rather than screenshot is what makes AI-generated patterns reliable.

Act IV — Components & Patterns · Chapter 14

Design patterns

Patterns are how components combine into recognisable interactions. Define them by composition — which components, in what arrangement — not by what they look like. The AI can reassemble components; it cannot reliably reverse-engineer screenshots.

Card patterns

Cards are the most common pattern AI builders get wrong. The failure mode is always the same: the AI defaults to a generic "card with shadow and image" because that's the most common pattern in training data. Archetype specs ban or mandate specific card shapes precisely because of this drift.

Variant	Structure	Token usage	Archetype constraint
Flat (default)	1px --border-default, no shadow, --surface-card background	--radius-sm or --radius-md; --space-4 padding	Sentinel/Lattice primary card. No shadow ever on default surfaces.
Raised	No border, --shadow-sm, slightly elevated background	--shadow-sm; --surface-card or --surface-2	Harbor: warm oklch shadow. Sentinel: cold rgba. Current: soft mobile shadow.
Interactive (hover lift)	Flat + hover → translateY(-2px) + shadow appears	--transition-ui on transform + box-shadow	Harbor/Current. Full card is the tap/click target. Never nest multiple primary actions inside.
Stat	Oversized number (--text-hero or --text-2xl) + small label below	--color-primary or --text-primary for number; --text-muted for label	Harbor: serif display number. Sentinel/Lattice: sans + tabular figures. Max 34px for Sentinel.
Result / status	4px left border in semantic color + content	--color-success/warning/error on left border; --surface-card background	Triple encoding: border color + icon + text label. Color alone is banned (colorblind a11y).

Form patterns

Forms are where the most a11y failures happen in AI-generated systems. Three rules enforced by every archetype's governance spec:

1

Labels above inputs — always visible

Floating labels (placeholder-as-label) hide on focus, break autofill, and fail screen readers. Every archetype bans them. Harbor additionally bans asterisk-only required indicators — "(required)" inline text is mandatory.

2

Inline validation on blur — not on every keystroke

Keystroke validation feels nagging. The user is mid-thought. Validate on blur (when they leave the field) and on submit (final check). Exception: password strength meters and username availability checks run real-time because the feedback is the feature.

3

Error messages follow What / Why / How

Bad: "Invalid input." Good: "Email format isn't recognized. Try name@example.com." The governance spec's error-message formula requires all three: what happened, why it failed, how to recover. Generic errors ("Something went wrong") are banned.

Feedback patterns

Loading, empty, error, and success states are the patterns most commonly skipped in first-pass AI generation. When the archetype spec is pasted into Prompt 03–05, these states generate automatically as part of every component's state coverage. The governance spec requires all eight states for every interactive component.

Pattern	Structure	Duration trigger	Archetype-specific rule
Loading skeleton	Gray placeholder blocks matching final content shape. Never generic rectangles.	300ms–1s waits	Meridian: opacity-static, no shimmer (motion ban). Harbor: subtle pulse, never shimmer. Lattice: solid block, no animation.
Empty state	Icon + heading + helper text + primary action. Heading is action-forward, never apologetic.	Zero-data first load	Current: illustration (not just icon) + "Scan your first receipt", never "You have no receipts yet." Lattice: terse — "0 results. Adjust filters." (no illustration).
Error state	Error icon + What/Why/How message + retry action	Network failure, server error	Lattice: names the record/job/ID ("Job #4821 failed: timeout after 30s") — never generic.
Toast notification	Icon + message + optional action + auto-dismiss (4–5s)	Confirmation, background completion	Current: single snackbar, bottom-anchored above safe-area. Sentinel: max 3 stacked, bottom-right. Harbor: max 3 stacked, auto-dismiss 5s, never side-in fanfare.

What's next

Chapter 15 — Figma + AI Foundation opens Act V: the Figma integration layer. This is where the book shifts from token and component theory to the MCP workflow that reads from — and writes back to — the living Figma file.

Act IV — Components & Patterns · Chapter 14b

Component-to-archetype mapping

The same component is not the same component. Three primitives — Button, Card, Form field — implemented across all five archetypes, with the spec rule that drives each difference made visible. After this chapter, you can read any component in any archetype and predict what its variants will look like before you generate it.

Chapters 12–14 covered the doctrine: API-contract specs, finite layout vocabularies, composition over screenshots. This chapter is the worked example. Same primitive, five archetype implementations, side-by-side. The point is not to memorize the differences — it is to internalise that the archetype IS the difference. When you paste a different archetype spec into Prompt 03, the same component prompt produces different code, and that's by design.

Worked example 1 — Button

Button is the highest-frequency component. Each archetype's Button reflects its core value system: Harbor's editorial restraint, Sentinel's accent-CTA convention, Current's touch ergonomics, Lattice's keyboard-first density, Meridian's reading-paged minimalism.

Archetype	Default size	Filled / outlined / text	Driving spec rule
Harbor	md (h:40px)	One filled primary per section · outlined secondary · text tertiary	"One primary CTA per section" — editorial restraint, no button walls
Sentinel	md (h:36px)	Filled accent for primary actions · ghost (transparent) for secondary · destructive variant	"Accent-CTA rule" — never white-on-orange chrome; accent only for action triggers
Current	md (h:48px — minimum 44px touch)	Filled primary · stroked outline secondary · subtle text tertiary	"44px touch-target floor" — buttons never smaller than thumb-zone minimum
Lattice	md (h:32px)	Subtle bg + accent border for primary · transparent + border for secondary · text-only tertiary · destructive border-only	"NO LG button — terminals don't have generous buttons; everything fits in headers"
Meridian	md (h:40px)	Underlined link-style for inline actions · single subtle filled CTA per page	"Reading-mode minimal" — the article is the surface; buttons defer to the prose

Note what shifts and what does not. Every archetype has the same three semantic Button roles (primary, secondary, tertiary) but the visual encoding differs. The prop API stays consistent — `variant`, `size`, `disabled`, `loading` — so a component consumer never needs to know which archetype it's in. The archetype is invisible at the API surface and decisive at the rendering layer.

Worked example 2 — Card

Card is where Act IV's "composition not screenshot" doctrine pays the most. Cards are the most common AI failure mode (training data defaults to a generic shadow + image card). Archetype specs explicitly mandate or ban specific card shapes — that is the only thing that prevents drift.

Archetype	Default card shape	Banned	Use cases
Harbor	Warm oklch shadow + 12px radius + serif italic display title	Hard outline · square corners	Article preview · case study tile · testimonial
Sentinel	1px hairline border + 8px radius + tabular content area	Drop shadows · floating effect	Stat card · workflow step · settings group · result/status
Current	Soft elevation + 16px radius + edge-to-edge image	Hairline-only borders · sharp corners	Feed item · profile card · product tile
Lattice	Pane (border-only, 0–4px radius, no shadow) — NOT a "card" by archetype convention	"Card-based rhythm for data" — explicitly banned in spec	Lattice has panes, not cards. Use pane for any container of related rows.
Meridian	Almost-flat + 1px subtle divider + 8px radius (rare — most content is flowed not carded)	Shadows · raised effects · multiple cards stacked	Annotation aside · footnote group · table-of-contents block

Worked example 3 — Form field

Form fields are where a11y bans matter most. Every archetype's governance section bans floating labels, color-only error states, and asterisk-only required indicators — but the visual treatment differs:

Archetype	Label position	Required indicator	Error treatment
Harbor	Above input · serif italic display	"(required)" parenthetical in label	Border + icon + What/Why/How message below input
Sentinel	Above input · sans-serif medium weight	"(required)" parenthetical in label · NEVER asterisk-only	Red border + error icon + message · field gets aria-invalid="true"
Current	Above input · 16px+ for thumb-zone clarity	"(required)" parenthetical · inputMode set per field type	Border + icon + message · keyboard adjusts to input semantic
Lattice	Above input or inline-left · monospace for numeric fields	"(required)" parenthetical · density-aware	1px chrome-accent border ring (not 3px glow) + tooltip-style message
Meridian	Above input · forms are rare — generous spacing when present	"(required)" parenthetical	Border + icon + message · multi-paragraph error allowed for complex validation

Three things stay constant across all five archetypes: labels above inputs (never floating), error message follows the What → Why → How formula (Ch14), and color is never the only signal (icon + border + text always combined). These are governance gates, not archetype choices — they apply universally.

Reading any archetype's component list

The pattern: every archetype's spec has a COMPONENT LIBRARY section listing the components in priority order. The order itself is meaningful — Lattice lists Data Grid before Modal because it's used 50× more often in a terminal product; Harbor lists Hero before Modal for the same reason in editorial. When you read a new archetype, scan the component list before the rules. The list tells you what the archetype is for; the rules tell you what it forbids.

What's next

That closes Act IV. Chapter 15 — Figma + AI Foundation opens Act V: how the Figma Dev Mode MCP turns these specs into living code via the read/write bridge. If you're on Path A (no Figma), skip Act V and go to Ch20.

Act V — AI Workflows · Chapter 15

Figma + AI foundation

Figma Dev Mode MCP is the bridge between your design system and any AI builder. Before MCP: you described a Figma file to Claude in prose and hoped. After MCP: Claude opens the file itself, reads the actual token values, inspects the actual component tree, and builds code that matches exactly.

This chapter is the foundation for Act V (Ch15–19). It covers what MCP is and does, how to verify the connection, what MCP unlocks beyond just reading token values, and what to expect in terms of output accuracy.

Which path are you on?

The three paths were introduced in Ch00. This table is the practical routing: what you have, what you produce, and where to go from here.

Path	Starting point	What you produce	Where to go
A · Quick	No Figma file. Starting from 3 adjectives and a chosen archetype.	tokens.css → component CSS → domain CSS → self-contained spec HTML. No Figma required at any step.	Ch20 (Path A pipeline). Run Chain 9 in the kit for the guided sequence.
B · Standard	Figma file already exists. Components designed by your team.	Production code generated from the Figma source via MCP — token references read directly from the file, not guessed.	Continue here (Ch15–19). Ch20 is the Path A chapter — skip it.
C · Deep	Starting from adjectives (Path A), but Figma is the long-term target.	Run Path A first to generate tokens and components in code. Push the token layer to Figma Variables. Designer builds Figma components on top. MCP takes over from there.	Ch20 (Path A) first — Step 7 contains the Figma write-back prompt. Return here for the MCP workflow.

What MCP is

MCP (Model Context Protocol) is a standard that lets an AI model communicate with external tools — read files, call APIs, query databases. The Figma Dev Mode MCP is Figma's official implementation: it lets Claude read and write Figma files directly, without screenshots or manual token transcription.

This book covers a single MCP: the Figma Dev Mode MCP (official). Not open-source community servers. Every prompt and workflow in Ch15–19 is verified against Figma's vendor-supported implementation.

Read capabilities vs write capabilities

Direction	What it does	Used by
Read (Figma → tokens.css)	Reads frames, components, Variable collections, Text Styles, auto-layout specs, Component Properties, Code Connect mappings. Screenshots of specific frames.	Prompt 01 (Path B) — reads Variables for color/spacing/radii. Prompt 02 (Path B) — reads Text Styles for typography. Prompts 03–05 — reads component trees for structure.
Write (tokens.css → Figma)	Creates/updates Figma Variables and modes. Creates frames and component instances. Updates existing components with new variants. Syncs design tokens from code back to Figma.	Chain 8 write-back (Ch16 and onwards) — pushes a revised tokens.css back to Figma as Variable collections. Closes the read/write round-trip.

Accuracy depends on input quality

How much structured information you provide to Claude determines how close the output is to your design intent. This is not a limitation of the model — it is a structural reality. More signal → less inference → fewer gaps filled with training-data defaults.

Input provided	Accuracy	What the AI infers
Description only ("build a dashboard")	~40%	Everything — archetype, tokens, layout, components, spacing
Screenshot only	~70%	Token values (guessed from pixel colors), component states, spacing
Figma Dev Mode MCP (file structure)	~80%	Token application mapping, Text Style compatibility
+ Figma Variables (color/spacing tokens)	~90%	Typography tokens, domain-specific extensions
+ archetype spec pasted	~95%	Edge cases, context not in Figma (governance overrides, banned patterns)
Full stack: MCP + Variables + Text Styles + spec	~98%	Minimal inference — AI reads rather than guesses

Setup requirements

Requirement	Details
Figma Desktop app	Running locally on your machine. The MCP server talks to the local Figma app — not the browser version.
Dev Mode access	Figma Professional plan or higher. Dev Mode is not available on the free tier.
Compatible AI client	Claude Desktop, Claude Code, Cursor, VS Code with Claude extension, or any MCP-compatible client.
MCP configuration	Your AI client must be configured to connect to Figma's local MCP server. Figma's official docs walk the configuration — one JSON entry in your MCP settings.

Verifying the connection — diagnostic prompt

Before running any Figma-dependent prompt, confirm the MCP is connected and responding. Paste this diagnostic into your MCP-enabled Claude session:

MCP connection diagnostic

MCP CONNECTION DIAGNOSTIC — Figma Dev Mode MCP

List all Figma MCP tools you have access to.

For each tool, report:
1. Tool name
2. What it does (one sentence)
3. Required parameters
4. Optional parameters

Then confirm:
  - Is the Figma MCP server connected and responding?
  - Is a Figma file currently open in the Desktop app?
  - If yes: what is the file name and file key?

If no tools are listed, the MCP server is not connected.
Check: (a) Figma Desktop is running, (b) Dev Mode is enabled,
(c) your AI client's MCP configuration points at Figma's
local server URL, (d) you have restarted your AI client
since updating the MCP config.

Use when: starting any Path B workflow · verifying setup · debugging MCP failures
Kit file: kit/prompts/19-mcp-diagnostic.md

What MCP unlocks beyond setup

"Connect Claude to Figma" sounds like a single capability. It is not — it is a layer underneath six distinct workflow capabilities, each addressed by a specific chain in the kit. The reason MCP is the headline of Act V is that each of these would otherwise require copy-paste, screenshots, manual transcription, or a custom plugin.

Capability	What MCP enables	Without MCP	Where it lives in the kit
Token round-trip	Read Figma variables → tokens.css. Then code-side updates flow back via Prompt 15 — Figma library matches code, automatically.	Designer maintains Figma variables manually; engineer maintains tokens.css manually; quarterly catch-up sweeps	Chain 2 (extract) + Chain 8 (write-back)
Component-tree introspection	Claude sees the actual layer hierarchy, auto-layout settings, variants, and Component Properties — not a screenshot interpretation	Inferred from screenshots; AI invents structure that may not match the designer's intent	Phase 1 STUDY (Ch16) reads tree directly; Prompt 01 Path B
Component Properties as API contract	Read Figma's structured Boolean / Text / Variant / Instance Swap properties → TypeScript prop types directly	AI infers props from variant names; produces inconsistent or missing prop types	Ch12 — Component Specification; Prompt 13
Multi-file orchestration	Claude opens multiple Figma files (design system + product file + reference file) in one session and cross-references	One file at a time, copy-paste between contexts, lost mappings	Path C audit loops; Chain 4
Dependency graph reading	Claude follows component instances to their masters, reads override patterns, identifies which components are reused most	Manual inventory of which components are used where	Prompt 10 — Layer 3 extraction
Write-access mutation	Claude updates Figma variables via MCP write — no plugin install, no manual variable creation	Plugin installation per file, manual variable creation, no audit trail	Prompt 15 (Chain 8)

The MCP read/write layer is what makes the rest of Act V work. Ch16 (Three-Phase Workflow) uses it for token extraction. Ch17 (Claude Design) uses it for spec validation in Workflow B. Ch18 (Figma → Code) is the production output of MCP reads. Ch19 (Code → Figma) closes the round-trip via MCP writes. Without MCP, every chapter in this act becomes "paste a screenshot and hope."

What's next

Chapter 16 — The Three-Phase Workflow maps out the full Path B execution: Phase 1 (Figma → MCP → tokens), Phase 2 (tokens → components → patterns), Phase 3 (audit, write-back, drift monitoring). The MCP connection verified here is what the rest of Act V depends on.

Act V — AI Workflows · Chapter 16

The three-phase workflow

STUDY first. GENERATE second. REFINE third. Every AI design-to-code workflow that skips a phase produces mediocre results. The three-phase structure is not a preference — it's the minimum architecture for consistent fidelity.

Most teams ask the AI to "build this from Figma" in a single prompt and get something that's 70% right. The fix is to split the work into three sequential prompts, each with a different job and a different output artifact. The spec from Phase 1 constrains Phase 2. The comparison from Phase 3 is surgical, not vague.

The three phases

1

STUDY — extract, do not build

Ask Claude to analyze the Figma source and output a complete design specification. Explicitly forbid code generation. The spec is the artifact: colors, typography, spacing, components, effects, layout constraints — all extracted with exact values, no rounding, no interpretation.

— Output: spec.md (structured markdown, no code)

2

GENERATE — build from the spec, not the design

Hand the Phase 1 spec back to Claude with instructions to generate code that matches exactly. The AI works from the spec — a structured, unambiguous document — not from the visual design directly. No improvisation. Every value used must appear in the spec.

— Output: component code (references tokens, matches spec)

3

REFINE — surgical corrections only

Compare the generated output to the source. Submit corrections in a specific format: [Element]: [property] is X → should be Y. This format prevents the AI from reinterpreting — it applies mechanical changes, not creative ones. Never say "make it more like Figma." Say which element, which property, which value.

— Output: corrected code (only listed changes applied)

Phase 1 — STUDY prompt

The study prompt does the extraction. Run it with Figma Dev Mode MCP connected. The output is a structured spec document — not code. Explicitly block code generation in the prompt itself.

Phase 1 — STUDY (extract the design specification)

PHASE 1 — STUDY: Extract design specification

Study this Figma file: [FIGMA_URL]

Before writing any code, output a complete design specification
with these sections:

1. COLOR PALETTE
   - Every unique color with exact hex value
   - Usage role for each (background, text, border, accent, etc.)
   - Grouped into: primitives and semantic roles

2. TYPOGRAPHY SCALE
   - Every unique font size with exact value (no rounding)
   - Font-family, weight, line-height, letter-spacing per style
   - Suggested token name for each

3. SPACING MAP
   - Every unique spacing value (padding, margin, gap) in exact px
   - Grouped into a suggested token scale

4. COMPONENT INVENTORY
   - Every distinct component visible
   - Variants present, sizes used, interactive states visible

5. EFFECTS
   - Shadow values (x / y / blur / spread / color, exact)
   - Border radii used (exact px values)
   - Any transitions or animations

6. LAYOUT STRUCTURE
   - Container widths
   - Grid structures
   - Breakpoints inferable from source

Output as structured markdown with exact values throughout.

CRITICAL: Do NOT write any code. Do NOT suggest improvements.
Extract only what exists with exact values — no approximations.

MCP required: Figma Dev Mode MCP · Opus 4.7 · Claude Desktop
Kit file: kit/prompts/16-phase-1-study.md · part of Chain 10

Phase 2 — GENERATE prompt

Phase 2 — GENERATE (build from the spec)

PHASE 2 — GENERATE: Build code from Phase 1 specification

Spec from Phase 1: [paste Phase 1 output here]

Tech stack:
  Framework: [React + TypeScript | Vanilla HTML/CSS | Vue | Svelte]
  Styling: [CSS custom properties | Tailwind]
  Existing components at: [path, if any]
  Token file at: [path, if any]

Rules:
1. Use EXACT values from the spec. No rounding, no "approximately".
2. Define all colors and spacing as CSS custom properties in :root,
   grouped by category.
3. Every component references tokens via var(), never raw values.
4. Include hover, focus-visible, and disabled states for all
   interactive elements.
5. Use semantic HTML (button for buttons, not div with onclick).
6. Add aria-labels where visible text is absent.
7. Respect prefers-reduced-motion for any animation.
8. Do not improvise values not in the spec. If something is unclear,
   flag it with a comment and use the closest spec value.

Output: component code with tokens at top, component below.
No inline styles. No raw hex values in components.

Runs after: Phase 1 spec · Sonnet 4.6 · Claude Code (for file output)
Kit file: kit/prompts/17-phase-2-generate.md · part of Chain 10

Phase 3 — REFINE prompt

Refinement type	Wrong	Right
Vague direction	"The buttons don't look right — make them more like Figma"	`.btn-primary: padding is 8px 16px → should be 12px 20px`
Color fix	"The blue is off"	`--color-primary: is #2563eb → should be #1B2A4A`
Token swap	"Use the design system shadow"	`.card: box-shadow is 0 1px 3px rgba(0,0,0,0.1) → should be var(--shadow-md)`

Phase 3 — REFINE (surgical corrections)

PHASE 3 — REFINE: Surgical corrections only

Apply these corrections to the code. Modify ONLY what is listed.
Do not refactor, rename, or "improve" anything not on the list.

Format: [Element]: [property] is [current] → should be [value]

Corrections:
- [element]: [property] is [current] → should be [value]
- [element]: [property] is [current] → should be [value]
- [element]: [property] is [current] → should be [value]

Rules:
1. Make ONLY the changes listed.
2. Do not refactor or improve other code.
3. Do not "fix" other issues you notice — add them to a separate
   NOTE section if they are important, but do not change them.
4. Keep all existing structure intact.
5. Return the corrected code, not a diff.

Output the updated file(s).

Use after: Phase 2 output · visual comparison with source · Sonnet 4.6 · Claude Code
Kit file: kit/prompts/18-phase-3-refine.md · part of Chain 10

Closing the round-trip — Figma write-back

Phase 3 ends with the code corrected against the Figma source. But the inverse is also true: when REFINE updates a token value (e.g., a button's border-radius is fixed in code), Figma now drifts behind code. Chain 8 — Figma write-back closes that gap. Run Prompt 15 after a REFINE pass that materially changed token values; it reads the updated tokens.css, reconciles against the Figma variable library via MCP write access, and updates Figma to match. The three-phase workflow is then a complete round-trip — Figma → code → Figma — with code as the source of truth at the end of every cycle.

Act V — AI Workflows · Chapter 17

Claude Design

A full design critique, a rendered HTML prototype, and five rounds of layout iteration — from a description, a screenshot, and a spec, inside a single persistent session. Claude Design is the fastest first mile in any design system workflow. This chapter is the operating manual.

The seam with Figma is real and intentional: Claude Design handles speed, exploration, and critique; the Figma-first workflow (Ch15–16) handles artifact-level consistency at scale. These two compose — and every improvement to Claude Design makes both sides of the composition more powerful. This chapter covers what that composition looks like in practice, using Claude's full 2026 capability set.

Why Claude specifically — and not GPT-X or Gemini

Every prompt and chain in this kit was authored, tested, and tuned against Claude. The book's claims about reliability and determinism are claims about Claude's behaviour with these prompts — not claims about all frontier models.

Three Claude-specific features matter for design system work:

Projects with persistent system prompts. Upload your governance spec, archetype spec, and tokens.css once. Every conversation inside that Project inherits the full constraint layer without repasting. The constraint layer becomes ambient — and the kit's prompts assume that ambient context.
Artifacts (rendered HTML/CSS in-thread). Prompts 04, 05, and 08 generate live spec pages and component renders. Other models can produce the markup; Claude renders and re-renders it interactively in the same session, with the rest of the design system in scope. That is the difference between "reading a description of a layout" and "looking at the layout."
Extended Thinking on long, multi-constraint problems. Token derivation (Prompt 02), spec-page assembly (Prompt 08), and Layer 3 extraction (Prompt 10) all benefit from explicit chain-of-reasoning before output. Extended Thinking makes that reasoning visible, which lets you catch a wrong assumption before the AI commits to a wrong file.

The prompts will run on other models. The spec language is generic. But the dependability numbers in Ch01 ("95–98% fidelity from spec + tokens + MCP") are calibrated against Claude — if you swap models, calibrate locally before shipping. Models are not interchangeable for production design system work.

What Claude Design is in 2026

Six capabilities that define Claude Design's current power profile. No limitations in this list — scope comes next.

1

Visual critique from screenshots

Attach a screenshot. Claude identifies spacing inconsistencies, contrast failures, hierarchy problems, and token violations — cross-referenced against any spec you paste alongside. Critique is specific: element name, property, observed value, expected token. Not "this looks off" — "nav-item padding is 14px; the Sentinel spec requires 12px (--space-150)."

2

Rendered UI mockups via Artifacts

Claude generates complete, self-contained HTML/CSS that renders live in the Artifacts panel on claude.ai. What you see is actual rendered output — not a description of a design, not a wireframe approximation. Screenshot the Artifact and import it to Figma as a reference frame. This changes the Workflow A first mile from "imagine what Claude described" to "look at what Claude built."

3

Persistent design system context via Projects

Claude Projects let you upload the governance spec, archetype spec, and tokens.css as Project knowledge — once. Every conversation inside that Project inherits the full design system context without repasting. You do not start from zero each session. Design decisions accumulate. The spec becomes ambient, not a chore.

4

Extended Thinking for complex layout problems

Multi-constraint layout problems — responsive across three breakpoints, five nested component types, six token constraints — benefit from Claude reasoning through every constraint before proposing a layout. Extended Thinking surfaces the chain of reasoning so you can validate the logic, not just the output. Engage it deliberately; it costs more time and tokens than standard mode.

5

Multi-image vision for cross-screen consistency

Upload three to five screens in a single message. Claude analyses them simultaneously — comparing token application, spacing rhythm, and component behaviour across screens, not just within one. This is the only way to catch inter-screen drift in a single pass. Per-screen critique, run separately, never catches it.

6

Conversational iteration with design system memory

Within a Project, every iteration inherits the previous round's context. "Increase the row height by one step on the grid" works because the grid definition is in Project knowledge. "Change the accent to the warning colour" fails correctly — the spec's two-system rule forbids it. The Project context does not just remember; it enforces.

Full capability profile

The seam between Claude Design and Figma is a handoff point, not a deficiency. Left: what Claude Design does natively. Right: where the Figma source of truth takes over.

Claude Design does natively

Visual critique — from screenshot against a pasted or Project-resident spec. Per-screen and cross-screen (multi-image).

Rendered mockups — complete HTML/CSS Artifacts from a description + spec. Viewable, screenshottable, importable to Figma.

Layout iteration — fast exploration within a conversation or across sessions (Project). Changes are conversational.

Accessibility audit — visual layer: contrast, touch target size, alt text presence, visible focus indicators.

First-screen velocity — from zero to a rendered, spec-compliant layout faster than opening Figma.

Where Figma source of truth takes over

Token propagation at scale — a token change in Figma updates every component that references it. Claude Design cannot propagate across a product; it generates to the spec as given.

Component-to-token application — a Figma file encodes which element gets which token, inside which variant, in which state. Claude infers application from the spec; Figma makes it canonical.

ARIA and keyboard behaviour — visual critique cannot see focus order, role attributes, or keyboard handler logic. Code review via Ch18's MCP workflow is required.

Version history and rollback — a Figma library is versionable. A conversation is not a source of truth engineers can inspect or roll back to.

Projects — persistent design system context

The most consequential 2026 capability for design system work. A Claude Project stores files as persistent knowledge — the spec, the tokens, the governance rules — and every conversation inside the Project inherits them without repasting. For recurring design system work, this eliminates the single biggest friction in the workflow.

1

Create the Project and load knowledge

On claude.ai, create a new Project named after your product or design system. In Project Settings → Knowledge, upload: governance.spec.md, [archetype].spec.md, and tokens.css (once generated via Prompt 01). Optionally add a Project system prompt establishing the archetype, product name, and any product-specific constraints. This is a one-time setup.

2

Run design work without repasting

Every conversation inside the Project opens with full spec context. Ask Claude to generate a component — it already knows the archetype's token system, density rules, and banned patterns. Ask it to critique a screenshot — it already knows what "correct" looks like for your specific archetype. The spec is ambient. You work; you do not prepare to work.

3

Update the Project when the spec evolves

When tokens.css is updated (new component added, token renamed), replace the file in Project knowledge. All subsequent conversations inherit the update. Previous conversations in the Project are unaffected — they ran against the snapshot at the time. This is the correct behaviour: historical critique should be reproducible against the spec that was current then.

Project setup — system prompt for design system work

PROJECT SYSTEM PROMPT — design system context

Product: [product name]
Archetype: [Harbor | Sentinel | Current | Lattice | Meridian]
Design system version: [e.g. v1.2]

Context loaded in Project knowledge:
  - governance.spec.md (universal quality gates)
  - [archetype].spec.md (archetype-specific rules)
  - tokens.css (generated token output)

Your role in this Project:
You are a design system specialist working within the
[archetype] archetype. All generated components, critiques,
and recommendations must reference tokens from the loaded
tokens.css. All governance rules from governance.spec.md
apply unconditionally. Archetype-specific rules from
[archetype].spec.md override general defaults where
they conflict.

When generating HTML/CSS:
  - Use CSS custom properties from tokens.css, never raw values
  - Respect the archetype's spacing base, radii, and motion rules
  - Flag any request that would violate governance gates

When critiquing:
  - Reference specific token names, not just values
  - Use the severity system: Critical / Major / Minor
  - State the expected token alongside the observed value

Paste once into: claude.ai → Project Settings → Instructions · any model tier · update when spec changes
Kit file: kit/prompts/20-project-setup.md

Artifacts — rendered output in the browser

Artifacts transforms the Workflow A first mile. Before Artifacts, "Claude Design generates a mockup" meant Claude wrote code you had to copy, run locally, and screenshot. With Artifacts, the rendered output appears immediately in the claude.ai interface — viewable, shareable, screenshottable without leaving the conversation.

Workflow A without Artifacts

Claude outputs an HTML/CSS block → copy to editor → run local server → screenshot → paste into Figma as reference → build component using archetype tokens.

Friction: local environment required. Context switches between chat, editor, browser, Figma. One round-trip per iteration.

Workflow A with Artifacts

Claude outputs a rendered Artifact → view in Artifact panel → iterate in conversation → screenshot final state → paste into Figma as reference → build component using archetype tokens.

Result: no local environment. Iteration stays in the conversation. Multiple design directions in one session, each as a separate Artifact.

Generate a rendered UI mockup via Artifacts

RENDERED MOCKUP — Artifact output

[If not in a Project: paste governance.spec.md and
[archetype].spec.md here before this prompt.]

Component / screen to generate:
  [describe the component or screen]

Requirements:
  - Output as a complete, self-contained HTML file
  - Use only CSS custom properties defined in the loaded
    tokens.css (or pasted :root block)
  - No raw hex, no magic numbers in CSS — all values via var(--token)
  - Dark mode by default (if archetype is dark-first)
  - Include all interactive states visible at once
    (default, hover, focus, disabled) using adjacent variants
    in the layout — not JS toggle

Archetype constraints to enforce:
  - Spacing: [archetype]'s spacing base (2px | 4px | 8px)
  - Radii: [archetype]'s radius set (do not use pill if Lattice)
  - Typography: [archetype]'s type scale (fixed px if Lattice/Sentinel)
  - Motion: [archetype]'s duration tokens (instant if Lattice)

Output the HTML as an Artifact so it renders in the preview panel.

Use in: claude.ai with Artifacts enabled · Sonnet 4.6 recommended for speed · Opus 4.7 for complex multi-section layouts
Kit file: kit/prompts/21-artifact-mockup.md

Extended Thinking — when to engage it

Extended Thinking makes Claude reason step-by-step through a problem before producing output — surfacing the constraint chain so you can audit the logic, not just the result. It costs significantly more time and tokens than standard mode. Use it deliberately.

Task type	Use Extended Thinking?	Why
Multi-constraint layout Responsive across 3 breakpoints, 5+ nested component types, 6+ token constraints simultaneously	Yes	Extended Thinking traces which constraint takes precedence at each decision point. Without it, Claude resolves conflicts silently — you get an output but not a justification.
Token architecture decisions Designing new semantic token layers, naming a new data palette, extending the archetype spec	Yes	Token architecture has downstream consequences across every component. Extended Thinking surfaces the second-order effects (what breaks if this token is named this way) before you commit.
Single-component critique or generation Critiquing one screenshot, generating one component from a clear spec	No	Standard mode handles single-constraint tasks at 5× the speed. Extended Thinking adds latency with no quality gain on well-scoped problems.
Rapid iteration loops "Increase spacing by one step," "Swap to the warning colour," "Make this two-column"	No	Speed is the point of iteration loops. Extended Thinking breaks the conversation rhythm. Use standard Sonnet 4.6 and iterate fast.

Extended Thinking — layout architecture for complex screens

LAYOUT ARCHITECTURE — Extended Thinking

[Paste spec blocks if not in a Project]

Screen: [name and purpose of the screen]

Constraints — resolve all of these simultaneously:
  Responsive: [list breakpoints, e.g. 375px / 768px / 1280px]
  Components: [list nested component types, e.g. data table, filter bar,
               detail drawer, command palette]
  Token rules: [key constraints, e.g. Lattice 2px spacing base,
                flat elevation only, 32px row height default]
  Accessibility: [e.g. keyboard-navigable, WCAG AA contrast,
                  skip-link required]
  Layout archetype: [e.g. W1 Multi-Pane, W2 Full-Table — from spec]

Think through each constraint before proposing the layout:
  1. Which layout workspace template applies? Why?
  2. Which constraints are in tension? How do you resolve each conflict?
  3. What component hierarchy satisfies all constraints simultaneously?
  4. What would break at the narrowest breakpoint — and how do you
     prevent it within the archetype's rules?

Then produce the layout as an Artifact (rendered HTML/CSS).

Use with: Extended Thinking enabled · Opus 4.7 · claude.ai or Claude Desktop · multi-constraint screens only
Kit file: kit/prompts/22-extended-thinking-layout.md

Model selection — matching the task to the tier

Claude's three tiers (Opus 4.7 / Sonnet 4.6 / Haiku 4.5) differ in reasoning depth, speed, and cost. Opus is not always better — for critique and iteration loops, Sonnet 4.6 produces equivalent results at a fraction of the latency. Default to Sonnet 4.6; reach for Opus 4.7 when you need architectural depth.

Task	Model	Why
Archetype selection + initial token architecture Prompt 00 + Prompt 01, first pass	Opus 4.7	Architectural decisions. Errors here compound across every subsequent step. Invest in the depth once.
Full mockup generation from spec Artifact output, single complex screen	Sonnet 4.6	Well-scoped generation with spec in context. Sonnet 4.6 handles it reliably at significantly faster output speed.
Design critique (single or multi-image) Workflow C, quality gate before code	Sonnet 4.6	Critique is pattern-matching against a known spec. Strong performance at Sonnet tier. Reserve Opus 4.7 for edge cases with genuine ambiguity.
Rapid iteration loops "One step up on the spacing scale," "Add a loading state," "Make this RTL"	Sonnet 4.6	Speed is what matters in iteration. Sonnet 4.6 is fast enough to keep conversation rhythm without breaking flow.
Extended Thinking layout architecture Multi-constraint screens, token system design	Opus 4.7 + Extended Thinking	Complex constraint resolution benefits from Opus 4.7's full reasoning budget. Sonnet 4.6 with Extended Thinking is a valid fallback for budget-constrained workflows.

Multi-image vision — cross-screen consistency

Per-screen critique never catches inter-screen drift. If screen 3 uses --space-200 for card padding and screen 7 uses --space-300, a single-screenshot critique of each passes them both individually. Only a simultaneous multi-image critique catches the inconsistency.

1

Export 3–5 screens from Figma

Export screens that share a component type — all screens with a data table, all screens with a navigation header, all screens with a modal. The audit is most useful when the screens are comparable: same component, different contexts. Maximum 5 images per message for reliable analysis.

2

Upload all images in a single message

Attach all images to the same message — not one image per message. Claude's multi-image analysis compares them in the same context window. Sequential single-image critique cannot see all images simultaneously and cannot reliably reconstruct comparisons from memory.

3

Run the cross-screen critique prompt

The prompt below requests both per-screen issue lists and a consistency matrix — which properties are consistently applied across screens and which vary. The consistency matrix is the output that single-screenshot critique cannot produce.

4

Prioritise cross-screen inconsistencies

Any property flagged as inconsistent across screens is a Major or Critical issue — it will produce visible drift in the shipped product. Per-screen issues that are consistent across screens (e.g., all screens use the same wrong spacing value) may be a spec interpretation issue, not a designer error — investigate before flagging.

Three workflows — updated for 2026

Workflow	When	Sequence
A · Claude-first	Greenfield. No existing Figma component. MVP exploration. A new pattern before it is committed to the library.	Create Project with spec + tokens as knowledge → generate rendered Artifact from description → iterate in conversation → screenshot Artifact → import to Figma as locked reference frame → build Figma component using archetype tokens → MCP workflow takes over from Figma source
B · Figma-first (default)	Established Figma source exists. Need consistent, token-referenced production code. This is Ch15–Ch16's workflow.	Load Figma MCP in Claude Desktop (Project or session) → Phase 1 STUDY → Phase 2 GENERATE → Phase 3 REFINE → production code. Project context eliminates re-loading the MCP connection across sessions on the same codebase.
C · Hybrid critique loop	Designer finishes screens in Figma; needs a quality gate before engineering generates code.	Export 3–5 screens as PNGs → upload simultaneously → run cross-screen critique prompt (below) → designer fixes Critical/Major issues → re-export fixed screens → re-run critique to verify → MCP generates code from audited source

Ad-hoc (no Project)

Each session: paste governance.spec.md + archetype spec + tokens.css before running any prompt. Risk: paste an older version, forget the governance block, or skip it when in a hurry. Context resets completely on session end.

Use when: one-off critique, exploring a new archetype, or when the spec is still changing rapidly and a stable Project is premature.

Project-configured

Spec loaded once in Project knowledge. Every session opens with full context. No paste required. Updates to the spec propagate by replacing the file in Project settings. Context survives session ends, browser restarts, and team member handoffs (shared Projects).

Use when: the spec is stable and design work is recurring. The setup cost (10 minutes) is amortised across every subsequent session.

Prompt reference — all Claude Design workflows

Five prompt cards for the full workflow set. The Project setup prompt runs once. The rest run per-task.

Prompt card 1 (Project setup) is in the Projects section above. Cards 2–5 follow.

Cross-screen consistency audit — Workflow C (multi-image)

CROSS-SCREEN CONSISTENCY AUDIT

Archetype: [Harbor | Sentinel | Current | Lattice | Meridian]
Screens attached: [list screen names, e.g. Dashboard, Detail, Settings]
Token system:
  [paste :root block from tokens.css — or load from Project]

PART 1 — Per-screen issues
For each screen, check every visible element for:
1. COLOR — raw hex not in token system
2. SPACING — gaps or padding not on the archetype grid
3. TYPOGRAPHY — font size, weight, or family outside the scale
4. BORDER RADIUS — value not in the archetype's radius set
5. ANTI-PATTERNS — any pattern from the archetype's banned list

Output: ranked issue list per screen
  Critical / Major / Minor
  Format: [screen] · [element] · [property] · [observed] → [expected token]

PART 2 — Consistency matrix
Compare all screens against each other for:
- Shared component types (e.g., all tables, all nav headers, all modals)
- Token application consistency (does the same element use the same
  token across screens, or does it vary?)
- Pattern consistency (same interaction pattern coded the same way?)

Output: a table with rows = component type, columns = screen name,
cells = CONSISTENT / INCONSISTENT + note on the inconsistency.

VERDICT (one per screen + one overall):
  PASS / FIX BEFORE CODE / REWORK
Overall: CONSISTENT / DRIFT DETECTED (list cross-screen issues)

Use with: 3–5 screenshots attached simultaneously · tokens.css · Sonnet 4.6 · claude.ai or Claude Desktop
Kit file: kit/prompts/23-cross-screen-audit.md

The durable integration pattern

Every improvement to Claude Design strengthens the workflow on both sides of the seam. Better generation makes Workflow A's first mile faster and closer to spec. Better critique makes Workflow C's quality gate more precise and catches more drift before code ships. The system's value comes from the composition — a spec-loaded Project, rendered Artifacts, and a multi-image critique loop is significantly more capable than any one of those parts alone.

The next chapter (Ch18 — Figma → Code) picks up where Workflow A's Artifact handoff ends: the Figma component is built, the archetype tokens are applied, and the MCP workflow takes over to generate production code from a canonical source of truth.

Act V — AI Workflows · Chapter 18

Figma → Code

Five specialized prompt templates for turning Figma designs into production code. Each handles a different starting condition: pixel-perfect clone, framework migration, token extraction, or surgical refinement.

Scenario matching — which prompt when

Scenario	What you have	What you want	Prompt to use
A · Pixel-perfect	Figma frame with exact spec	HTML/CSS/React that matches exactly — no reinterpretation	Prompt A below (spec-first, then generate)
B · Existing codebase	Figma frame + existing component library	New component using existing primitives, new tokens for what's missing	Canonical template below (context-heavy)
C · Token extraction	Figma file with Variables	tokens.css + tailwind.config.js extracted from Variables	Token extraction prompt below
D · Refinement	Generated code + comparison to source	Surgical corrections — specific properties only	Phase 3 REFINE (Ch16) or Prompt E below

Canonical Figma → React template

The base template for Path B. Adapts to any framework — swap the framework/styling lines. The six elements (source pointer, context block, numbered task list, output format, critical rules, content slot) are the architecture every design-to-code prompt should inherit from.

Canonical Figma frame → React component

Read this Figma file: [FIGMA_URL]

Context:
  Framework: React with TypeScript
  Styling: CSS custom properties (tokens.css at src/tokens.css)
  Existing components at: src/components/ui/
  Design tokens at: src/tokens.css

Please:
1. Convert the "[FRAME_NAME]" frame to a React component
2. Use existing components where they match (check src/components/ui/)
3. Preserve all spacing using the token scale in tokens.css
4. Extract any new colors as CSS custom properties if not in tokens.css
5. Make it responsive (mobile-first, use archetype breakpoints)
6. Add meaningful TypeScript interface with prop names

Output format:
  - Single .tsx file, TypeScript interface at top
  - Named export for the component
  - New CSS custom properties appended to tokens.css (if any)
  - Short JSDoc comment explaining the component's role

Rules:
  - Use var(--token-name) for all values. No raw hex or magic numbers.
  - Semantic HTML: button for buttons, nav for nav, etc.
  - Every interactive element has hover, focus-visible, disabled states.
  - aria-label on any icon-only element.

MCP required: Figma Dev Mode MCP · Claude Sonnet · Claude Code

Token extraction prompt

Run this before component work on any Figma file that has Variable collections. Produces tokens.css + (optionally) tailwind.config.js in one pass. This is the Path B equivalent of Prompt 01 in Path A — but it reads from the live Figma artifact rather than generating from the archetype spec.

Token extraction from Figma Variables

Read the Figma file and extract every design token.

Source: [FIGMA_URL]

Output two files:

1. src/tokens.css
:root {
  /* Colors — grouped by role */
  --color-primary-50: #...;
  /* ... */

  /* Typography */
  --text-xs: clamp(...); /* or fixed px per archetype */
  /* ... */

  /* Spacing, radius, shadow, motion */
}

[data-theme="dark"] {
  /* Primitive overrides only — semantic aliases stay */
}

2. (Optional) tailwind.config.js — only if project uses Tailwind
   Extend theme so utilities map to tokens:
   colors: { primary: { 50: 'var(--color-primary-50)' } }
   spacing: { 1: 'var(--space-1)' }

Rules:
- Extract EXACT values. No rounding.
- Group semantically per the archetype TOKEN NAMING section
- Include a brief comment for non-obvious tokens
- Do NOT generate components. Token extraction only.
- Flag any WCAG contrast failures in a comment (WCAG AA = 4.5:1 body text)

MCP required: Figma Dev Mode MCP · Sonnet 4.6 · Claude Code

Pixel-perfect clone with token enforcement

Scenario A from the routing table. Use when the Figma frame is the contract — a designer-approved hi-fi mock that must ship at byte-level fidelity. The trick is enforcing tokens while preserving exact visual output. Without enforcement, the AI freelances; with it, the AI maps every value to a token or flags it as a deliberate one-off.

Pixel-perfect clone — token-enforced

Read this Figma frame and produce pixel-perfect HTML/CSS.
The frame is canonical — match it exactly, do not reinterpret.

Source frame: [FIGMA_URL]
Token system: src/tokens.css (paste full :root block below)
Archetype: [Harbor | Sentinel | Current | Lattice | Meridian]

Token enforcement rules — ZERO exceptions:
1. Every color value → var(--color-token-name) from tokens.css.
   If a color in Figma does not exist in tokens.css, DO NOT use it.
   Flag it: "[element]: Figma value #ABCDEF has no matching token.
   Closest match: var(--color-...). Use match? OR add new token?"

2. Every spacing/radius/shadow → var(--space-N) / var(--radius-N) /
   var(--shadow-N). Off-grid values flagged the same way.

3. Every typography value → match exactly to the type scale tokens.

4. Every transition → use --duration-* + --easing-* tokens.

Output format:
- HTML structure matching Figma layer hierarchy
- CSS using tokens only — no raw values anywhere
- A comment block at the top listing every flagged divergence
  ("Differs from spec at: [element], [property]")

Critical rule: do NOT silently fix divergences. List them. The
designer decides whether each is a one-off or a token addition.

MCP required: Figma Dev Mode MCP · Sonnet 4.6 · Claude Code · paste tokens.css inline
Kit file: kit/prompts/24-pixel-perfect-clone.md

Framework migration — Tailwind to vanilla CSS via Figma source

Common scenario: a codebase shipped in Tailwind, the design system is moving to vanilla CSS with custom properties (or the reverse). Use the Figma file as the canonical reference and migrate component-by-component. The key is to NOT translate Tailwind classes literally — use Figma as the source of truth for what each component should look like, then write fresh CSS against tokens.

Framework migration via Figma source

Migrate this component from [SOURCE_FRAMEWORK] to [TARGET_FRAMEWORK].
Use the Figma file as the canonical source — not the existing code.

Existing component: [paste source code]
Source framework: [Tailwind | styled-components | CSS modules | vanilla]
Target framework: [Tailwind | styled-components | CSS modules | vanilla]
Figma source: [FIGMA_URL] · frame: [FRAME_NAME]
Token system: src/tokens.css (paste below)

Steps:
1. READ the Figma frame and extract its design intent
   (spacing, typography, color usage by role).
2. IGNORE the existing component's literal class names.
   They may carry baggage from old conventions.
3. WRITE fresh code in the target framework using:
   - Token references from tokens.css (not raw values)
   - Semantic HTML matching the Figma layer hierarchy
   - Component states (hover, focus-visible, disabled)
     even if not visible in Figma's default state
4. PRESERVE the existing component's prop API surface
   (props in → props out the same), so consumers don't change

Output:
- New component file in target framework
- Migration notes: what changed, what stayed
- List of any tokens that need to be added to tokens.css
- Test surface: what props/states should be exercised post-migration

Do NOT translate class-by-class. Re-derive from Figma + tokens.

MCP required: Figma Dev Mode MCP · Sonnet 4.6 · Claude Code · component-by-component migration
Kit file: kit/prompts/25-framework-migration.md

Refactor a legacy component to match the archetype spec

The component shipped before the spec existed. It works, but it doesn't follow the archetype's banned patterns, governance rules, or token naming. Use this prompt to refactor it into compliance without changing its behavior. The output is a drop-in replacement that passes Chain 4 (drift audit) and Chain 5 (a11y audit).

Legacy component refactor — to archetype spec

Refactor this legacy component to comply with the archetype spec.
The behavior must stay identical — only the implementation changes.

Legacy component: [paste source code]
Archetype: [Harbor | Sentinel | Current | Lattice | Meridian]
Specs (paste both):
  governance.spec.md: [paste]
  [archetype].spec.md: [paste]
Token system: src/tokens.css (paste)

Refactor rules:
1. Replace every raw value with the matching token from tokens.css.
   If no matching token exists, propose a new one and add it inline.
2. Apply the archetype's banned patterns list — anything banned that
   the legacy code uses must be removed and replaced.
3. Apply governance state coverage: every interactive element must
   have default / hover / focus-visible / active / disabled.
4. Apply governance copy hygiene: remove "Please", marketing fluff,
   "Submit"/"OK" buttons, etc.
5. Apply ARIA correctness — aria-label on icon buttons, aria-expanded
   on disclosure widgets, aria-live for async messages.
6. Keep the prop API EXACTLY as it was — consumers must not break.

Output:
- The refactored component (drop-in replacement)
- A diff summary: what was banned, what was added, what was renamed
- A short test plan: which props/states to verify post-refactor
- A note on whether prop renames are needed (if so, provide a
  deprecation shim so the old API works for one MINOR version)

Run Chain 4 (drift audit) and Chain 5 (a11y audit) after refactor.
Both must pass with zero Critical findings before merging.

MCP optional: Connect Figma if reference exists · Sonnet 4.6 · Claude Code · per-component
Kit file: kit/prompts/26-legacy-refactor.md

Act V — AI Workflows · Chapter 19

Code → Figma

The less-common direction, but powerful: push an existing codebase back into Figma as a living component library. Three use cases, a six-phase pipeline, and one essential artifact — the component mapping report.

When to reverse the pipeline

Situation	What it means	What you get
Code exists, Figma doesn't	A scrappy product shipped before design caught up. You want a Figma library that matches what's live.	Figma component library extracted from code — accurate, ready for designer iteration
Figma drifted from code	Figma was the source of truth; reality diverged. Bring Figma back to match what actually ships.	Figma library re-synced to production state — restores Figma as a reliable reference
Tokens in code, not in Figma	Design system lives in tokens.css but Figma Variables haven't been set up. Designers can't use tokens.	Figma Variables populated from code tokens — designers and engineers finally share the same values

The six-phase reverse pipeline

1

Feed existing code

Provide components (source files), tokens.css, and screenshots of the live UI. The more context, the more accurate the mapping.

2

AI reads Figma library

With MCP connected, Claude reads the target Figma file to discover what already exists — existing components, Variable collections, Text Styles. This prevents duplication.

3

Component mapping report

Claude produces three sections: Direct Matches (code → Figma path), Partial Matches (exists but missing variants), Figma Assembly Spec (must be built, with auto-layout specs and child hierarchy). This is the most valuable artifact in the workflow — review it before any writes.

4

Build in Figma via MCP write

With write-capable MCP, Claude pushes the Figma Assembly Spec items into Figma: creates frames, applies auto-layout, sets Variables. Direct and Partial Matches are updated or extended as specified.

5

Designer polishes

Human designer adjusts auto-layout, adds documentation, refines variants, ensures Components match brand standards. The AI produces the structure; the designer finalises the presentation.

6

Figma becomes authoritative

Once polished, Figma is the canonical source. Switch to the normal Path B MCP workflow — future code generated from the now-accurate Figma source.

Component mapping report prompt

Component mapping report (Code → Figma)

Analyze my existing codebase and the target Figma file.
Produce a component mapping report.

Codebase: [paste components or file path]
Target Figma library: [FIGMA_URL]

Output three sections:

1. DIRECT MATCHES
   Code components that exist identically in the Figma library.
   For each: code path → Figma library path

2. PARTIAL MATCHES
   Code components that exist in Figma but are missing variants
   or states. For each:
   - What exists in Figma
   - What is missing
   - Recommended action

3. FIGMA ASSEMBLY SPEC
   Components in code that do not exist in Figma at all.
   For each, specify:
   - Name and target location in Figma library
   - Auto-layout direction (horizontal | vertical | grid)
   - Padding, gap, sizing constraints (exact px)
   - Child component hierarchy as ASCII tree
   - Variants to create (if any)

Rules:
  - Preserve exact naming conventions from the codebase
  - Use Figma auto-layout terminology (not CSS flexbox)
  - All measurements in px (Figma native)
  - Do NOT modify Figma yet. This is planning only.

Review the report with the designer before any writes.

MCP required: Figma Dev Mode MCP (read) · Sonnet 4.6 · Claude Desktop
Kit file: kit/prompts/27-mapping-report.md · feeds into Chain 8

Operational write-back — Chain 8 + Prompt 15

The mapping report above is the planning artifact. The execution is Chain 8 — Figma write-back, powered by Prompt 15. Where the mapping report tells you what needs to be added or changed, Chain 8 actually performs the writes — variable by variable, with a reconciliation table you confirm before any change reaches Figma.

Stage	What you do	Output
1. Plan	Run the mapping report prompt above. Review with the designer. Decide which Direct Matches, Partial Matches, and Assembly Spec items to action.	Reviewed mapping report — the design decision document
2. Reconcile	Run Prompt 15 (Chain 8). Choose a scope: A (values only), B (names + values), C (add missing only), D (full sync). The prompt produces a reconciliation table — every variable, MATCH / DIFF / MISSING / ORPHAN status — and pauses for your confirmation.	Reconciliation table with proposed writes
3. Write	Confirm. Prompt 15 issues MCP write calls one variable at a time, reporting each before execution.	Figma variable library updated to match `tokens.css`
4. Verify	Open Figma → Local variables. Spot-check 10 values against `tokens.css`. Check Components page for any "missing variable" warnings.	Confirmed Figma sync (or list of broken connections to repair)

When to use which: the mapping report (above) is for the first round when Figma has gaps and you need a designer review before any writes. Chain 8 is for ongoing maintenance — quarterly token sync, post-release reconciliation, BYOC palette updates that need to flow back to Figma. Together they cover both the discovery and the operational sides of the Code → Figma direction.

Act VI — Pipelines & Production · Chapter 20

Path A — the code-first pipeline

No Figma file. Three brand adjectives. Fourteen prompts chained in sequence. Final output: a shippable design system with tokens, components, patterns, chrome, and a live spec page. Estimated runtime: 45–90 minutes on first pass.

Path A is Chain 9 in the kit. The fastest way to run it is to paste kit/chain.md into Claude Code and answer the six upfront questions. If you prefer to understand each step before running it — or to run individual prompts manually — this chapter walks the pipeline in sequence.

Prerequisites and timing

Prerequisite	What to have ready	Where to get it
Archetype confirmed	One of: Harbor / Sentinel / Current / Lattice / Meridian	Run Prompt 00 (kit) or use Ch02 decision tree
Two specs ready	`governance.spec.md` + `<archetype>.spec.md`	`kit/specs/` or the Ch03 dropdown
Brand colors (optional)	Primary hex + optional accent hex (BYOC path)	Your brand guidelines
Brand fonts (optional)	Display + body font names + CDN URLs (if web fonts)	Your brand guidelines
Claude Code or Claude.ai	Opus for Prompts 00–02; Sonnet for 03–08+	claude.ai or CLI

The fourteen-step pipeline

Step	Kit prompt	What it produces	Output file	Checkpoint
0	Prompt 00	Archetype recommendation + per-dimension reasoning	(decision only)	Confirm archetype before continuing
1	Prompt 01	Color, spacing, radii, motion tokens. BYOC if applicable.	output/tokens.css	Review palette + WCAG table
2	Prompt 02	Font families + full type scale. Brand fonts applied.	output/tokens.css (updated)	Review type scale in browser
3	Prompt 03	Button, Input, Badge, Card, Alert, Dialog, Table, Form, Empty state	output/foundation.css	Visual + a11y review
4	Prompt 04	Modal, Drawer, Tabs, Combobox, Accordion	output/composite.css	Composition review
5	Prompt 05	Archetype-aware page patterns	output/patterns.css	Pattern review
6	Prompt 06	Nav, Header, Footer, Sidebar, Modal Sheet, Drawer	output/chrome.css	Chrome review
7	Prompt 07	Product-specific components from 2-sentence descriptions	output/domain.css	Domain review
8	Prompt 08	Self-contained design-system-spec.html	output/design-system-spec.html	Full visual review
9	Prompt 09	Archetype section template library	output/section-templates.css	Template review
10	Framework wrappers (Prompt 13)	CSS → React / Vue / Svelte component files	output/.tsx / .vue	Framework review
11	Commit + lock	Save all output files as canonical. Never regenerate from scratch.	(git commit)	Ship
12+	Prompt 11 (ongoing)	Drift report against canonical tokens.css	(report)	Run on every PR
Post-stabilise	Prompt 10	Component-level tokens from repeated values	output/tokens-layer-3.css	After components stabilise

Running in Chain Mode vs ad-hoc

Chain Mode (recommended)

Paste kit/chain.md into Claude Code. Answer 6 upfront questions. The orchestrator runs steps 0–11 sequentially, saving each output, pausing at each checkpoint. Resumes from output/chain-state.md if you stop.

Open the kit →

Ad-hoc (one prompt at a time)

Paste each prompt from kit/prompts/ in sequence. Each prompt's meta-header checks prerequisites and tells you what to save. Use when you want to inspect each output before continuing, or when running a partial pipeline.

Time estimates and iteration notes

Scenario	First-run time	Iteration loops
Tokens only (Prompts 00–02)	10–20 min	Tweak adjectives or BYOC seeds; re-run Prompt 01. Components inherit automatically.
Foundation + composite (Prompts 00–04)	25–40 min	Adjust token output before running 03–04. Changing tokens after components ship means re-running all component prompts.
Full pipeline (Prompts 00–09)	45–90 min	Best iterative strategy: lock tokens first, then iterate on components. Never re-run token prompts expecting prior output — use Prompt 11 (drift audit) for verification.

Act VI — Pipelines & Production · Chapter 21

Atomic prompts

Individual prompts are the atoms of the system. This chapter documents the full catalog — what each prompt does, what inputs it requires, what it produces, and which chains it belongs to. Use this as a reference when running ad-hoc or building custom chains.

All 28 prompts (00–27, plus 01b as merged reference) live in kit/prompts/. The kit's README.html, Prompt Flow Map, and Full Prompt Catalog show them in context. This chapter explains why the boundaries between prompts are where they are — and the constraints that make each one a complete, independent unit.

The atomic prompt contract

An atomic prompt is complete when it can be pasted into any Claude session, given its documented inputs, and produce its documented output without requiring context from a prior conversation. This is not about size — some atomic prompts are 70 lines; some are 200. It is about self-containment.

Element	What it means for an atomic prompt
Meta-header	Detects environment (Claude Code vs Claude.ai). Checks for prerequisite output files. Asks branching questions (BYOC, fonts). Reads chain-state.md for carryover.
Main body	The generation instructions. Explicit constraints (ZERO font tokens in Prompt 01; ZERO component code in token prompts). Clear output format.
Meta-footer	Save instruction (file name and path). HTML preview offer where applicable. Chain-state.md append. Handoff announcement (next prompt, what to paste).

Prompt catalog

The catalog groups 28 prompts into six functional clusters. Generation prompts (00–13) compose into Chain 9 (Path A end-to-end). Validation & integration (14, 15) are post-generation. Path B workflow (16–18) is the three-phase Figma → code chain. Claude Design (20–23) covers Workflow A/C in claude.ai. Scenarios (24–26) are alternative production paths. Code → Figma planning (19, 27) prepares the chain that closes the round-trip.

Prompt	Role	Key inputs	Output	Chain(s)	Model
Generation (Path A core)
00	Archetype selection	3 brand adjectives	Archetype recommendation + reasoning	9	Opus 4.7
01	Token generation	governance.spec + archetype.spec (+ optional BYOC hex seeds)	output/tokens.css (color, spacing, radii, motion — NO typography)	1, 2, 9	Opus 4.7
01b	BYOC reference	Merged into Prompt 01 — not a separate paste	Reference only	—	—
02	Typography scale	output/tokens.css + archetype TYPOGRAPHY section (+ optional brand fonts)	output/tokens.css updated with type scale	1, 9	Sonnet 4.6
03	Foundation CSS	output/tokens.css + governance.spec + archetype.spec	output/foundation.css	3, 9	Sonnet 4.6
04	Composite CSS	output/foundation.css + specs	output/composite.css	3, 9	Sonnet 4.6
05	Pattern CSS	output/composite.css + specs	output/patterns.css	3, 9	Sonnet 4.6
06	Chrome CSS	output/tokens.css + composite + specs	output/chrome.css	3, 9	Sonnet 4.6
07	Domain CSS	output/tokens.css + components + 2-sentence descriptions	output/domain.css	3, 9	Sonnet 4.6
08	Spec page	All CSS files + governance.spec + archetype.spec	output/design-system-spec.html	7, 9	Sonnet 4.6
09	Section templates	output/tokens.css + component CSS + specs	output/section-templates.css	9	Sonnet 4.6
10	Layer 3 tokens	Locked tokens.css + all component CSS	output/tokens-layer-3.css	1, 9	Sonnet 4.6
11	Drift audit	Canonical tokens.css + target codebase (or Figma via MCP)	Drift report (severity-ranked)	4	Sonnet 4.6
12	A11y audit	Rendered HTML or component code + tokens.css	A11y report (WCAG 2.1 AA)	5	Sonnet 4.6
13	Framework wrappers	output/foundation.css + output/composite.css + framework target	.tsx / .vue / *.svelte	3, 6	Sonnet 4.6
Validation & integration
14	Storybook stories	output/framework/*.tsx + tokens.css + specs	output/stories/*.stories.tsx (one per component, all variants + states)	6	Sonnet 4.6
15	Figma write-back	output/tokens.css + Figma file via MCP (write access)	Figma variable library updated to match code	8	Sonnet 4.6
Path B workflow (Three-Phase chain)
16	Phase 1 — STUDY	Figma file URL via MCP	output/phase-1-spec.md (color, type, spacing, components, effects, layout)	10	Opus 4.7
17	Phase 2 — GENERATE	output/phase-1-spec.md + tech stack target	Component code with tokens at top, components below	10	Sonnet 4.6
18	Phase 3 — REFINE	Phase 2 code + drift list (surgical format)	Corrected files — only listed changes applied	10	Sonnet 4.6
19	MCP diagnostic	(none — runs against connected MCP)	Tool list + connection state report	2 (optional Step 0)	Sonnet 4.6
Claude Design (Workflows A & C)
20	Project setup	Product name + archetype + spec files (loaded into Project knowledge)	Configured Claude Project — persistent context across sessions	Ad-hoc	any
21	Artifact mockup	Component description + spec + tokens	Self-contained HTML rendered as Artifact in claude.ai	Ad-hoc	Sonnet 4.6
22	Extended Thinking layout	Multi-constraint layout problem + breakpoints + token rules	Reasoned architecture + rendered Artifact	Ad-hoc	Opus 4.7 + ET
23	Cross-screen audit	3–5 screenshots + tokens.css + archetype	Per-screen issues + cross-screen consistency matrix	Ad-hoc (refs 5)	Sonnet 4.6
Scenarios (Figma → Code variants)
24	Pixel-perfect clone	Figma frame URL + tokens.css :root + archetype	HTML/CSS matching Figma exactly with divergence list	Ad-hoc	Sonnet 4.6
25	Framework migration	Existing component + source/target framework + Figma reference	New component file + migration notes + token gaps	Ad-hoc	Sonnet 4.6
26	Legacy refactor	Legacy component + specs + tokens.css	Refactored drop-in replacement + diff summary + test plan	Ad-hoc	Sonnet 4.6
Code → Figma planning
27	Mapping report	Codebase components + target Figma library URL	Direct Matches / Partial Matches / Figma Assembly Spec	8 (optional Step 0)	Sonnet 4.6

Why the token / typography split (Prompts 01 and 02)

Prompt 01 generates color, spacing, radii, and motion. It generates zero typography tokens. Prompt 02 generates all typography. The split exists so you can iterate on the color palette without regenerating the type scale — and iterate on type without re-running the full token pass. The two files compose in output/tokens.css (Prompt 02 updates the same file in place). Changing either one after components ship requires re-running all downstream component prompts.

Act VI — Pipelines & Production · Chapter 22

Prompt chains

A chain is a sequence of prompts where each output is an input to the next. Ten chains cover the full design system lifecycle — from initial token generation to Path B production code to Figma round-trip. This chapter documents each chain's structure, entry point, and intended outcome.

The ten chains

Chain	Name	Path	Prompts	Output
1	Token cascade	A	01 → 02 → (10 post-stabilise)	output/tokens.css (complete)
2	Figma MCP extract	B	MCP read → 01 → 02	output/tokens.css from live Figma
3	Component generation	A/B/C	03 → 04 → 05 → 06 → 07 → 13	Full component library CSS + framework wrappers
4	Drift audit	C	11	Drift report (run on every PR)
5	A11y audit	A/B/C	12	WCAG 2.1 AA findings
6	Storybook generation	A/B/C	Framework wrappers → Storybook stories	*.stories.tsx per component
7	Spec page publish	A/B/C	08 + post-render credit row	design-system-spec.html
8	Figma write-back	B	tokens.css → Figma Variables via MCP write	Figma library updated to match code
9	Greenfield end-to-end	A	00 → 01 → 02 → 03 → 04 → 05 → 06 → 07 → 09 → 08 → 10	Complete design system (tokens + components + spec page)
10	Three-phase Path B workflow	B	16 (STUDY) → 17 (GENERATE) → 18 (REFINE)	Production code matching Figma source byte-for-byte

Chain 9 — the canonical Path A run

Chain 9 is what kit/chain.md orchestrates. The sequence is the standard Path A run — it produces a complete design system from adjectives. The human's decision points: adjectives, archetype confirmation, BYOC colors, brand fonts, domain component descriptions, and go/no-go at each checkpoint.

Key sequencing rules:

Tokens must be locked before components run (Steps 1–2 before Steps 3–7)
Foundation before composite (Step 3 before Step 4)
Spec page last (Step 8, after all component CSS is final)
Layer 3 extraction after components stabilise (not on first run)
Never re-run token prompts after components ship — use Prompt 11 (drift audit) to detect divergence

Composing custom chains

Chains 1–8 are building blocks. If you need a subset — tokens only, or components without chrome — combine the relevant prompts. The rule for any custom chain: each prompt must have all its prerequisite output files before it runs. The dependency graph is simple: token prompts have no kit-level prerequisites; component prompts need tokens; spec page needs components; audit prompts need the final locked files.

Act VI — Pipelines & Production · Chapter 23

Quality gates

A quality gate is a check that runs automatically and can block a merge. This chapter covers the four gates that belong in every design system CI pipeline — WCAG contrast, token drift, a11y, and visual regression — and the prompts that power them.

Four gates, four cadences

Gate	What it checks	When to run	Kit prompt	Blocking?
WCAG contrast	Every text-on-surface pair in tokens.css against the archetype's WCAG floor (AA minimum, AAA target on primary body)	On every tokens.css change	Prompt 01 Step 4 output (embedded in token generation)	Yes — any FAIL blocks PR
Token drift	Hardcoded hex/px values in component CSS that bypass tokens; archetype banned patterns	Pre-merge on every PR touching component files	Prompt 11	Yes — Critical findings block; Major items advisory
A11y audit	WCAG 2.1 AA: focus indicators, ARIA labels, color encoding, keyboard reachability, semantic HTML	Pre-merge on component changes; quarterly full audit	Prompt 12	Yes — Critical and Major findings block; Minor advisory
Visual regression	Screenshot diff of every component against baseline — catches unintended visual changes	Pre-merge on every PR	Storybook + Chromatic / Percy / custom (not a kit prompt — external CI tool)	Advisory — diffs require human review

The governance spec IS the quality gate

The governance spec's QUALITY-GATE CHECKLIST section encodes these gates as explicit checkboxes — 13 items from contrast verification to Storybook stories. When Prompt 08 generates the spec page, it produces an accessibility verification table computed from the actual token values (not estimated). That table is the WCAG gate's evidence.

The banned patterns in each archetype spec (Harbor's "no gradient circles", Sentinel's "no floating labels", etc.) are also quality gates — they run as part of Prompt 01's governance override step (STEP 5). Any component that generates a banned pattern has a governance failure, not a stylistic preference.

Drift audit in CI — the minimal setup

1

Lock canonical tokens.css in the repo

Commit output/tokens.css after Prompts 01 + 02 are complete and approved. This is the baseline the drift audit compares against.

2

Run Prompt 11 on every PR that touches component files

Paste kit/prompts/11-drift-audit.md + the changed component CSS into Claude. If Critical or Major findings appear, the PR blocks.

3

Review findings, fix or document exceptions

Not every finding is a bug — some are deliberate overrides. Document exceptions with a comment explaining why the hardcoded value is intentional. The next drift audit will see the comment and downgrade the finding.

Act VI — Pipelines & Production · Chapter 24

Failure modes

The twelve patterns that make AI-generated design systems fail in production. Each has a root cause, a detection signal, and a fix. Most are preventable by following the archetype spec. All are recoverable.

Pipeline failures

Failure	Signal	Root cause	Fix
Drift on re-run	tokens.css looks different after running Prompt 01 again with same inputs	Prompt 01 is interpretive, not deterministic. The same adjectives produce similar but not identical palettes across sessions.	Lock canonical tokens.css after first approved run. Never re-run from scratch — use Prompt 11 (drift audit) to detect and fix individual divergences.
Components don't inherit theme	Dark mode toggle doesn't affect components; some elements stay light	Component CSS uses hardcoded hex instead of semantic tokens. Violates the two-layer architecture.	Run Prompt 11 to find all hardcoded values. Replace with var(--token-name) references. The governance gate should have caught this at generation time.
Wrong archetype applied	Sentinel dashboard with warm oklch shadows; Harbor marketing site with tabular numerals; Lattice terminal with pill-rounded buttons	The wrong archetype spec was pasted into Prompt 01. The AI generates correct tokens for the wrong archetype.	Re-run from Prompt 01 with the correct archetype spec. The cascade is cheap — tokens take 5 minutes, components 30 minutes. Catching this early is always better than after patterns are built.
Typography overriding tokens.css	Font size inconsistencies between components; some use 14px, others 16px	Prompt 01 was run without Prompt 02 completing; or typography tokens weren't added before component generation started.	Ensure Prompt 02 completes and updates tokens.css before any component prompt runs. Components inherit from tokens.css — if typography isn't there yet, they fall back to browser defaults.

Archetype-specific failures (cross-reference Ch09–Ch11)

Failure	Appears in	Fix
White text on orange CTA	Sentinel (accent-CTA rule violation)	Prompt 01 flags this. If it shipped, Prompt 11 will surface it as Critical. Replace with dark text on orange.
Sub-44px touch targets	Current	Prompt 03 enforces 44px minimum per spec. If violated, Prompt 12 (a11y audit) surfaces it. Add padding to extend hit area.
Body text below 17px	Meridian	Meridian governance gate lint-fails --text-body below 17px. If downstream code overrides the token, Prompt 11 catches it.
Chrome color used for data series	Lattice (two-color-system violation)	Prompt 01 derives both color systems from the spec. If chrome and data palettes share a hex, Prompt 11 flags it as Critical.
Serif body font in Sentinel	Sentinel (sans-only rule)	Prompt 02 respects Sentinel's TYPOGRAPHY section. If a brand font was supplied that violates the sans-only rule, the prompt flags it back during validation.

Five AI-specific failure modes

Failure	Detection	Prevention
"Reasonable defaults" override spec	Generated tokens use popular SaaS values instead of archetype values (e.g., Inter instead of serif for Harbor)	The archetype spec is the constraint. If the AI is overriding it, the spec section was missing or truncated. Verify the two-paste workflow was followed.
Marketing copy leak into product UI	"Let's get started!" / "Unlock the power of…" in component labels or empty states	Archetype specs all inherit governance COPY HYGIENE rules. Run Prompt 12 — it checks copy against banned patterns. Fix with Phase 3 REFINE prompt.
Gradient circles as icons	Icon backgrounds with radial gradients in multiple colors	Harbor spec bans this explicitly. Prompt 03 should generate correct icons. If they appeared, paste the spec's BANNED PATTERNS section into a Phase 3 REFINE prompt.
Hallucinated component variants	Button has a "premium" variant not in the spec; Card has a "featured" variant not documented	Prompts 03–05 generate from the spec's component list. Extra variants mean the AI added unsolicited improvements. Run Phase 3 REFINE to remove: "remove the [variant] variant — it is not in the spec."
Missing focus-visible states	Keyboard navigation shows no focus indicator on interactive elements	Governance spec requires focus indicators on all keyboard-reachable elements. Prompt 12 surfaces this as Critical. Fix: add :focus-visible rules referencing --color-focus-ring token.

Act VI — Pipelines & Production · Chapter 25

Building for change

A design system is not a delivered artifact. It is a maintained contract. This chapter covers how the layered token architecture absorbs rebrands, how Layer 3 extraction adapts to product evolution, and the four ceremonies that keep a system alive after v1 ships.

How the layers absorb change

Change event	What to update	Blast radius	Prompt to use
Rebrand — new palette	Layer 1 primitive tokens in tokens.css	Zero component edits — semantic aliases pick up new values automatically	Prompt 01 (BYOC path with new seeds) or manual primitive edits + Prompt 11 to verify
Theme switch — add sepia mode	New [data-theme="sepia"] block in tokens.css (primitive overrides only)	Zero component edits	Prompt 01 (re-run with expanded spec) or manual addition
Brand font change	Font family tokens in tokens.css (Prompt 02's output)	Zero component edits — font tokens cascade	Prompt 02 with new font name supplied
New archetype variant	Layer 3 component tokens + new component CSS	Scoped to the component	Prompt 03–05 for new components; Prompt 10 for Layer 3 extraction
Spacing system evolution	New value inserted in percentile scale (--space-75 between --space-50 and --space-100)	Zero component edits — add the new token, existing references stay unchanged	Manual addition to tokens.css + Prompt 11 to verify

Surviving a rebrand — the BYOC swap workflow

Most production design systems will face at least one rebrand in their lifetime — new primary color, new brand font, new tone of voice, sometimes all three. The layered architecture (Ch07) makes this a one-day operation if it was set up correctly. Five steps:

Run Prompt 01 (BYOC mode) with the new brand inputs. Supply only the new primary hex, accent hex (if any), and font names. Keep the same archetype spec — only the brand inputs change. Output: a new tokens.css Layer 1 (primitives only).
Diff Layer 1 against the locked tokens.css. Confirm only primitive values changed (palette ramps, font tokens). If semantic aliases or component tokens shifted, the architecture has leaked — fix that before swapping.
Swap Layer 1 in a feature branch. Deploy to a staging environment. Components inherit the new values automatically because they reference semantic aliases, not primitives.
Run Prompt 11 (drift audit) and Prompt 12 (a11y audit). The new palette may break contrast on some pairings — Lattice's data-color and chrome rules, Sentinel's accent-CTA rule, Meridian's sepia mode. Fix any Critical findings before merge.
Re-run Prompt 08 (spec page). The rendered spec page is the engineering reference of record. The WCAG verification table is recomputed from the new values. Commit the new spec page alongside the new tokens.

If the rebrand also changes voice (e.g., "from Bloomberg-formal to Linear-confident"), re-derive the COPY HYGIENE section in the governance spec and re-run Prompt 12 — copy-banned patterns are part of the brand contract, not a separate concern.

Anti-pattern: a rebrand that touches Layer 2 or Layer 3 directly. If the new palette requires "this one component to use a different blue," that is a spec problem, not a token problem — fix the archetype spec, regenerate, do not patch the token tree.

The four maintenance ceremonies

1

Pre-merge gate (every PR)

Prompt 11 (drift audit) runs against every PR that touches component files. Critical findings block. Major findings are advisory but documented. This ceremony takes 2–3 minutes per PR and catches 90% of token violations before they compound.

2

Quarterly full audit (4× per year)

Run Prompts 11 + 12 against the entire codebase. Surface accumulated technical debt. Reset the baseline. Update the WCAG verification table in the spec page (Prompt 08) to reflect any token changes shipped since last quarter.

3

Post-stabilisation Layer 3 extraction (once per major component cycle)

After components ship and stabilise across 2–3 sprints, run Prompt 10 (Layer 3 extraction). It scans your locked CSS for values repeated 3+ times and proposes component-level tokens. Review the proposals, confirm, and commit the new Layer 3 file.

4

Spec page refresh (on every minor version bump)

Every time a token or component changes, re-run Prompt 08 (spec page) to generate an updated design-system-spec.html. The live rendered spec is the canonical engineering reference — keeping it current is a governance gate, not a nice-to-have.

Appendix · Org Playbook

Org playbook

Everything in this book can be run by one person in an afternoon. But design systems live at the intersection of design, engineering, and product — and that intersection requires a human playbook, not just a prompt library. This appendix covers roles, objections, and the three-month rollout.

Who does what

Role	Owns	Runs these prompts	Decision authority
Design system lead	Archetype selection, token decisions, governance spec, anti-pattern catalog	Prompt 00 (archetype), BYOC decisions, quarterly audit review	Archetype selection; what gets added to banned patterns; version bump decisions
Product designer	Component variants, section templates, domain components	Prompt 07 (domain descriptions), Workflow C critique loop, Phase 3 REFINE	Visual review at each checkpoint; go/no-go on component output before engineering uses it
Front-end engineer	Framework wrappers, CI gates, token file integrity	Prompts 03–06, 13 (framework wrappers), 11 (drift audit in CI)	Framework choice; implementation of CI gate; token naming in Layer 3
PM / stakeholder	Budget, timeline, adoption strategy	Does not run prompts; reviews spec page and component catalog	Prioritises which components ship in which sprint; approves the initial archetype choice

The three-month rollout

1

Month 1 — Token foundation + first components (Weeks 1–2: design lead + one engineer)

Choose archetype. Run Chain 9 to tokens + foundation. Ship Button, Input, Badge to engineering. Commit tokens.css as canonical. Set up Prompt 11 in CI. Design lead reviews spec page. First ceremony established: pre-merge drift gate.

2

Month 2 — Composite + patterns + adoption (Full team)

Run Prompts 04–07. Ship composite and pattern components. Domain components for core product surfaces. Designers use Workflow C critique loop before handing to engineering. Engineers run framework wrappers (Prompt 13). First quarterly audit scheduled.

3

Month 3 — Governance + Layer 3 + documentation

Run Layer 3 extraction (Prompt 10). Publish spec page as internal reference (Prompt 08 output). Write governance ADR: which archetype, why, what's banned and why. Run first quarterly full audit. Commit the four ceremonies as recurring calendar items.

Migrating onto an existing codebase

The three-month rollout above assumes a greenfield project. Most teams adopt this on a system that already ships — partial tokens, inconsistent components, accumulated drift. The migration playbook below is designed to deliver value from week one without forcing a rewrite.

W1

Week 1 — Diagnose, don't change

Day 1–2. Pick the archetype that best describes the existing product (Ch02). Don't change anything yet. Day 3. Paste the picked archetype spec + a sample of your component CSS into Prompt 11 (drift audit). Capture the report — this is your starting baseline. Day 4–5. Run Prompt 12 (a11y audit) on three to five representative pages. Combine both reports into a single "current state" doc. No code changes this week. Output: a concrete inventory of drift and a11y gaps, prioritised by Critical / Major / Minor.

M1

Month 1 — Token foundation in parallel, no breaking changes

Run Chain 9 to produce a clean tokens.css from the chosen archetype + your existing brand inputs (BYOC mode). Commit it as tokens.new.css alongside the existing system — do not replace yet. Pick one new component (or one being rewritten anyway) and build it against the new tokens to validate the foundation. Set up Prompt 11 to run on PRs that touch the new component only. By month-end you have a working two-system state — old for legacy, new for opt-in.

M2

Month 2 — Migrate component-by-component, ship continuously

Pick the highest-traffic component (typically Button, then Input, then Card). For each: rewrite using Prompt 03 against the new spec, run Prompt 11 to verify, swap call-sites in PRs of 5–10 files at a time. Each swap PR ships independently — no big bang. Critical drift findings block; Major findings are tracked but allowed in flight. By month-end, expect 30–50% of component surface migrated.

M3

Month 3 — Retire the legacy system

Finish remaining component migrations. Delete tokens.old.css (or the legacy equivalents) once no call-sites remain. Promote tokens.new.css to tokens.css. Run Prompt 10 (Layer 3 extraction) against the now-stabilised codebase to surface component-level patterns. Publish the spec page (Prompt 08) and write the governance ADR. Codify the four ceremonies (Ch25) on the calendar.

Q1

Quarter 1 close — the first full audit + Layer 3 lock

End of month 3 / start of month 4: run Prompts 11 + 12 across the entire codebase. Compare the report against your week-1 baseline — this is the migration's headline metric ("Critical findings reduced from N to 0", "AA contrast pass rate from 71% → 96%", etc.). Lock Layer 3 tokens. Make the new spec page the canonical engineering reference. Migration declared complete; the system enters maintenance mode.

Migration anti-patterns to avoid

Big-bang rewrite branch. A 6-week off-trunk migration branch will not merge cleanly. Always ship through trunk in 5–10 file PRs.
Mass-replace tokens before components are ready. Old components reading new tokens produce visual chaos — components that look broken without being broken. Migrate component first, then swap its tokens, then move on.
Skipping the week-1 baseline. Without the starting drift / a11y report, you cannot prove the migration delivered value. The baseline is the ROI evidence.
Doing M1 and M2 in parallel. Tempting, but the new tokens need to be locked before any component depends on them. Sequence is non-negotiable: tokens lock → components migrate → legacy retires.

Common objections and honest responses

Objection	Honest response
"The AI gets things wrong."	Yes, at ~70% fidelity from description alone. With specs + MCP + tokens, it's ~95–98%. The system in this book is specifically the constraint layer that gets you from 70% to 98%. Every prompt in the kit was designed to reduce inference and increase determinism.
"We already have a design system."	This book's workflow works with existing systems. Path B reads from your Figma file via MCP. Prompt 01 (BYOC) uses your existing brand colors. The archetype spec describes structural decisions you've already made — paste your system's rules into the relevant sections.
"We don't use Figma."	Path A needs no Figma at all. All generation runs from the archetype spec and your tokens.css. Path B and Chain 8 (write-back) require Figma and the Dev Mode MCP. The choice of path is exactly this: do you have Figma, or not?
"This will put designers out of work."	The system automates the mechanical work — token generation, component boilerplate, spec page rendering. The taste decisions, the archetype selection, the domain component descriptions, the go/no-go reviews — those are the designer's job throughout. More specifically: this system makes it possible for one design lead and one engineer to maintain a production-grade design system that previously required a team of five.
"We tried AI for design before and it didn't work."	The failure mode you experienced is described in Ch24. Without a spec, without a constraint layer, without a quality gate, AI-generated design produces 70% fidelity with drift on every subsequent run. This book is the constraint layer. The prompts are not magic — they are structured contracts that reduce the AI's creative latitude to the point where mechanical execution is possible.

Appendix B · Glossary

Glossary of terms

The vocabulary of this book in one place. Definitions are scoped to how the term is used here — broader industry meanings may differ.

Term	Definition (as used in this book)
Anti-pattern	A specific generation behaviour the system bans (e.g., gradient-circle icons, floating placeholder labels). Anti-patterns are catalogued per archetype and enforced by Prompts 11 and 12.
Archetype	One of five named system shapes — Harbor, Sentinel, Current, Lattice, Meridian — that encode a coherent set of governance gates, banned patterns, component priorities, and density rules. Picking an archetype narrows the AI's generation space to a known-good region. Specialist archetypes (Current, Lattice, Meridian) are inherited and overridden, not built from scratch.
Artifact	A live rendered HTML/CSS panel inside a Claude conversation. Prompts 04, 05, and 08 produce Artifacts so you can see the output rendered, not just described. See Projects.
BYOC	"Bring Your Own Color" — Prompt 01's mode for reusing an existing brand palette and fonts instead of deriving them from adjectives. Used for rebrands, multi-brand systems, and migration onto existing systems.
Chain	A sequence of prompts that runs end-to-end with handoffs between steps. Each chain has a single goal (e.g., Chain 9: archetype to spec page in an afternoon). Chains live in `kit/chains/`.
chain.md	The orchestrator file at `kit/chain.md`. Run it inside Claude Code to step through a chain with file outputs saved automatically. Alternative: copy individual prompt files ad-hoc and run them in claude.ai.
Drift	The accumulated visual and structural inconsistency that shows up across screens generated in separate sessions. The five drift layers (Ch01) — palette, radius, type, spacing, icon stroke — compound when each is "reasonable but different." Prompt 11 (drift audit) is the detection gate.
Extended Thinking	Claude feature where the model reasons through a problem in visible chain-of-thought before producing the final output. Used for high-constraint prompts (token derivation, spec assembly, Layer 3 extraction) where the cost of a wrong assumption is high.
Figma Dev Mode MCP	Figma's official Model Context Protocol server, which exposes design context (frames, variables, components) to AI tools. Required for Path B (read from Figma) and Chain 8 (write back to Figma).
Governance gate	A non-negotiable rule that the spec encodes and prompts enforce. Gates fail the build, not the design review (e.g., Meridian's `--text-body` floor at 17px). Distinct from preference.
Governance spec	The cross-archetype rules that apply to every system in this book — copy hygiene, accessibility floors, banned patterns common to all. Pasted into Prompt 01 alongside the archetype spec.
Layer 1 / Layer 2 / Layer 3	The three-layer token architecture (Ch07). Layer 1 = primitives (raw values: hex, px, scale steps). Layer 2 = semantic aliases (intent: `--color-primary`, `--radius-md`). Layer 3 = component-level tokens extracted after components stabilise (`--button-bg`, `--card-padding`). Components reference Layer 2 or Layer 3, never Layer 1.
MCP	Model Context Protocol — Anthropic-defined open protocol for connecting AI tools to data sources. The Figma Dev Mode MCP and Storybook MCP are the two used in this book.
Path A / Path B / Path C	The three workflows (Ch00). Path A = no Figma, generate from spec + tokens (greenfield). Path B = read from Figma via MCP, generate code (existing Figma source of truth). Path C = bidirectional, code and Figma stay in sync (mature systems).
Project (Claude Project)	Claude.ai feature where you upload reusable context (governance spec, archetype spec, tokens.css) once and every conversation in that Project inherits it. Eliminates the repaste-the-spec ritual.
Prompt (kit prompt)	A self-contained, copy-paste prompt file in `kit/prompts/`. Each prompt has a meta-header (preconditions, branching questions), a body (the work), and a meta-footer (auto-save, handoff). The kit ships 28 atomic prompts plus inline teaching prompts throughout the book.
Spec page	The rendered HTML reference page produced by Prompt 08, showing every token, component, state, and accessibility verification. Lives at `preview/specs/` and is canonical for engineering reference.
Token	A named CSS custom property in `tokens.css` that components consume. Tokens carry meaning (`--color-primary`) and absorb change (rebrand updates the token, components stay untouched). See Layer 1 / 2 / 3.
Workflow A / B / C	Three working modes within a path (Ch00). Workflow A = generate-then-review. Workflow B = co-design (request, review, iterate). Workflow C = critique-loop on existing output. Workflow C is required for any output that ships.

Appendix C · References

References & further reading

The standards, specifications, and prior art this book builds on. Where the book's claims rest on external sources, this is where they're traced.

Standards & specifications

WCAG 2.1 AA / 2.2. Web Content Accessibility Guidelines. The contrast ratios (4.5:1 body, 3:1 large text and UI), focus-visible requirements, and keyboard reachability rules in this book derive from WCAG 2.1 AA. w3.org/TR/WCAG22
ARIA 1.2. Accessible Rich Internet Applications spec. The role / state / property requirements for interactive components (Button, Input, Combobox, Dialog) follow ARIA 1.2 patterns. w3.org/TR/wai-aria-1.2
ARIA Authoring Practices Guide (APG). Reference implementations for common interaction patterns. Where the book references "the standard pattern for X," the APG is the source. w3.org/WAI/ARIA/apg
Model Context Protocol (MCP). Open protocol for connecting AI assistants to data sources, published by Anthropic. Defines how Figma Dev Mode and Storybook expose design context to Claude. modelcontextprotocol.io
W3C Design Tokens Format Module (DTCG). Emerging standard for design token interchange. The book's Layer 1 / 2 / 3 architecture is compatible with DTCG semantics; export tooling is out of scope but planned for v2.x. w3.org/community/design-tokens
OKLCH color space. Perceptually uniform color space used by Prompt 02 for ramp generation. w3.org/TR/css-color-4 §OKLab/OKLCH

Tools & platforms referenced

Figma Dev Mode & Code Connect. The official designer-developer handoff surface; required for Paths B and C. help.figma.com — Dev Mode
Claude (Anthropic). The model family these prompts were authored against — Opus 4.7, Sonnet 4.6, Haiku 4.5 (current as of 2026-05). claude.com
Claude Projects & Artifacts. See Ch17. Artifacts render HTML/CSS in-thread; Projects persist context across conversations. docs.claude.com
Storybook. Component workshop and visual reference. Used as the visual regression target in Ch23 (CI gates) and the docs surface in Ch20. storybook.js.org

Prior art on design systems

Brad Frost, Atomic Design (2016). The atoms-molecules-organisms framing remains the most cited mental model for component decomposition; this book's component spec (Ch12) is aligned with that shape. atomicdesign.bradfrost.com
Nathan Curtis, EightShapes design-tokens writing. The three-layer token model (primitive / semantic / component) traces to Curtis's mid-2010s articles on naming and aliasing. medium.com/eightshapes-llc
Salesforce Lightning Design System. One of the earliest publicly-documented enterprise systems with explicit governance and token architecture. lightningdesignsystem.com
Material Design. Google's system; the source of the typography-scale and density-tier framing this book adapts (and selectively diverges from). m3.material.io
Polaris (Shopify), Carbon (IBM), Spectrum (Adobe), Primer (GitHub). Public enterprise systems that informed the Sentinel archetype's component priority order and governance posture. Each has open documentation worth reading.

Industry research on AI & design consistency

The "70% fidelity from description / 95–98% with spec + MCP + tokens" range is a working estimate based on the author's project work, not a published study. Independent, peer-reviewed numbers on AI design fidelity are not yet established as of 2026-05; readers should calibrate locally before quoting these as benchmarks.
Anthropic's published guidance on prompt engineering, structured outputs, and tool use informs the prompt format used in kit/prompts/. docs.claude.com — prompt engineering

External links above were correct as of 2026-05-02. URLs change. Where a link 404s, search the title — the canonical content typically remains discoverable.

What ships with this book

Five archetypes — pick the one that matches your product

How the system governs itself

Who this is for

How to use this book

About the author

The 2026 AI design landscape

Three paradigms

Figma Native AI

Claude Design

Figma × MCP × Claude

The substrate that binds them

What the spec is, concretely

Three workflows, three paths

Run the system

Specs · prompts · chains

Two views of the prompt system

What this book does not cover

The AI design consistency crisis

The drift, in five layers

Why drift happens

Specification ≠ artifact

The fix shape — five self-actions

Your role vs AI's role

Who this book is for

Design system leads brought into AI integration

Senior designers crossing into engineering

Front-end engineers who own the design system

Founders & PMs without a design org

What this book gives you

What's next

Five archetypes of design systems

The five — at a glance

Harbor

Sentinel

Current

Lattice *

Meridian

Twelve dimensions, opposite choices

Adjectives: narrowing, not determining

Adjective dictionary

The archetype-from-adjectives prompt

The cascade — generation with checkpoints

Picking the archetype — a three-question tree

Archetype and palette are separable

What's locked vs what's yours

Failure modes — forcing the wrong archetype

What's next

Token architecture

Three layers — primitives, semantic aliases, components

Why the layering matters

Where the specs come from

Pick your archetype's spec

Governance — paste this FIRST, every time

The two prompts

Prompt 1 — Token generation (with BYOC built in)

BYOC — Bring Your Own Colors

Prompt 2 — Typography scale (with brand fonts if noted)

What you get back

Layer 3 emerges later, not now

What's next

Colors & WCAG

WCAG fundamentals — the four numbers that matter

Five archetypes, five color strategies

Ramp generation — oklch interpolation from a seed

Why oklch, not HSL or RGB

The Accent-CTA rule — Sentinel's keystone

Triple encoding — color is never the only signal

Dark mode as token swap

Failure modes — what Prompt 1b flags back

The contrast table is the deliverable

What's next

Figma Variables

Variables vs CSS custom properties — the differences that matter

Three collections, three jobs

Modes — the theme mechanism

How the MCP reads Figma Variables (Path B · Standard)

What MCP covers — and what it doesn't

When you need sync outside the AI workflow

Source-of-truth decision

Lattice ^*