site-voice-packs | Sihyeon Jeon

Reference-site context test for agent-written web pages; useful structure came with source-site product names; site-voice-packs separates page structure and copy rhythm, then scores visible HTML output

Record

Problem

Reference sites gave useful structure but leaked source-site product names

Bottleneck

Keep structure and rhythm without copying spans or importing the source business

Fix

Split SITE and VOICE files, add source-term boundaries, score visible HTML output

Result

Stripe/LedgerFlow web output: visible HTML score 63.2 to 92.1; copy-overlap risk 0.0

Guardrail

Copy safety and claim safety remain gates

Side-by-side visible HTML comparison for the same LedgerFlow prompt — Same web page prompt; left without context, right with SITE.md and VOICE.md

Built

reusable SITE.md and VOICE.md context files
concise CLI/package shape
source-site text leakage checks
visible before/after examples designed for direct agent inspection
HTML output comparison score for generated pages

Signals

without context visible HTML score: 63.2
with SITE.md + VOICE.md visible HTML score: 92.1
score delta: +28.9
copy-overlap risk: 0.0
copy safety: 100.0

visible HTML score

Repo label: webfit. Score from page structure, copy rhythm, copy safety, and claim safety

structure match

Section order, page jobs, density, navigation, and conversion path

copy-overlap risk

Source-copy overlap risk; the HTML score should not reward copied source text

boundary

Pattern only; product nouns and facts must come from the new brief