Reference-site context test for agent-written web pages; useful structure came with source-site nouns; site-voice-packs separates page structure and copy rhythm, then scores visible HTML output
Record
Reference sites gave useful structure but leaked source-site product nouns
Keep structure and rhythm without copying spans or importing the source business
Split SITE and VOICE files, add source-term boundaries, score visible HTML output
Stripe/LedgerFlow web output: 63.2 to 92.1 webfit; mimic risk 0.0
Copy safety and claim safety remain gates

Built
- reusable SITE.md and VOICE.md context files
- concise CLI/package shape
- source-site contamination guardrails
- visible before/after examples designed for direct agent inspection
- site2voice webfit gate for HTML output comparison
Signals
- without context webfit: 63.2
- with SITE.md + VOICE.md webfit: 92.1
- reference-fit delta: +28.9
- mimic risk: 0.0
- copy safety: 100.0
Visible HTML score from structure fit, voice fit, copy safety, and claim safety
Section order, page jobs, density, navigation, and conversion path
Copy-overlap risk; high webfit should not reward source copying
Pattern only; product nouns and facts must come from the new brief