Industrial manual QA fails when citations are plausible but not specific
Checks answer paths before they look acceptable in aggregate metrics
The first tests looked acceptable because the retrieved passages were near the right topic
Different from generic answer similarity; can pass a loose score and still be unusable
The fixture checks whether the answer was grounded in the right document, not just a similar paragraph
- Gatepass · warn · clarify · escalate
- Authoritymethodology · guidance · regulation
- Citationgold span hit · first gold rank
- Safetyunsafe operational advice escalates
- NASA reliability-centered maintenance
- OSHA / eCFR hazardous energy control
- NIOSH hazardous energy guidance
- DOE / FEMP operations guidance
Hybrid RRF, v5_t31 internal report
Whether the expected source appears in the first five retrieved chunks
Whether the answer cites the required span, not just a nearby passage
Whether the system chooses pass, warn, clarify, or escalate correctly
Whether the answer relies on the right source family for the question
What retrieval-only review would miss
Full fixture items that still missed retrieval or citation readiness after the hybrid run
Expected source was in top-5, but the used citation still missed the gold span
NASA RCM methodology questions that retrieved nearby material but missed the intended authority span
Dense retrieval reached adjacent support; hybrid restored exact safety support on the safety cluster
Support gaps are exported as a review queue
Support gaps converted into question, missed-check, gold-span, and used-span rows
Top-5 retrieval hit, but exact support still failed
High-severity pump hydraulic-field citation misses
A compact file a document owner can inspect without reading the full report JSON
QA089
Exact eCFR stored-energy definition sat just outside top-5
Same-authority citation specificity, then small lexical exposure residual
Exact span cited; safety cluster moved to 5/5 exact under hybrid
Internal fixture evidence only; not a public benchmark, safety advice, or production compliance claim; source documents are linked and cited, not redistributed