K=3 adversarial verification cut the false-positive rate from 27.8% to 0.0% (95% CI [11.1, 50.0] to [0, 0]), while recall fell from 100% to 77.8%. The recall drop is real. On the held-out set: 3/3 genuine bugs found, 0 surviving false positives.
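The degenerate [0, 0] interval above is what a resampling-based interval produces when no false positives are observed at all. The sketch below shows this with a percentile bootstrap. It rests on assumptions: the source does not say how the intervals were computed or how large the sample was, so `bootstrap_ci` is a hypothetical helper and the 5-of-18 and 0-of-18 outcome lists are illustrative counts chosen only because 5/18 rounds to 27.8%.

```python
import random


def bootstrap_ci(outcomes, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI for the mean of a list of 0/1 outcomes.

    Hypothetical helper: the source does not state which CI method it used.
    """
    rng = random.Random(seed)
    n = len(outcomes)
    # Resample with replacement and record the false-positive rate each time.
    rates = sorted(sum(rng.choices(outcomes, k=n)) / n for _ in range(n_resamples))
    lo = rates[int((alpha / 2) * n_resamples)]
    hi = rates[int((1 - alpha / 2) * n_resamples) - 1]
    return lo, hi


# Illustrative counts only (assumption): 1 marks a false positive.
before = [1] * 5 + [0] * 13   # 5/18 = 27.8%
after = [0] * 18              # no false positives survive verification

print("before:", bootstrap_ci(before))
print("after: ", bootstrap_ci(after))
```

With every outcome equal to 0, every resample has a rate of 0, so both percentiles collapse to 0. An interval of [0, 0] therefore says that no false positives were observed in the sample, which is why the held-out check is reported alongside it.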
Cost-aware model routing (DeepSeek to Haiku to Sonnet to Opus) plus adversarial multi-agent verification with full tracing. A trace UI that looks like a product. Vendors a shared kernel reused across Aegis and FieldAgent. Fans out finders per file, then K skeptics per finding, under a concurrency cap of 8. About $0.25 total per run. 58 tests, ruff, mypy, CI green. The live cost-routing tier is gated on an Anthropic key: the harness is committed, the number is honest.
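The fan-out described above (finders per file, K skeptics per finding, a shared concurrency cap of 8) can be sketched with `asyncio`. This is a minimal sketch under stated assumptions, not the project's code: `find_bugs` and `skeptic_refutes` are hypothetical stand-ins for model calls, and the rule that a finding survives only if no skeptic refutes it is an assumption, since the source does not state how skeptic votes are combined.

```python
import asyncio
from dataclasses import dataclass

K = 3                # skeptics per finding, as in the text
CONCURRENCY_CAP = 8  # shared cap on in-flight calls, as in the text


@dataclass(frozen=True)
class Finding:
    path: str
    claim: str


async def find_bugs(path: str, source: str) -> list[Finding]:
    """Hypothetical finder: stands in for a model call; flags lines containing 'TODO'."""
    await asyncio.sleep(0)
    return [Finding(path, line.strip()) for line in source.splitlines() if "TODO" in line]


async def skeptic_refutes(finding: Finding, skeptic_id: int) -> bool:
    """Hypothetical skeptic: stands in for a model call; refutes claims containing 'maybe'."""
    await asyncio.sleep(0)
    return "maybe" in finding.claim


async def run_pipeline(files: dict[str, str]) -> list[Finding]:
    sem = asyncio.Semaphore(CONCURRENCY_CAP)

    async def capped(coro):
        # Only leaf calls take the semaphore, so nested gathers cannot deadlock.
        async with sem:
            return await coro

    # Stage 1: one finder task per file.
    per_file = await asyncio.gather(*(capped(find_bugs(p, s)) for p, s in files.items()))
    findings = [f for batch in per_file for f in batch]

    async def survives(finding: Finding) -> bool:
        # Stage 2: K skeptics per finding; assumed rule: any refutation drops it.
        votes = await asyncio.gather(
            *(capped(skeptic_refutes(finding, i)) for i in range(K))
        )
        return not any(votes)

    verdicts = await asyncio.gather(*(survives(f) for f in findings))
    return [f for f, ok in zip(findings, verdicts) if ok]


if __name__ == "__main__":
    demo = {"a.py": "x = 1  # TODO fix overflow\ny = 2", "b.py": "# TODO maybe wrong\n"}
    for finding in asyncio.run(run_pipeline(demo)):
        print(finding.path, "->", finding.claim)
```

A strict "any skeptic refutes" rule trades recall for precision, which matches the direction of the reported numbers; a majority vote would be the natural alternative if the recall loss is too costly.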