Measure Twice: Exploring the Evaluation of Agentic Security Detection Systems2026-03-2230 minOn applying scientific skepticism and rigor to the measurement of agentic security systems.researchaidetectionazure