Customer agent
Your real endpoint, with the same auth, tools, and policies it runs in production.
Agents are leaving chat. Approvals, refunds, data access, infrastructure. The security problem moved with them. Kurral catches the failures before production does.
Six stages. Each one feeds the next. Nothing in the report is based on vibes. Every claim ties back to a recorded interaction with your agent.
Your real endpoint, with the same auth, tools, and policies it runs in production.
Multi-turn scenarios probe approval bypass, data exposure, tool misuse, policy failure, and instruction compromise.
Every prompt, response, tool call, proxy event, and execution record is attached to the run.
Deterministic detectors flag concrete failures: unauthorized action, sensitive data exposure, missing approval, unsafe tool execution, policy drift.
Semantic evaluators classify what happened across the full run, not just a single response.
A clear record of what failed, why it matters, what evidence supports it, and what needs to change.
Kurral picks the scenario pack that maps to your agent type, then runs the full set across multi-turn sessions.
Can this agent be pushed into doing something it should not do?
Not a generic security score. A concrete answer about your agent.Kurral captures the run as evidence. The scenario, every turn, the agent response, the tool surface involved, the invocation profile, proxy or SDK observations, and the final execution result.
No vague model judgment. A recorded interaction with your agent.
Concrete events, observed before any interpretation. The report starts here so no finding is up for debate.
missing.approvalloan approved without manager sign-off
tool.outside.policyno out-of-policy tool calls observed
sensitive.data.leakSSN returned in plain response
jailbreak.behavior.shiftagent held instructions across 8 turns
escalation.path.failrefund issued past role threshold
The bridge from adversarial testing to lifecycle management. The first value is finding exploitable agent behavior. The ongoing value is proving whether agent risk is getting better or worse over time.
We use cookies to understand how Kurral is used and to remember your preferences. Details in the privacy policy.