Capability

Step-level diagnosis

Converra traces failures to the exact agent and exact turn in multi-agent conversations, with root cause classification and per-step scoring. Then it fixes them automatically.

Conversation #4091 — SDR Agent

Score: 12
Aggressive volume-only disqualification threshold in prompt
Step 1 · User

"We use smart badges for our events but need a less expensive alternative. Must share contact details between exhibitors and delegates. Sustainability is important."

Step 2 · Agent

Asks about event volume: how many events are planned in the next 12 months. A good qualifying question.

Step 3 · User

"2 conferences, about 200 attendees each."

Step 4 · Agent · Root cause: Prompt Issue

"Based on your current event volume, we may not be the best fit." Redirected to a community page. Prospect dismissed.

Disqualified on attendee count alone — ignored product-fit signals (smart badges, contact sharing, sustainability).
Intent 25 · Relevance 15 · Context 20 · Tool Use 30

Three levels of diagnosis granularity

Conversation-level

"This conversation failed"

You know something went wrong but not where or why. You read the full transcript and guess.

Most observability tools, basic eval frameworks

Turn-level

"The agent's response at step 3 was bad"

Better — you know which response was wrong. But you still don't know if it was a prompt issue, model issue, or context issue.

Some eval platforms with per-turn scoring

Step-level (Converra)

"Step 3 failed because the agent ignored buying signals already stated — this is a prompt issue in the intent-matching instructions"

Root cause identified, fix can be generated automatically

Converra

Root cause classification

Every diagnosed step is classified by root cause type — so the fix targets the actual problem, not a symptom.

~85%

Prompt Issue

Instructions, goals, routing logic, or guardrails in the system prompt. Fixable automatically.

~3%

Model Mismatch

The model isn't suited for the task. Too slow, too expensive, or not capable enough for the required reasoning.

~2%

Config Error

Guardrail thresholds, temperature settings, or tool configurations that suppress the right behavior.

~10%

Code / Orchestration

Handoff logic, state management, or API integration issues. Diagnosed to the exact point — your team fixes in hours, not weeks.

Based on root cause analysis across 103 real production conversations.
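The four categories above can be expressed as a simple classification record. This is an illustrative sketch only: the enum values, field names, and `StepDiagnosis` type are hypothetical, not Converra's actual schema.

```python
from dataclasses import dataclass
from enum import Enum

class RootCause(Enum):
    """Hypothetical taxonomy mirroring the four categories above."""
    PROMPT_ISSUE = "prompt_issue"          # ~85% of observed failures
    MODEL_MISMATCH = "model_mismatch"      # ~3%
    CONFIG_ERROR = "config_error"          # ~2%
    ORCHESTRATION = "code_orchestration"   # ~10%

@dataclass
class StepDiagnosis:
    """One diagnosed failure: the exact agent, step, and classified cause."""
    conversation_id: int
    agent: str
    step: int
    root_cause: RootCause
    summary: str

# The SDR example from earlier, expressed as a diagnosis record:
diag = StepDiagnosis(
    conversation_id=4091,
    agent="SDR Agent",
    step=4,
    root_cause=RootCause.PROMPT_ISSUE,
    summary="Aggressive volume-only disqualification threshold in prompt",
)
```

Because the cause is a closed enum rather than free text, each category can route to a different remediation path (automatic prompt fix vs. a ticket for the engineering team).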

Per-step scoring

Each agent response is scored on four dimensions. Low scores on a specific metric at a specific step tell you exactly what to fix.

Intent Recognition: Did the agent understand what the user was trying to accomplish?
Relevance: Was the response appropriate for this specific point in the conversation?
Context Utilization: Did the agent use information from earlier turns?
Tool Use: Did the agent call the right tools at the right time?
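A minimal sketch of how per-step scores point at the capability to fix, assuming a 0-100 scale per dimension. The dimension keys and the `weakest_dimension` helper are illustrative, not Converra's API.

```python
# Dimension names follow the four metrics above.
DIMENSIONS = ("intent_recognition", "relevance", "context_utilization", "tool_use")

def weakest_dimension(step_scores: dict) -> tuple:
    """Return the lowest-scoring dimension for one step, with its score."""
    dim = min(DIMENSIONS, key=lambda d: step_scores[d])
    return dim, step_scores[dim]

# Step 4 of the SDR example: disqualified on attendee count alone.
step4 = {
    "intent_recognition": 25,
    "relevance": 15,
    "context_utilization": 20,
    "tool_use": 30,
}
print(weakest_dimension(step4))  # ('relevance', 15)
```

Here the lowest score (relevance, 15) flags that the response was inappropriate for this point in the conversation, even though the agent nominally understood the request.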

From diagnosis to fix — automatically

Step-level diagnosis isn't just for reports. It feeds directly into the improvement loop.

1

Diagnose

Exact step + root cause

2

Generate

Targeted fix for that failure

3

Simulate

36+ conversations, head-to-head

4

Verify

Before/after from production
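The four stages above can be outlined as a single control-flow loop. Every function here is a trivial stand-in, not Converra's implementation; only the shape of the loop is the point.

```python
def diagnose(conversation):
    """Stage 1 (stub): find the failing step and its root cause."""
    return {"step": 4, "root_cause": "prompt_issue"}

def generate_fix(prompt, diagnosis):
    """Stage 2 (stub): produce a targeted candidate prompt."""
    return prompt + "\nDo not disqualify on event volume alone."

def simulate(candidate, baseline, runs=36):
    """Stage 3 (stub): run candidate vs. baseline head-to-head."""
    return {"candidate_wins": True, "runs": runs}

def improvement_loop(conversation, prompt):
    diagnosis = diagnose(conversation)
    candidate = generate_fix(prompt, diagnosis)
    results = simulate(candidate, prompt)
    # Stage 4 (verify) would compare before/after production data;
    # here we simply promote the candidate when simulation favours it.
    return candidate if results["candidate_wins"] else prompt

new_prompt = improvement_loop(conversation={}, prompt="You are an SDR agent.")
```

The key property is that the fix in stage 2 is scoped by the diagnosis from stage 1, so the change targets one classified failure rather than the whole prompt.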

Frequently asked questions

What is step-level diagnosis for AI agents?

Step-level diagnosis identifies the exact turn in a multi-turn conversation where an AI agent's behavior caused a failure — and classifies the root cause (prompt issue, model mismatch, config error, or orchestration bug). This is more granular than conversation-level scoring ('this conversation failed') or turn-level scoring ('step 3 was bad'). Step-level diagnosis tells you why step 3 was bad and what type of fix will address it.

Why isn't conversation-level diagnosis enough?

A 5-turn conversation that fails at step 2 and a 5-turn conversation that fails at step 5 need completely different fixes. Conversation-level scoring tells you the conversation failed but not where. Without step-level diagnosis, engineers read full transcripts to find the problem — a process that doesn't scale as conversation volume grows.
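One way to make "where" concrete: scan per-step scores and return the first step whose minimum dimension falls below a threshold. A sketch under assumed conventions (0-100 scores keyed by step number); the helper and threshold are hypothetical.

```python
def first_failing_step(scores_by_step: dict, threshold: int = 50):
    """Return the earliest step with any dimension below the threshold."""
    for step in sorted(scores_by_step):
        if min(scores_by_step[step].values()) < threshold:
            return step
    return None  # no step failed

scores = {
    2: {"intent_recognition": 80, "relevance": 75},
    3: {"intent_recognition": 70, "relevance": 82},
    4: {"intent_recognition": 25, "relevance": 15},  # the failure
}
print(first_failing_step(scores))  # 4
```

A conversation-level score collapses this table to a single number; keeping the per-step breakdown is what lets the failure at step 4 be distinguished from one at step 2.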

How does step-level diagnosis work in multi-agent systems?

In multi-agent systems, Converra traces the conversation across agent boundaries. When a handoff between agents causes a failure, the diagnosis identifies both the handing-off agent and the receiving agent, the specific turn where the handoff broke, and whether the root cause is in the routing logic or the downstream agent's instructions.
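A hedged sketch of what a cross-agent diagnosis might carry: both agents involved, the turn where the handoff broke, and where the cause lives. All field names and values are illustrative, not Converra's schema.

```python
from dataclasses import dataclass

@dataclass
class HandoffDiagnosis:
    """A failure diagnosed at an agent boundary."""
    handing_off_agent: str
    receiving_agent: str
    turn: int              # the specific turn where the handoff broke
    cause_location: str    # "routing_logic" or "downstream_instructions"

# A hypothetical routing failure between two agents:
hd = HandoffDiagnosis(
    handing_off_agent="Router",
    receiving_agent="SDR Agent",
    turn=6,
    cause_location="routing_logic",
)
```

Naming both sides of the handoff matters because the same broken turn can require a fix in either place: the router's dispatch rules or the receiving agent's instructions.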

What metrics are scored at each step?

Each agent response is scored on intent recognition (did it understand the user's goal), relevance (was the response appropriate), context utilization (did it use prior conversation history), and tool use (did it call the right tools). These per-step scores pinpoint exactly which capability failed.

Can I use step-level diagnosis without the full Converra loop?

Yes. Diagnosis is available on its own — you get the exact step, failure mode, and root cause classification. But the real value comes from the full loop: Converra takes that diagnosis, generates a targeted fix, tests it in simulation, and verifies the result from production data.

See diagnosis in action

Connect your agent and see exactly where conversations break — then watch the fix generate, test, and deploy automatically.

Start for free