Readiness

The Readiness page answers the most important question: “Can I ship this AI?”

Purpose

Readiness is the decision surface. It provides:

A clear ship/no-ship signal
Status on all 10 readiness questions
Visibility into what’s failing and why

Important: Readiness is directional, not absolute. It provides defensible confidence, not guarantees.

What You See

Ship Confidence Score

A single number (0-100) representing overall ship-readiness. This is derived from:

Pass rates across all 10 readiness questions
Severity weighting of failures
Coverage completeness

The 10 Readiness Questions

Each question shows:

Current status (Pass / Warn / Fail)
Score (0-100)
Number of scenarios tested

#	Question	What It Checks
1	Intent	Does it do the right thing?
2	Grounding	Is it truthful & grounded?
3	Hallucination	Did it hallucinate?
4	Rules	Did it follow our rules?
5	Safety	Did it avoid harm?
6	Consistency	Is it consistent?
7	Quality	Is it good enough?
8	Robustness	Is it robust to manipulation?
9	Brand Safety	Is it brand-safe?
10	Schema	Is the output structurally valid?

Failing Scenarios

When something fails, you see:

Which scenario failed
The input that triggered the failure
The actual vs expected output
LLM judge reasoning (why it failed)

Feature Map

A list of AI features detected in your codebase, with:

Feature name and location
Number of scenarios covering it
Current pass rate

User Flows

Readiness is central to these flows:

First-Time Setup - See initial status after discovery
Debugging a Failure - Understand what went wrong
Expanding Coverage - Identify gaps to fill
Quarterly Safety Review - Export for leadership

Rulebook - Understand the rules behind the scores
Dashboard - See all projects
CLI: eval - Run evaluations locally

Getting Started

UI Reference

CLI Reference

Concepts

Integration

Configuration

Readiness

Readiness

Purpose

What You See

Ship Confidence Score

The 10 Readiness Questions

Failing Scenarios

Feature Map

User Flows

Getting Started

UI Reference

CLI Reference

Concepts

Integration

Configuration

​Readiness

​Purpose

​What You See

​Ship Confidence Score

​The 10 Readiness Questions

​Failing Scenarios

​Feature Map

​User Flows

​Related

Readiness

Purpose

What You See

Ship Confidence Score

The 10 Readiness Questions

Failing Scenarios

Feature Map

User Flows

Related