Test Case Editing
Edit and refine your AI agent test cases with an intuitive interface that shows actual results alongside expected values, making it easy to update expectations and validate agent behavior.
The Problem
Testing AI agents requires constant refinement of test cases based on actual agent behavior.
When building and testing AI agents, teams face challenges:
- Iterative Refinement: Test expectations need frequent updates as agents evolve
- Comparison Difficulty: Expected and actual results are hard to compare side by side
- Action Discovery: Finding which actions are available for a specific topic
- Input Variations: Testing different phrasings of the same question
- Manual Verification: Verifying each test result by hand is time-consuming
- Status Updates: Tests must be re-run to see whether a change fixed an issue
In short: You need a streamlined way to edit test cases while comparing actual agent behavior with expected outcomes.
How GenAI Explorer Solves This
GenAI Explorer provides comprehensive test case editing with:
✅ Side-by-Side Comparison: See actual vs expected values at a glance
- Actual results on the LEFT (what the agent did)
- Expected values on the RIGHT (what you want)
- Natural left-to-right reading flow
✅ Smart Action Filtering: Topic-aware action selection
- Multi-select checkboxes for actions
- Automatically filtered by selected topic
- Shows only relevant actions for the topic
✅ Full Editability: Edit both inputs and outputs
- User utterance (question)
- Expected topic
- Expected actions (multi-select)
- Expected bot response
✅ Auto-Recalculation: Instant feedback
- Test status recalculates after saving
- See pass/fail immediately
- No need to manually re-run tests
✅ Visual Feedback: Color-coded comparison
- 🟢 Green: Actual values (from agent)
- 🔵 Blue/Orange: Expected values (editable)
- Clear visual distinction between the two
Impact: Reduce test maintenance time by 70%, catch issues earlier with side-by-side comparison, and refine test cases 3x faster with smart action filtering.
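The topic-aware filtering behind Smart Action Filtering can be pictured with a minimal sketch. It assumes each action records the topic it belongs to; the `AgentAction` shape, the function names, and the `ResetPassword`/`Security` sample entry are hypothetical and not taken from GenAI Explorer itself.

```typescript
// Hypothetical data shape: the real GenAI Explorer model is not shown in this document.
interface AgentAction {
  name: string;  // e.g. "GetBalance"
  topic: string; // topic the action belongs to, e.g. "Service"
}

// Keep only the actions that belong to the selected topic.
function actionsForTopic(all: AgentAction[], selectedTopic: string): AgentAction[] {
  return all.filter((action) => action.topic === selectedTopic);
}

// Drop previously ticked actions that are not offered for the current topic.
function pruneSelection(selected: string[], available: AgentAction[]): string[] {
  const allowed = new Set(available.map((action) => action.name));
  return selected.filter((name) => allowed.has(name));
}

// Sample catalog (illustrative values only).
const catalog: AgentAction[] = [
  { name: "GetBalance", topic: "Service" },
  { name: "TransferFunds", topic: "Service" },
  { name: "CheckLimit", topic: "Service" },
  { name: "ResetPassword", topic: "Security" },
];

const serviceActions = actionsForTopic(catalog, "Service");
const keptSelection = pruneSelection(["GetBalance", "ResetPassword"], serviceActions);

console.log(serviceActions.map((action) => action.name));
console.log(keptSelection);
```

Pruning the selection when the topic changes is one possible design choice; it avoids saving an expected action that the chosen topic cannot trigger.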
Overview
The Test Case Editing interface allows you to modify test expectations while viewing actual agent results side-by-side. This makes it easy to update test cases when agent behavior changes or when you need to refine your expectations.
Accessing the Test Case Editor

From the Test Cases Table
- Navigate to your Testing > Test Cases record
- View the test cases
- Click the ✏️ Edit button in the "Actions" column for any test case
Test Cases Table Features
The table shows all your test cases with:
- Session Match: Whether a matching agent session was found
- Test Result: Overall pass/fail status (Success, Partial, Error, No Match)
- Input Utterance: The user's question
- Topics: Expected vs actual topics comparison
- Actions: Expected vs actual actions comparison
- Response: Expected vs actual bot response comparison
Color Indicators:
- 🟢 Green checkmark: Match/Success
- 🔴 Red X: Mismatch/Failure
- ⚠️ Yellow warning: Partial match
- ⚪ Gray: No session match
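As a rough illustration of how a Test Result could be recalculated after an edit, the sketch below derives one of the four statuses listed above from the topic, action, and response comparisons. The rules are assumptions rather than GenAI Explorer's documented logic: all three comparisons matching gives Success, some matching gives Partial, none gives Error, and a missing session gives No Match. The exact-match response comparison and all type and function names are likewise hypothetical.

```typescript
type TestResult = "Success" | "Partial" | "Error" | "No Match";

// Hypothetical shape shared by the expected values and the observed session.
interface Observation {
  topic: string;
  actions: string[];
  response: string;
}

// Order-insensitive comparison of two action lists.
function sameActions(a: string[], b: string[]): boolean {
  const x = [...a].sort();
  const y = [...b].sort();
  return x.length === y.length && x.every((value, i) => value === y[i]);
}

// Assumed rules: no session -> No Match; all checks pass -> Success;
// some pass -> Partial; none pass -> Error.
function recalculate(expected: Observation, actual: Observation | null): TestResult {
  if (actual === null) return "No Match";
  const checks = [
    expected.topic === actual.topic,
    sameActions(expected.actions, actual.actions),
    expected.response.trim() === actual.response.trim(),
  ];
  const passed = checks.filter(Boolean).length;
  if (passed === checks.length) return "Success";
  return passed > 0 ? "Partial" : "Error";
}

const expectedCase: Observation = {
  topic: "ServiceTopic",
  actions: ["GetBalance", "CheckLimit"],
  response: "Your balance is $1,234.56",
};
const observedCase: Observation = { ...expectedCase, actions: ["CheckLimit", "GetBalance"] };

console.log(recalculate(expectedCase, observedCase)); // "Success"
console.log(recalculate(expectedCase, null));         // "No Match"
```

A real implementation would probably compare bot responses more leniently than exact string equality, since agent wording tends to vary between runs.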
Edit Dialog Layout
When you click Edit, you'll see a comprehensive dialog with all test case details:
┌─────────────────────────────────────────────────┐
│ Edit Test Case #1                               │
│ "What is my account balance?" [Chip]            │
├─────────────────────────────────────────────────┤
│                                                 │
│ Input Utterance (User Question) - EDITABLE      │
│ ┌─────────────────────────────────────────────┐ │
│ │ What is my account balance?                 │ │
│ └─────────────────────────────────────────────┘ │
│                                                 │
│ ─────────────────────────────────────────────── │
│                                                 │
│ Topics                                          │
│ ┌──────────────────┬──────────────────────────┐ │
│ │ Actual ← LEFT    │ Expected → RIGHT         │ │
│ ├──────────────────┼──────────────────────────┤ │
│ │ [ServiceTopic]   │ [ServiceTopic ▼]         │ │
│ │ (Green chip)     │ (Dropdown)               │ │
│ └──────────────────┴──────────────────────────┘ │
│                                                 │
│ Actions                                         │
│ ┌──────────────────┬──────────────────────────┐ │
│ │ Actual ← LEFT    │ Expected → RIGHT         │ │
│ ├──────────────────┼──────────────────────────┤ │
│ │ [GetBalance]     │ ┌──────────────────────┐ │ │
│ │ [CheckLimit]     │ │ ☑ GetBalance         │ │ │
│ │ (Green chips)    │ │ ☐ TransferFunds      │ │ │
│ │                  │ │ ☑ CheckLimit         │ │ │
│ │                  │ └──────────────────────┘ │ │
│ │                  │ 5 actions from "Service" │ │
│ └──────────────────┴──────────────────────────┘ │
│                                                 │
│ Bot Response - EDITABLE                         │
│ ┌──────────────────┬──────────────────────────┐ │
│ │ Actual ← LEFT    │ Expected → RIGHT         │ │
│ ├──────────────────┼──────────────────────────┤ │
│ │ ┌──────────────┐ │ ┌──────────────────────┐ │ │
│ │ │ Your balance │ │ │ Your balance is      │ │ │
│ │ │ is $1,234.56 │ │ │ $1,234.56            │ │ │
│ │ └──────────────┘ │ │ (Editable textarea)  │ │ │
│ │                  │ └──────────────────────┘ │ │
│ └──────────────────┴──────────────────────────┘ │
│                                                 │
│ Changes saved locally. Click Save to update     │
│                                                 │
├─────────────────────────────────────────────────┤
│                        [Cancel]  [Save Changes] │
└─────────────────────────────────────────────────┘
Editing Features
1. Input Utterance (User Question) ✨
What it is: The question or statement the user sends to the agent.
Why edit it:
- Test different phrasings of the same question
- Fix typos in test cases
- Refine question clarity
- Add more context or details
How to edit:
- Click in the "Input Utterance" text field
- Type your new question
- The change will be saved when you click "Save Changes"
Example:
Before: "What is my account balance?"
After: "Show me my current account balance please"
Use Case: Testing if the agent handles polite variations differently.
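One way to think about phrasing variations is as copies of a test case that share the same expectations and differ only in the utterance. The sketch below assumes a hypothetical `TestCase` shape built from the four editable fields described on this page; it is an illustration, not the tool's actual data model.

```typescript
// Hypothetical shape based on the editable fields: utterance, topic, actions, response.
interface TestCase {
  utterance: string;
  expectedTopic: string;
  expectedActions: string[];
  expectedResponse: string;
}

// Produce one test case per phrasing, leaving the base case untouched.
function withPhrasings(base: TestCase, phrasings: string[]): TestCase[] {
  return phrasings.map((utterance) => ({
    ...base,
    utterance,
    expectedActions: [...base.expectedActions], // copy so variants do not share the array
  }));
}

const baseCase: TestCase = {
  utterance: "What is my account balance?",
  expectedTopic: "ServiceTopic",
  expectedActions: ["GetBalance"],
  expectedResponse: "Your balance is $1,234.56",
};

const variants = withPhrasings(baseCase, [
  "Show me my current account balance please",
  "How much money do I have?",
]);

console.log(variants.map((v) => v.utterance));
```

If the agent routes a polite or informal variant to a different topic, that variant fails while the original still passes, which isolates the phrasing as the cause.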
2. Expected Topic
What it is: The topic (conversation intent) you expect the agent to identify.
Why edit it:
- Update when topic names change
- Fix mismatched expectations
- Align with agent's actual behavior
How to edit:
- Click the topic dropdown (Expected side)
- Select from available topics
- Topics are loaded from your GenAiPlanner configuration
- Supports both developer names and labels
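Accepting both developer names and labels could work roughly as in the following sketch. The `TopicOption` shape, the `resolveTopic` helper, the sample topics, and the case-insensitive matching are all assumptions made for illustration; the actual lookup against the GenAiPlanner configuration may differ.

```typescript
// Hypothetical topic entry as it might be loaded from configuration.
interface TopicOption {
  developerName: string; // API-style name, e.g. "ServiceTopic"
  label: string;         // human-readable label, e.g. "Service"
}

// Find a topic by developer name or by label (case-insensitive by assumption).
function resolveTopic(options: TopicOption[], value: string): TopicOption | undefined {
  const needle = value.trim().toLowerCase();
  return options.find(
    (topic) =>
      topic.developerName.toLowerCase() === needle ||
      topic.label.toLowerCase() === needle
  );
}

// Sample options (illustrative values only).
const topicOptions: TopicOption[] = [
  { developerName: "ServiceTopic", label: "Service" },
  { developerName: "SecurityTopic", label: "Security" },
];

console.log(resolveTopic(topicOptions, "ServiceTopic")?.label);
console.log(resolveTopic(topicOptions, "service")?.developerName);
console.log(resolveTopic(topicOptions, "Billing"));
```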
Visual:
┌──────────────────┬──────────────────────┐
│ Actual           │ Expected             │