
Adding Test Cases

Create comprehensive test suites by building test cases from scratch or converting real agent interactions into reusable tests.

The Problem

Creating comprehensive test suites manually is time-consuming and often misses edge cases.

When testing AI agents, teams struggle with:

  • ⏱️ Time Consuming: Manually writing dozens of test cases
  • 🔍 Missing Coverage: Hard to think of all possible user inputs
  • 🎯 Real-World Gaps: Test cases don't match actual user behavior
  • 📝 Repetitive Work: Copy-pasting from interaction logs
  • 🔄 Iteration Speed: Slow to expand test coverage
  • 💡 Discovery: Not knowing what to test until users try it

In short: You need a fast way to build comprehensive test suites that reflect real user interactions.

How GenAI Explorer Solves This

GenAI Explorer provides flexible test case creation with:

Dual Creation Modes

  • Create from scratch with manual input
  • Convert real interactions into test cases automatically

Smart Action Filtering

  • Actions automatically filtered by selected topic
  • Multi-select checkbox interface
  • See available actions per topic instantly

Bulk Creation

  • Select multiple interactions at once
  • Create 10+ test cases in seconds
  • Visual feedback for selections

Rich History View

  • See recent agent sessions
  • Review topics, actions, and responses
  • Filter by timestamp and channel
  • Preview full conversations

Seamless Integration

  • One-click "Add Test Case" button
  • Auto-numbering for new cases
  • Immediate save integration
  • Auto-recalculation of matches

Impact: Create test suites 10x faster, improve coverage by 300%, capture real user scenarios automatically.

Overview

The Add Test Case feature provides two powerful ways to create test cases: manual creation for specific scenarios, and automatic creation from real agent interactions for comprehensive coverage.

Accessing the Feature

Location

The "Add Test Case" button is located in the test cases table toolbar (top right).

┌────────────────────────────────────────────────────┐
│ 📋 5 test cases • 3 types      [🔄 Refresh] [+ Add] │
│                                                    │
│ [Test cases table...]                              │
└────────────────────────────────────────────────────┘

Button: Blue button with "+" icon labeled "Add Test Case"

When to Use Each Mode

From Scratch:

  • Testing specific edge cases
  • Creating targeted validation tests
  • Documenting expected behavior
  • Building baseline test cases

From History:

  • Capturing real user scenarios
  • Expanding test coverage quickly
  • Documenting actual agent behavior
  • Creating regression tests

Mode 1: Create from Scratch

Overview

Build a test case manually by specifying the user utterance and expected outcomes.

Fields

1. User Utterance (Required)

What it is: The question or statement the user sends to the agent.

Input: Multiline text field (2 rows)

Example:

What is my account balance?

Tips:

  • Be specific and clear
  • Use natural language
  • Match how users actually speak
  • Vary phrasing for similar tests

2. Expected Topic (Optional)

What it is: The conversational topic you expect the agent to identify.

Input: Dropdown selector

Options: Loaded from your GenAiPlanner configuration

Example:

ServiceTopic
BillingTopic
SupportTopic

Tips:

  • Select a topic to enable action selection
  • Choose the most specific topic
  • Leave blank if topic doesn't matter

3. Expected Actions (Optional)

What it is: The actions you expect the agent to execute.

Input: Multi-select with checkboxes

Behavior:

  • Disabled until topic is selected
  • Filtered to show only actions from selected topic
  • Shows count: "5 action(s) available from 'ServiceTopic'"

Example:

Selected Topic: ServiceTopic

Available Actions:
☑ GetBalance
☑ TransferFunds
☐ CheckAccountLimit
☐ UpdateProfile
☐ ResetPassword

2 actions selected

Tips:

  • Select topic first
  • Choose all actions the agent should execute
  • Order doesn't matter
  • Can select 1+ actions
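
For illustration, here is a minimal TypeScript sketch of this topic-based filtering. The type and function names are assumptions rather than GenAI Explorer's actual code; only the behavior (no topic selected → no actions, otherwise the selected topic's own actions) comes from this page.

// Hypothetical planner shape: each topic lists the actions defined under it.
interface TopicConfig {
  name: string;        // e.g. "ServiceTopic"
  actions: string[];   // e.g. ["GetBalance", "TransferFunds", ...]
}

// Returns the actions selectable for the chosen topic. An empty result keeps
// the multi-select disabled until a topic is picked.
function availableActions(topics: TopicConfig[], selectedTopic?: string): string[] {
  if (!selectedTopic) {
    return [];
  }
  const topic = topics.find((t) => t.name === selectedTopic);
  return topic ? topic.actions : [];
}

// availableActions(plannerTopics, "ServiceTopic")
//   → e.g. ["GetBalance", "TransferFunds", "CheckAccountLimit", ...]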

4. Expected Bot Response (Optional)

What it is: The text response you expect from the agent.

Input: Multiline textarea (3 rows)

Example:

Your balance is $1,234.56. You have available credit of $500.

Tips:

  • Be specific about key information
  • Can be partial or full match
  • Leave blank if response text doesn't matter
  • Focus on factual content

Workflow

Step 1: Click "Add Test Case" button

Step 2: Select "Create from Scratch" tab

Step 3: Fill in fields:
- Utterance (required)
- Topic (optional)
- Actions (optional, needs topic first)
- Response (optional)

Step 4: Click "Add Test Case"

Result: New test case added to table

Complete Example

Input:

Utterance: "I need to transfer $500 to my savings"
Topic: "TransferTopic"
Actions: ["ValidateAccount", "TransferFunds", "SendConfirmation"]
Response: "Successfully transferred $500 to your savings account."

Generated Test Case:

{
  "number": "6",
  "inputs": {
    "utterance": "I need to transfer $500 to my savings"
  },
  "expectation": [
    {
      "name": "topic_sequence_match",
      "expectedValue": "TransferTopic"
    },
    {
      "name": "action_sequence_match",
      "expectedValue": "['ValidateAccount', 'TransferFunds', 'SendConfirmation']"
    },
    {
      "name": "bot_response_rating",
      "expectedValue": "Successfully transferred $500 to your savings account."
    }
  ]
}

Mode 2: Select from History

Overview

Convert real agent interactions into test cases by selecting from recent sessions.

Loading Process

When you switch to the "Select from History" tab:

Step 1: Find Sessions

  • Queries recent sessions for your agent
  • Limited to last 50 sessions
  • Sorted by timestamp (newest first)

Step 2: Load Interactions

  • Gets all interactions from found sessions
  • Limited to last 100 interactions
  • Filters by agent name

Step 3: Load Related Data

  • Input/output messages
  • Action steps
  • Topics
  • Channel information

Step 4: Display

  • Combines all data per interaction
  • Shows rich preview of each
  • Ready for selection

Loading Time: Usually 2-5 seconds depending on data volume.
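
As a rough TypeScript sketch of that four-step pipeline (the api callbacks, types, and field names are hypothetical placeholders; only the step order, the 50-session and 100-interaction caps, and the error messages come from this page):

interface InteractionCard {
  utterance: string;
  topics: string[];
  actions: string[];
  response: string;
  channel: string;
  timestamp: string;
}

// Placeholder data-access layer; the real queries go against the Data Cloud
// objects listed under "Data Sources" below.
interface HistoryApi {
  findSessions(agentName: string, limit: number): Promise<string[]>;
  findInteractions(sessionIds: string[], limit: number): Promise<string[]>;
  loadDetails(interactionId: string): Promise<InteractionCard>;
}

async function loadHistory(api: HistoryApi, agentName: string): Promise<InteractionCard[]> {
  // Step 1: most recent sessions for this agent, newest first, capped at 50.
  const sessionIds = await api.findSessions(agentName, 50);
  if (sessionIds.length === 0) {
    throw new Error(`No sessions found for agent: ${agentName}`);
  }

  // Step 2: interactions across those sessions, capped at 100.
  const interactionIds = await api.findInteractions(sessionIds, 100);
  if (interactionIds.length === 0) {
    throw new Error("No interactions found for recent sessions");
  }

  // Steps 3-4: load messages, steps, topics, and channel per interaction and
  // combine them into the cards shown in the history list.
  return Promise.all(interactionIds.map((id) => api.loadDetails(id)));
}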

Interaction Display

Each interaction card shows:

┌──────────────────────────────────────────────────┐
│ ☐ "What is my account balance?" │
│ │
│ [ServiceTopic] [GetBalance] [CheckLimit] [Chat] │
│ │
│ │ Your current balance is $1,234.56. You have │
│ │ available credit of $500. │
│ │
│ 11/26/2025, 2:30:15 PM │
└──────────────────────────────────────────────────┘

Components:

  1. Checkbox - Click to select/deselect
  2. Utterance - User's question (bold, quoted)
  3. Chips:
    • 🔵 Blue: Topics
    • 🟣 Purple: Actions
    • ⚪ Gray: Channel (Chat, Voice, etc.)
  4. Response Preview - Bot's response (first 150 chars, italic, bordered)
  5. Timestamp - When the interaction occurred

Selection Behavior

Click anywhere on card: Toggles selection

Selected state:

  • ✅ Checkbox checked
  • 🔵 Blue border (2px)
  • 💙 Light blue background

Unselected state:

  • ☐ Checkbox unchecked
  • ⚪ Gray border (1px)
  • ⚪ White background

Counter: "(X selected)" shown at top
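
A small TypeScript sketch of this selection state, assuming selections are tracked as a set of interaction IDs (the class and member names are illustrative, not the actual component code):

class HistorySelection {
  private selected = new Set<string>();

  // Clicking anywhere on a card toggles its interaction id.
  toggle(interactionId: string): void {
    if (this.selected.has(interactionId)) {
      this.selected.delete(interactionId);  // back to gray border, unchecked
    } else {
      this.selected.add(interactionId);     // blue border, checked
    }
  }

  // Drives the "(X selected)" counter and the "Add X Test Case(s)" label.
  get count(): number {
    return this.selected.size;
  }

  // The interactions that will be converted into test cases.
  get ids(): string[] {
    return [...this.selected];
  }
}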

Multi-Select Example

User clicks 3 interactions:

✓ Interaction 1: "What is my balance?"
    Topics: [ServiceTopic]
    Actions: [GetBalance]

✓ Interaction 2: "Transfer $500"
    Topics: [TransferTopic]
    Actions: [ValidateAccount, TransferFunds]

✓ Interaction 3: "Reset my password"
    Topics: [SecurityTopic]
    Actions: [ValidateIdentity, ResetPassword]

User clicks "Add 3 Test Cases"

System creates:
  1. Test case #6: Balance inquiry
  2. Test case #7: Fund transfer
  3. Test case #8: Password reset

Workflow

Step 1: Click "Add Test Case" button

Step 2: Select "Select from History" tab

Step 3: Wait for interactions to load (2-5 seconds)

Step 4: Review available interactions

Step 5: Click to select one or more

Step 6: Click "Add X Test Case(s)"

Result: X new test cases added to table

Benefits

Speed:

  • Create 10+ test cases in under a minute
  • No manual typing required
  • Automatic expectation extraction

Accuracy:

  • Based on real interactions
  • Captures actual topics and actions
  • Preserves real bot responses

Coverage:

  • Tests reflect real user behavior
  • Discovers edge cases you might miss
  • Documents what actually works

Common Workflows

Workflow 1: Quick Start (From Scratch)

Scenario: You're starting a new evaluation and want basic test cases.

Steps:

  1. Click "Add Test Case"
  2. Tab: "Create from Scratch"
  3. Enter 3-5 common utterances
  4. Add topics and actions for each
  5. Click "Add Test Case" for each
  6. Click "Save" in main view

Time: 5-10 minutes for 5 test cases

Result: Baseline test suite ready to run

Workflow 2: Capture Production Behavior

Scenario: Your agent has been running in production, and you want to test actual scenarios.

Steps:

  1. Click "Add Test Case"
  2. Tab: "Select from History"
  3. Review recent interactions
  4. Select all successful ones (5-15)
  5. Click "Add X Test Cases"
  6. Review in table
  7. Click "Save"

Time: 2-3 minutes for 15 test cases

Result: Comprehensive test suite from real data

Workflow 3: Regression Testing

Scenario: You changed your agent and want to ensure it still handles previous scenarios.

Steps:

  1. Before changes: Add test cases from history
  2. Save the evaluation
  3. Make agent changes
  4. Run tests
  5. See what broke
  6. Fix agent or update expectations

Time: 5 minutes for initial setup; seconds for ongoing validation

Result: Confidence that changes didn't break existing behavior

Workflow 4: Edge Case Testing

Scenario: You discovered an edge case and want to add it to your suite.

Steps:

  1. Test the edge case with your agent
  2. Click "Add Test Case"
  3. Tab: "Select from History"
  4. Find the recent interaction
  5. Select it
  6. Click "Add Test Case"
  7. Save

Time: 1 minute

Result: Edge case permanently captured in test suite

Workflow 5: Coverage Expansion

Scenario: You want comprehensive coverage across all topics.

Steps:

  1. Chat with agent covering all topics
  2. Click "Add Test Case"
  3. Tab: "Select from History"
  4. Select one interaction per topic
  5. Click "Add X Test Cases"
  6. Verify coverage in table
  7. Save

Time: 10 minutes (including chatting)

Result: Full topic coverage in test suite

Tips and Best Practices

Creating from Scratch

Do:

  • ✅ Use natural language
  • ✅ Be specific with utterances
  • ✅ Select topic before actions
  • ✅ Focus on key response content

Don't:

  • ❌ Use robotic or formal language
  • ❌ Leave utterance blank
  • ❌ Select actions without a topic
  • ❌ Expect exact response matches

Selecting from History

Do:

  • ✅ Review interactions before selecting
  • ✅ Select diverse scenarios
  • ✅ Check timestamps (use recent data)
  • ✅ Bulk select related interactions

Don't:

  • ❌ Select failed interactions blindly
  • ❌ Ignore the response preview
  • ❌ Select too many at once (review first)
  • ❌ Forget to save after adding

After Adding

Do:

  • ✅ Review new test cases in table
  • ✅ Click "Save" to persist
  • ✅ Run tests to validate
  • ✅ Edit expectations if needed

Don't:

  • ❌ Add too many without saving
  • ❌ Skip validation
  • ❌ Forget about auto-numbering
  • ❌ Ignore session match results

Error Handling

No Sessions Found

Error: "No sessions found for agent: MyAgent"

Cause: No sessions exist for this agent yet.

Solution:

  1. Chat with your agent first
  2. Wait for sessions to sync (~1 minute)
  3. Try again

No Interactions Found

Error: "No interactions found for recent sessions"

Cause: Sessions exist but have no interactions.

Solution:

  1. Ensure agent is properly configured
  2. Check that interactions are being recorded
  3. Verify Data Cloud setup
  4. Contact your admin if the problem persists

Load Failed

Error: "Failed to load interactions: [error message]"

Cause: API error, permission issue, or network problem.

Solution:

  1. Check Salesforce connection
  2. Verify user has read access to Data Cloud
  3. Retry loading
  4. Check browser console for details

Empty Utterance

Alert: "Please enter an utterance"

Cause: The required utterance field was left empty.

Solution:

  1. Enter a user question in the utterance field
  2. Try adding again

No Selection

Alert: "Please select at least one interaction"

Cause: No interactions were selected before adding from history.

Solution:

  1. Click to select one or more interactions
  2. Checkbox should be checked
  3. Try adding again

Technical Details

Test Case Numbering

Automatic:

  • System finds highest existing number
  • Increments by 1 for each new case
  • Example: If you have tests 1-5, new tests are 6, 7, 8, etc.
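
A minimal sketch of that rule in TypeScript, assuming numbers are stored as numeric strings as in the JSON examples on this page:

function nextTestCaseNumber(testCases: { number: string }[]): string {
  // Find the highest existing number (treating non-numeric values as 0)...
  const highest = testCases.reduce(
    (max, tc) => Math.max(max, Number(tc.number) || 0),
    0
  );
  // ...and increment by 1 for the new case.
  return String(highest + 1);
}

// With tests numbered "1" through "5", nextTestCaseNumber(...) returns "6".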

Expectation Formatting

Topic:

{
  "name": "topic_sequence_match",
  "expectedValue": "ServiceTopic"
}

Actions:

{
  "name": "action_sequence_match",
  "expectedValue": "['GetBalance', 'TransferFunds']"
}

Response:

{
  "name": "bot_response_rating",
  "expectedValue": "Your balance is $1,234.56"
}
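
For illustration, a hedged TypeScript sketch of how these entries could be assembled from the optional form fields; a field left blank simply produces no expectation. The expectation names and the serialized action-list string mirror the JSON above, while the function itself is an assumption:

function buildExpectations(topic?: string, actions?: string[], response?: string) {
  const expectation: { name: string; expectedValue: string }[] = [];

  if (topic) {
    expectation.push({ name: "topic_sequence_match", expectedValue: topic });
  }

  if (actions && actions.length > 0) {
    // Actions are serialized as a single string, e.g. "['GetBalance', 'TransferFunds']".
    expectation.push({
      name: "action_sequence_match",
      expectedValue: `[${actions.map((a) => `'${a}'`).join(", ")}]`,
    });
  }

  if (response) {
    expectation.push({ name: "bot_response_rating", expectedValue: response });
  }

  return expectation;
}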

Data Sources

From Scratch:

  • Manual user input only

From History:

  • ssot__AiAgentSession__dlm - Sessions
  • ssot__AiAgentInteraction__dlm - Interactions
  • ssot__AiAgentInteractionMessage__dlm - Messages
  • ssot__AiAgentInteractionStep__dlm - Steps
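
The loader treats these objects as a small hierarchy (session → interaction → messages and steps). The TypeScript shapes below are one way to picture it; the field names and the user/bot roles are assumptions, and only the DMO names come from this page:

interface AgentSession {               // ssot__AiAgentSession__dlm
  id: string;
  interactions: AgentInteraction[];    // ssot__AiAgentInteraction__dlm
}

interface AgentInteraction {
  id: string;
  messages: InteractionMessage[];      // ssot__AiAgentInteractionMessage__dlm
  steps: InteractionStep[];            // ssot__AiAgentInteractionStep__dlm
}

interface InteractionMessage {
  role: "user" | "bot";                // input vs. output message
  text: string;
}

interface InteractionStep {
  action: string;                      // e.g. "GetBalance"
}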

Query Limits

  • Sessions: Last 50
  • Interactions: Last 100
  • Messages: All for selected interactions
  • Steps: All for selected interactions

Why limits?

  • Performance (faster loading)
  • Relevance (recent data)
  • UX (manageable list size)

Keyboard Shortcuts

Coming soon:

  • Ctrl/Cmd + K - Open add dialog
  • Ctrl/Cmd + Enter - Add test case
  • Space - Toggle selection (history mode)
  • Esc - Close dialog

FAQ

Q: Can I add test cases in bulk from scratch?

A: Not directly in one action, but you can quickly add multiple by clicking "Add Test Case" after each entry. For bulk, use the history mode.

Q: How many interactions can I select at once?

A: There's no hard limit, but we recommend reviewing them first. Selecting 10-20 at a time is typical.

Q: Do I need to save after adding?

A: Yes! Click "Save" in the main view to persist your changes to Salesforce. The "isModified" indicator will show if you have unsaved changes.

Q: Can I edit test cases after adding?

A: Yes, use the ✏️ Edit button on each test case row. See Test Case Editing.

Q: What if the history is empty?

A: Chat with your agent first to create interactions, wait 1-2 minutes for data sync, then try loading history again.

Q: Can I filter or search history?

A: Not yet, but it's on the roadmap. Currently shows most recent 100 interactions sorted by timestamp.

Q: What happens if I close the dialog without adding?

A: All selections and inputs are discarded. Nothing is added to your test suite.

Q: Can I see the full conversation for an interaction?

A: Currently shows a preview. Full conversation view is planned for a future release.

Q: How do I know which interactions are good test cases?

A: Look for:

  • ✅ Successful interactions (not errors)
  • ✅ Clear user intent
  • ✅ Representative of real use
  • ✅ Diverse topics and actions

Q: Can I import test cases from a file?

A: Not yet, but CSV/JSON import is on the roadmap.

Next Steps


Fast test case creation leads to comprehensive coverage and confident deployments!