A straightforward framework for collecting responses from AI agents using predefined questions.
ChatBotChatBot collects data by asking a target agent a set of predefined questions and recording its responses.
Key Benefits:
- No complex setup or AI dependencies required
- Simple question-and-answer data collection
- Fast execution with clear response display
- Easy to create and maintain question lists
- Works with any agent that has a REST API
- YAML Question Suites: Define questions to ask in simple YAML files
- Response Collection: Records what the target agent responds with
- CLI Interface: Easy-to-use command-line interface with real-time display
- Session Tracking: Unique session IDs for each collection run
- Target Agent Agnostic: Works with any REST API endpoint
- No Scoring: Pure data collection without judgment or evaluation
- Python 3.9 or higher
- Target agent with REST API endpoint
- Clone this repository:

  ```bash
  git clone <repository-url>
  cd ChatBotChatBot
  ```
- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```
- Start with a sample test suite:

  ```bash
  python chatbotchatbot.py create-sample
  ```
- List available test suites:

  ```bash
  python chatbotchatbot.py list-suites
  ```
- Run a question suite (requires a running target agent):

  ```bash
  python chatbotchatbot.py run --suite math_basic.yaml --endpoint http://localhost:8000/chat
  ```
This will ask each question and display the responses in real-time.
The repository includes a sample target agent for testing:
```bash
# Terminal 1: Start the sample target agent
cd testTargetAgent
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate
pip install -r requirements.txt
python main.py --port 8001
```

```bash
# Terminal 2: Collect responses from the sample agent
python chatbotchatbot.py run --suite math_basic.yaml --endpoint http://localhost:8001/chat --verbose
```

```bash
# Create a new question suite
cat > test_suites/my_questions.yaml << EOF
name: "My Custom Questions"
description: "Collect responses about my agent's capabilities"
questions:
  - question: "Hello, how are you?"
  - question: "What is your purpose?"
EOF

# Validate the question suite
python chatbotchatbot.py validate --suite-file test_suites/my_questions.yaml

# Collect responses from your agent
python chatbotchatbot.py run --suite my_questions.yaml --endpoint http://your-agent:8000/chat
```
```bash
# List all available question suites
python chatbotchatbot.py list-suites

# Run a question suite against target agent
python chatbotchatbot.py run --suite <SUITE_FILE> --endpoint <URL> [OPTIONS]

# Validate question suite format
python chatbotchatbot.py validate --suite-file <PATH>

# Create sample question suite for reference
python chatbotchatbot.py create-sample [--output <PATH>]
```

```bash
python chatbotchatbot.py run \
  --suite math_basic.yaml \                # Required: question suite file
  --endpoint http://localhost:8000/chat \  # Required: target agent URL
  --api-key sk-xxx \                       # Optional: API authentication
  --auth-type bearer \                     # Optional: none|bearer|api-key|basic
  --timeout 30 \                           # Optional: request timeout (seconds)
  --session-id custom-session \            # Optional: custom session identifier
  --verbose                                # Optional: detailed output
```

Question suites are defined in YAML with this simple structure:
```yaml
name: "Question Suite Name"
description: "What this question suite explores"
questions:
  - question: "Test question to ask the agent"
  - question: "Another test question"
```

Just questions - no expected answers, validation types, or scoring needed!
Your target agent must expose a REST endpoint:
Request Format:

```http
POST /chat
Content-Type: application/json

{
  "message": "user input text"
}
```

Response Format:

```http
200 OK
Content-Type: application/json

{
  "response": "agent response text"
}
```
ChatBotChatBot supports multiple authentication methods:

```bash
# No authentication
--auth-type none

# Bearer token
--auth-type bearer --api-key "your-token"

# API key in header
--auth-type api-key --api-key "your-key"

# Basic authentication
--auth-type basic --api-key "username:password"
```
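The mapping from these flags to HTTP headers lives in `src/api/client.py`; the sketch below shows one plausible mapping. The `X-API-Key` header name and the case handling here are assumptions, not the tool's confirmed behavior:

```python
import base64

def build_auth_headers(auth_type: str, api_key: str = "") -> dict:
    """Hypothetical sketch of mapping --auth-type/--api-key to headers.

    The authoritative logic is in src/api/client.py and may differ.
    """
    if auth_type == "none" or not api_key:
        return {}
    if auth_type == "bearer":
        return {"Authorization": f"Bearer {api_key}"}
    if auth_type == "api-key":
        # Header name is an assumption; many agents expect X-API-Key.
        return {"X-API-Key": api_key}
    if auth_type == "basic":
        # For basic auth the key is passed as "username:password".
        token = base64.b64encode(api_key.encode()).decode()
        return {"Authorization": f"Basic {token}"}
    raise ValueError(f"Unknown auth type: {auth_type}")
```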
The repository includes ready-to-use test suites:

`test_suites/math_basic.yaml`:

```yaml
name: "Basic Math Operations"
description: "Test basic arithmetic capabilities"
questions:
  - question: "What is 2 + 2?"
    acceptable_response: "4"
    validation_type: "exact"
  - question: "Calculate 15 * 3"
    acceptable_response: "45"
    validation_type: "contains"
```

`test_suites/customer_service.yaml`:

```yaml
name: "Customer Service Responses"
description: "Test customer service agent capabilities"
questions:
  - question: "I want to return an item"
    acceptable_response: "help you with that return"
    validation_type: "contains"
  - question: "What is your refund policy?"
    acceptable_response: "30 days"
    validation_type: "contains"
```

`test_suites/general_knowledge.yaml`:

```yaml
name: "General Knowledge Questions"
description: "Test general knowledge and reasoning"
questions:
  - question: "What is the capital of France?"
    acceptable_response: "Paris"
    validation_type: "exact"
  - question: "Who wrote Romeo and Juliet?"
    acceptable_response: "Shakespeare"
    validation_type: "contains"
```

Each test run creates a session with detailed results:
```bash
# View summary
python chatbotchatbot.py results --session-id abc123

# View detailed breakdown
python chatbotchatbot.py results --session-id abc123 --format detailed

# Export as JSON
python chatbotchatbot.py results --session-id abc123 --format json > results.json
```
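The exported JSON can then be processed with any tooling. As an illustration only: the record layout below (a list of objects with `question` and `response` keys) is an assumption, not the tool's documented schema, so check an actual export first:

```python
import json

def summarize_results(path):
    """Print each question/response pair from an exported results file.

    Assumes a list of records with "question" and "response" keys;
    the real export schema may differ.
    """
    with open(path) as f:
        records = json.load(f)
    for r in records:
        print(f"Q: {r['question']}\nA: {r['response']}\n")
    return len(records)
```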
Generate formatted reports for sharing:

```bash
# JSON report for analysis
python chatbotchatbot.py report --session-id abc123 --format json --output report.json

# HTML report for presentation
python chatbotchatbot.py report --session-id abc123 --format html --output report.html
```
```
ChatBotChatBot/
├── README.md                    # This file
├── requirements.txt             # Python dependencies
├── chatbotchatbot.py            # Main CLI entry point
├── src/                         # Source code
│   ├── testing/                 # Core testing functionality
│   │   ├── question_pools.py    # YAML test suite management
│   │   ├── answer_validator.py  # Response validation logic
│   │   └── simple_runner.py     # Sequential test execution
│   ├── api/                     # Target agent communication
│   │   └── client.py            # HTTP client for target agents
│   ├── database/                # Test result storage
│   │   └── schema.py            # SQLite database management
│   ├── cli/                     # Command line interface
│   │   ├── commands.py          # CLI command implementations
│   │   └── interface.py         # Console output formatting
│   └── utils/                   # Shared utilities
│       ├── config.py            # Configuration management
│       └── models.py            # Data models (Pydantic)
├── test_suites/                 # Example test suites
│   ├── math_basic.yaml
│   ├── customer_service.yaml
│   └── general_knowledge.yaml
├── testTargetAgent/             # Sample target agent for testing
└── data/                        # SQLite database storage
```
```bash
# Run all tests
pytest tests/

# Run with coverage
pytest --cov=src tests/

# Run specific test file
pytest tests/unit/test_answer_validator.py -v
```

To add a new validation type:

- Extend `ValidationTypeEnum` in `src/utils/models.py`
- Add validation logic in `AnswerValidator.evaluate_answer()`
- Update documentation
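As a reference point, the two built-in modes used in the sample suites might look like the sketch below. The case-insensitive comparison here is an assumption of this sketch, not necessarily how `AnswerValidator.evaluate_answer()` behaves:

```python
def evaluate_answer(response: str, acceptable: str, validation_type: str) -> bool:
    """Sketch of the "exact" and "contains" validation modes.

    Normalizes whitespace and case before comparing; the authoritative
    implementation is AnswerValidator.evaluate_answer().
    """
    got = response.strip().lower()
    want = acceptable.strip().lower()
    if validation_type == "exact":
        return got == want
    if validation_type == "contains":
        return want in got
    raise ValueError(f"Unknown validation type: {validation_type}")
```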
- Fork the repository
- Create a feature branch
- Add tests for new functionality
- Submit a pull request
- Validate chatbot responses against expected answers
- Regression testing for agent updates
- Consistency checking across different scenarios
- Compare performance across different agent versions
- A/B testing between different implementations
- Performance regression detection
- Integration testing during development
- Automated testing in CI/CD pipelines
- Pre-deployment validation checks
- Test student AI projects against rubrics
- Validate learning outcomes
- Automated grading for AI assignments
"No test suites found"
- Ensure YAML files are in the `test_suites/` directory
- Check that file extensions are `.yaml` or `.yml`
- Run `python chatbotchatbot.py create-sample` to create an example
"Connection failed"
- Verify target agent is running and accessible
- Check the endpoint URL format (include `http://` or `https://`)
- Test with curl first:

  ```bash
  curl -X POST -H "Content-Type: application/json" -d '{"message":"test"}' <endpoint>
  ```
"Test suite validation failed"
- Run `python chatbotchatbot.py validate --suite-file <file>` for details
- Check YAML syntax with an online validator
- Ensure all required fields are present
"Session not found"
- Check session ID spelling
- Use `python chatbotchatbot.py results` without a session ID to see recent sessions
- The database may be empty if no tests have been run
MIT license; see the LICENSE file for details.