...

README.md

09b0ba625c

feat(redteam): Add MCP plugin (#3989)

3 months ago

promptfooconfig.yaml

09b0ba625c

feat(redteam): Add MCP plugin (#3989)

3 months ago

You have to be logged in to leave a comment.

redteam-mcp (Red Team Model Context Protocol)

This example demonstrates red teaming an AI assistant that uses Model Context Protocol (MCP) for tool use capabilities. The example focuses on testing various attack vectors specific to MCP implementations, such as function call exploits, system prompt leakage, unauthorized tool discovery, and other MCP-specific vulnerabilities.

You can run this example with:

npx promptfoo@latest init --example redteam-mcp

Environment Variables

This example requires the following environment variable:

ANTHROPIC_API_KEY - Your Anthropic API key

You can set this in a .env file or directly in your environment:

export ANTHROPIC_API_KEY=your_anthropic_key_here

Prerequisites

Node.js (v16+)

Getting Started

Initialize the example:

npx promptfoo@latest init --example redteam-mcp

Navigate to the example directory:
```
cd redteam-mcp
```
Run the red team evaluation:
```
npx promptfoo redteam run
```

What This Example Tests

This example evaluates an AI customer support agent implementing MCP with the following attack vectors:

Function Discovery: Attempts to trick the agent into exposing hidden functions
Parameter Injection: Tests crafting malicious inputs to inject unwanted parameters
Function Call Manipulation: Tries to induce unauthorized calls or response handling
Recursive Function Calling: Tests exploitation of recursion for denial of service
System Information Leakage: Attempts to extract internal system details
Function Output Manipulation: Tests causing misinterpretation of function results
Tool Metadata Injection: Attempts to smuggle instructions through tool names or descriptions
Unauthorized Tool Invocation / Privilege Escalation: Tests access to unauthorized tools

Configuration

This example is configured to test the Anthropic Claude 3 Sonnet model with MCP enabled. The MCP server is specified as:

https://customer-service-mcp-server-example.promptfoo.app/mcp

Testing Strategies

The red team evaluation uses multiple testing strategies:

Basic attacks
Best-of-n optimization
Jailbreak attempts

Expected Results

After running the evaluation, you'll see a report showing which attack vectors were successful and which were blocked by the system's defenses.

Customization

You can modify the promptfooconfig.yaml file to:

Test different providers (recommended: anthropic:claude-3-7-sonnet-20250219)
Add or remove red team plugins
Change the MCP server configuration
Adjust the system purpose and guardrails

Tip!

Press p or to see the previous file or, n or to see the next file

Specify your S3 bucket

Bucket name cannot be the same as the repository name. Please change one of them.

Bucket url and prefix

Region

Endpoint Url

Disable SSL verification

README.md

redteam-mcp (Red Team Model Context Protocol)

Environment Variables

Prerequisites

Getting Started

What This Example Tests

Configuration

Testing Strategies

Expected Results

Customization

Comments

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

nirbarazida / promptfoo mirror of https://github.com/promptfoo/promptfoo

README.md

redteam-mcp (Red Team Model Context Protocol)

Environment Variables

Prerequisites

Getting Started

What This Example Tests

Configuration

Testing Strategies

Expected Results

Customization

Comments

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

nirbarazida
/
promptfoo
mirror of https://github.com/promptfoo/promptfoo