instructions: 'You are a helpful research assistant. Use the available MCP tools to search for accurate information about repositories and provide comprehensive answers.'
tests:
- vars:
repo: modelcontextprotocol/modelcontextprotocol
topic: transport protocols
assert:
# Validate MCP tool execution was successful
- type: is-valid-openai-tools-call
weight: 0.3
# Check for specific content in the response
- type: contains
value: 'transport'
weight: 0.2
- type: contains
value: 'protocol'
weight: 0.2
# Ensure MCP tool was actually used (check for tool result)
- type: contains
value: 'MCP Tool Result'
weight: 0.1
# Validate the quality of the response
- type: llm-rubric
value: 'The response mentions transport protocols or MCP specification details'
weight: 0.2
- vars:
repo: facebook/react
topic: hooks
assert:
# Comprehensive MCP validation
- type: is-valid-openai-tools-call
- type: contains
value: 'React'
# Verify MCP integration worked
- type: contains
value: 'MCP Tool Result'
- type: llm-rubric
value: 'The response explains React functionality or features'
- vars:
repo: microsoft/typescript
topic: type system
assert:
# Test both success and content validation
- type: is-valid-openai-tools-call
- type: contains-any
value: ['TypeScript', 'type']
# Ensure no MCP errors occurred
- type: not-contains
value: 'MCP Tool Error'
- type: llm-rubric
value: 'The response describes TypeScript features or type system'
- vars:
repo: openai/openai-python
topic: API client
assert:
# Multi-layered validation approach
- type: is-valid-openai-tools-call
metric: mcp_tool_success
- type: contains-any
value: ['API', 'client', 'Python']
# Check that MCP tools were discovered and used
- type: contains
value: 'MCP Tool Result'
metric: mcp_tool_used
weight: 0
- type: llm-rubric
value: 'The response describes the OpenAI Python client library or API features'
Press p or to see the previous file or,
n or to see the next file
Comments
Integrate AWS S3
Use S3 remote
Select bucket
Access key
Finish
Use AWS S3 as storage!
Browsing data directories saved to S3 is possible with DAGsHub. Let's configure
your repository to easily display your data in the context of any commit!
Specify your S3 bucket
Select Region
af-south-1 - Africa (Cape Town)
ap-northeast-1 - Asia Pacific (Tokyo)
ap-northeast-2 - Asia Pacific (Seoul)
ap-south-1 - Asia Pacific (Mumbai)
ap-southeast-1 - Asia Pacific (Singapore)
ap-southeast-2 - Asia Pacific (Sydney)
ca-central-1 - Canada (Central)
eu-central-1 - EU (Frankfurt)
eu-north-1 - EU (Stockholm)
eu-west-1 - EU (Ireland)
eu-west-2 - EU (London)
eu-west-3 - EU (Paris)
sa-east-1 - South America (São Paulo)
us-east-1 - US East (N. Virginia)
us-east-2 - US East (Ohio)
us-gov-east-1 - US Gov East 1
us-gov-west-1 - US Gov West 1
us-west-1 - US West (N. California)
us-west-2 - US West (Oregon)
Congratulations!
promptfoo is now integrated with AWS S3!
Delete Storage Key
Are you sure you want to delete this access key?
No
Yes
Integrate Google Cloud Storage
Use Google Storage
Select bucket
Upload key
Finish
Use Google Cloud Storage!
Browsing data directories saved to Google Cloud Storage is possible with DAGsHub. Let's configure
your repository to easily display your data in the context of any commit!
Specify your Google Storage bucket
Congratulations!
promptfoo is now integrated with Google Cloud Storage!
Delete Storage Key
Are you sure you want to delete this access key?
No
Yes
Integrate Azure Cloud Storage
Use Azure Storage
Select bucket
Set key
Finish
Use Azure Cloud Storage!
Browsing data directories saved to Azure Cloud Storage is possible with DAGsHub. Let's configure
your repository to easily display your data in the context of any commit!
Specify your Azure Storage bucket
Congratulations!
promptfoo is now integrated with Azure Cloud Storage!
Delete Storage Key
Are you sure you want to delete this access key?
No
Yes
Integrate S3 compatible storage
Use S3 like remote
Select bucket
Access key
Finish
Use any S3 compatible storage!
Browsing data directories saved to S3 compatible storage is possible with DAGsHub. Let's configure
your repository to easily display your data in the context of any commit!
Specify your S3 bucket
Bucket name cannot be the same as the repository name. Please change one of them.
Check this box only if you trust this domain, otherwise your data and credentials might be
stolen by man in the middle or spoofing attacks.
Congratulations!
promptfoo is now integrated with your S3 compatible storage!