Are you sure you want to delete this access key?
This example demonstrates Claude's "thinking" capability, which allows you to see the model's step-by-step reasoning process before it provides a final answer. The example compares thinking outputs from both the Anthropic API directly and Claude on AWS Bedrock.
You can run this example with:
npx promptfoo@latest init --example claude-thinking
This example requires at least one of the following sets of credentials:
ANTHROPIC_API_KEY
- Your Anthropic API key from console.anthropic.comAWS_ACCESS_KEY_ID
- Your AWS access keyAWS_SECRET_ACCESS_KEY
- Your AWS secret keyaws configure
For AWS Bedrock, you must:
After setting up environment variables:
# From the example directory
promptfoo eval
promptfoo view
This example includes several test cases of increasing complexity:
These test cases are specifically designed to showcase Claude's ability to break down complex problems and show detailed thinking steps.
The thinking feature is enabled by setting special parameters in the provider configuration:
thinking:
type: 'enabled'
budget_tokens: 4096 # Controls how many tokens are allocated for thinking
max_tokens: 8192 # Must be greater than budget_tokens
When enabled, Claude's response will include a "Thinking:" section that shows its reasoning process before the final answer:
Thinking: Let me solve this step by step...
1. First, I'll divide the 8 balls into three groups...
2. In the first weighing, I'll compare groups A and B...
3. Based on the result, I can determine...
Final answer: We need exactly 2 weighings to find the heavier ball.
Press p or to see the previous file or, n or to see the next file
Browsing data directories saved to S3 is possible with DAGsHub. Let's configure your repository to easily display your data in the context of any commit!
promptfoo is now integrated with AWS S3!
Are you sure you want to delete this access key?
Browsing data directories saved to Google Cloud Storage is possible with DAGsHub. Let's configure your repository to easily display your data in the context of any commit!
promptfoo is now integrated with Google Cloud Storage!
Are you sure you want to delete this access key?
Browsing data directories saved to Azure Cloud Storage is possible with DAGsHub. Let's configure your repository to easily display your data in the context of any commit!
promptfoo is now integrated with Azure Cloud Storage!
Are you sure you want to delete this access key?
Browsing data directories saved to S3 compatible storage is possible with DAGsHub. Let's configure your repository to easily display your data in the context of any commit!
promptfoo is now integrated with your S3 compatible storage!
Are you sure you want to delete this access key?