
crewai

This example shows how to use CrewAI agents with promptfoo to evaluate AI agent performance.

What is CrewAI?

CrewAI is a framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
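
As a rough illustration, a minimal CrewAI setup looks like the sketch below. The role, goal, and task strings are placeholders for illustration only; the agents defined in this example's agent.py differ:

from crewai import Agent, Crew, Task

# Placeholder strings, for illustration only
researcher = Agent(
    role="Researcher",
    goal="Summarize a topic in two sentences",
    backstory="An analyst who writes concise summaries.",
)

task = Task(
    description="Summarize what promptfoo does.",
    expected_output="A two-sentence summary.",
    agent=researcher,
)

crew = Crew(agents=[researcher], tasks=[task])
print(crew.kickoff())  # runs the task using the agent's LLM (OpenAI by default)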

Quick Start

You can run this example with:

npx promptfoo@latest init --example crewai

Prerequisites

This example requires the following:

  1. Python 3.10+
  2. Node.js 14+
  3. OpenAI API key - you MUST have a valid key to run this example

Environment Setup

You need to set your OpenAI API key. Choose one of these methods:

Option 1: Environment Variable

export OPENAI_API_KEY=your-api-key-here

Option 2: .env File

Create a .env file in this directory:

OPENAI_API_KEY=your-api-key-here

If using a .env file, uncomment python-dotenv in requirements.txt and reinstall dependencies.
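
With python-dotenv enabled, a two-line snippet near the top of agent.py is enough to pick up the key (a sketch; the example's actual code may already do this):

from dotenv import load_dotenv

load_dotenv()  # reads .env from the current directory into os.environ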

Installation

Install Python packages:

pip install -r requirements.txt

Note: The openai package and other dependencies (langchain, pydantic, etc.) will be automatically installed as dependencies of crewai.

Install promptfoo CLI:

npm install -g promptfoo

Files

  • agent.py: Contains the CrewAI agent setup and the promptfoo provider interface (see the sketch after this list)
  • promptfooconfig.yaml: Configures prompts, providers, and tests for evaluation
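
For orientation, promptfoo's Python provider convention is a call_api function that receives the rendered prompt and returns a dict with an output key (or an error key on failure). A minimal sketch of that shape, with a stand-in body where the real agent.py runs the CrewAI crew:

def call_api(prompt: str, options: dict, context: dict) -> dict:
    try:
        result = f"echo: {prompt}"  # stand-in: the real agent.py runs the CrewAI crew here
        return {"output": result}
    except Exception as exc:  # surface failures to promptfoo instead of crashing
        return {"error": str(exc)}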

Note on Reliability

When using a real LLM, you may notice that the agent's output is not always reliable, especially for more complex queries. For example, the agent may fail to return valid JSON or may not return a response at all. This is a common challenge when working with LLMs.
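
One common mitigation, not necessarily what this example implements, is to parse the agent's output defensively so a malformed response produces a clear error instead of a crash. A sketch:

import json

def parse_agent_json(raw: str) -> dict:
    # Agents often wrap JSON in prose or code fences; try the raw string first,
    # then fall back to the first {...} block we can find.
    try:
        return json.loads(raw)
    except json.JSONDecodeError:
        start, end = raw.find("{"), raw.rfind("}")
        if start != -1 and end > start:
            return json.loads(raw[start : end + 1])
        raise ValueError(f"Agent did not return valid JSON: {raw[:200]!r}")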

Running the Evaluation

Run the evaluation:

promptfoo eval

Explore results in browser:

promptfoo view

Troubleshooting

If you see authentication errors:

  • Ensure your OpenAI API key is set correctly
  • Verify the key is valid and has sufficient quota
  • Check that the environment variable is accessible to the Python process (a quick check is shown below)
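
To confirm the last point, run a quick check from the same shell you use for the evaluation:

python -c "import os; print('set' if os.environ.get('OPENAI_API_KEY') else 'NOT set')"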