You have to be logged in to leave a comment.

sidebar_position	sidebar_label
10	Command line

Command line

The promptfoo command line utility supports the following subcommands:

init [directory] - Initialize a new project with dummy files.
eval - Evaluate prompts and models. This is the command you'll be using the most!
view - Start a browser UI for visualization of results.
share - Create a URL that can be shared online.
auth - Manage authentication for cloud features.
cache - Manage cache.
- cache clear
config - Edit configuration settings.
- config get
- config set
- config unset
debug - Display debug information for troubleshooting.
generate - Generate data.
- generate dataset
- generate redteam
- generate assertions
list - List various resources like evaluations, prompts, and datasets.
- list evals
- list prompts
- list datasets
scan-model - Scan ML models for security vulnerabilities.
show <id> - Show details of a specific resource (evaluation, prompt, dataset).
delete <id> - Delete a resource by its ID (currently, just evaluations)
validate - Validate a promptfoo configuration file.
feedback <message> - Send feedback to the Promptfoo developers.
import <filepath> - Import an eval file from JSON format.
export <evalId> - Export an eval record to JSON format.
redteam - Red team LLM applications.
- redteam init
- redteam setup
- redteam run
- redteam discover
- redteam generate
- redteam poison
- redteam eval
- redteam report
- redteam plugins

Common Options

Most commands support the following common options:

Option	Description
`--env-file, --env-path <path>`	Path to .env file
`-v, --verbose`	Show debug logs
`--help`	Display help

`promptfoo eval`

By default the eval command will read the promptfooconfig.yaml configuration file in your current directory. But, if you're looking to override certain parameters you can supply optional arguments:

Option	Description
`-a, --assertions <path>`	Path to assertions file
`-c, --config <paths...>`	Path to configuration file(s). Automatically loads promptfooconfig.yaml
`--delay <number>`	Delay between each test (in milliseconds)
`--description <description>`	Description of the eval run
`--filter-failing <path or id>`	Filter tests that failed in a previous evaluation (by file path or eval ID)
`--filter-errors-only <path or id>`	Filter tests that resulted in errors in a previous evaluation
`-n, --filter-first-n <number>`	Only run the first N tests
`--filter-sample <number>`	Only run a random sample of N tests
`--filter-metadata <key=value>`	Only run tests whose metadata matches the key=value pair
`--filter-pattern <pattern>`	Only run tests whose description matches the regex pattern
`--filter-providers <providers>`	Only run tests with these providers (regex match)
`--filter-targets <targets>`	Only run tests with these targets (alias for --filter-providers)
`--grader <provider>`	Model that will grade outputs
`-j, --max-concurrency <number>`	Maximum number of concurrent API calls
`--model-outputs <path>`	Path to JSON containing list of LLM output strings
`--no-cache`	Do not read or write results to disk cache
`--no-progress-bar`	Do not show progress bar
`--no-table`	Do not output table in CLI
`--no-write`	Do not write results to promptfoo directory
`-o, --output <paths...>`	Path(s) to output file (csv, txt, json, jsonl, yaml, yml, html, xml)
`-p, --prompts <paths...>`	Paths to prompt files (.txt)
`--prompt-prefix <path>`	Prefix prepended to every prompt
`--prompt-suffix <path>`	Suffix appended to every prompt
`-r, --providers <name or path...>`	Provider names or paths to custom API caller modules
`--remote`	Force remote inference wherever possible (used for red teams)
`--repeat <number>`	Number of times to run each test
`--share`	Create a shareable URL
`--suggest-prompts <number>`	Generate N new prompts and append them to the prompt list
`--table`	Output table in CLI
`--table-cell-max-length <number>`	Truncate console table cells to this length
`-t, --tests <path>`	Path to CSV with test cases
`--var <key=value>`	Set a variable in key=value format
`-v, --vars <path>`	Path to CSV with test cases (alias for --tests)
`-w, --watch`	Watch for changes in config and re-run

The eval command will return exit code 100 when there is at least 1 test case failure or when the pass rate is below the threshold set by PROMPTFOO_PASS_RATE_THRESHOLD. It will return exit code 1 for any other error. The exit code for failed tests can be overridden with environment variable PROMPTFOO_FAILED_TEST_EXIT_CODE.

`promptfoo init [directory]`

Initialize a new project with dummy files.

Option	Description
`directory`	Directory to create files in
`--no-interactive`	Do not run in interactive mode

`promptfoo view`

Start a browser UI for visualization of results.

Option	Description
`-p, --port <number>`	Port number for the local server
`-y, --yes`	Skip confirmation and auto-open the URL

If you've used PROMPTFOO_CONFIG_DIR to override the promptfoo output directory, run promptfoo view [directory].

`promptfoo share [evalId]`

Create a URL that can be shared online.

Option	Description
`--show-auth`	Include auth info in the shared URL
`-y, --yes`	Skip confirmation before creating shareable URL

`promptfoo cache`

Manage cache.

Option	Description
`clear`	Clear the cache

`promptfoo feedback <message>`

Send feedback to the promptfoo developers.

Option	Description
`message`	Feedback message

`promptfoo list`

List various resources like evaluations, prompts, and datasets.

Subcommand	Description
`evals`	List evaluations
`prompts`	List prompts
`datasets`	List datasets

Option	Description
`-n`	Show the first n records, sorted by descending date of creation
`--ids-only`	Show only IDs without descriptions

`promptfoo show <id>`

Show details of a specific resource.

Option	Description
`eval <id>`	Show details of a specific evaluation
`prompt <id>`	Show details of a specific prompt
`dataset <id>`	Show details of a specific dataset

`promptfoo delete <id>`

Deletes a specific resource.

Option	Description
`eval <id>`	Delete an evaluation by id

`promptfoo import <filepath>`

Import an eval file from JSON format.

`promptfoo export <evalId>`

Export an eval record to JSON format. To export the most recent, use evalId latest.

Option	Description
`-o, --output <filepath>`	File to write. Writes to stdout by default.

`promptfoo validate`

Validate a promptfoo configuration file to ensure it follows the correct schema and structure.

Option	Description
`-c, --config <paths...>`	Path to configuration file(s). Automatically loads promptfooconfig.yaml

This command validates both the configuration file and the test suite to ensure they conform to the expected schema. It will report any validation errors with detailed messages to help you fix configuration issues.

Examples:

# Validate the default promptfooconfig.yaml
promptfoo validate

# Validate a specific configuration file
promptfoo validate -c my-config.yaml

# Validate multiple configuration files
promptfoo validate -c config1.yaml config2.yaml

The command will exit with code 1 if validation fails, making it useful for CI/CD pipelines to catch configuration errors early.

`promptfoo scan-model`

Scan ML models for security vulnerabilities. Provide one or more paths to model files or directories.

Option	Description	Default
`-b, --blacklist <pattern>`	Additional blacklist patterns to check against model names
`-f, --format <format>`	Output format (`text` or `json`)	`text`
`-o, --output <path>`	Output file path (prints to stdout if not specified)
`-t, --timeout <seconds>`	Scan timeout in seconds	`300`
`--max-file-size <bytes>`	Maximum file size to scan in bytes

`promptfoo auth`

Manage authentication for cloud features.

`promptfoo auth login`

Option	Description
`-o, --org <orgId>`	The organization ID to login to
`-h, --host <host>`	The host of the promptfoo instance (API URL if different from the app URL)
`-k, --api-key <key>`	Login using an API key

`promptfoo auth logout`

Logout from the promptfoo cloud.

`promptfoo auth whoami`

Show current user information.

`promptfoo config`

Edit configuration settings.

`promptfoo config get email`

Get the user's email address.

`promptfoo config set email <email>`

Set the user's email address.

`promptfoo config unset email`

Unset the user's email address.

Option	Description
`-f, --force`	Force unset without confirmation

`promptfoo debug`

Display debug information for troubleshooting.

Option	Description
`-c, --config [path]`	Path to configuration file. Defaults to promptfooconfig.yaml

`promptfoo generate dataset`

BETA: Generate synthetic test cases based on existing prompts and variables.

Option	Description	Default
`-c, --config <path>`	Path to the configuration file	promptfooconfig.yaml
`-w, --write`	Write the generated test cases directly to the config file	false
`-i, --instructions <text>`	Custom instructions for test case generation
`-o, --output <path>`	Path to write the generated test cases	stdout
`--numPersonas <number>`	Number of personas to generate	5
`--numTestCasesPerPersona <number>`	Number of test cases per persona	3
`--provider <provider>`	Provider to use for generating test cases	default grader
`--no-cache`	Do not read or write results to disk cache	false

For example, this command will modify your default config file (usually promptfooconfig.yaml) with new test cases:

promptfoo generate dataset -w

This command will generate test cases for a specific config and write them to a file, while following special instructions:

promptfoo generate dataset -c my_config.yaml -o new_tests.yaml -i 'All test cases for {{location}} must be European cities'

`promptfoo generate assertions`

Generate additional objective/subjective assertions based on existing prompts and assertions.

This command can be used to generate initial set of assertions, if none exist.
Will only add non-overlapping, independent assertions
Generates both python and natural language assertions.

When brainstorming assertions:

Generates python code for any objective assertions
Uses a specified natural language assertion type (pi, llm-rubric, or g-eval) for any subjective assertion.

Option	Description	Default
`-t, --type <type>`	The assertion type to use for generated subjective assertions.	pi
`-c, --config <path>`	Path to the configuration file that contains at least 1 prompt.	promptfooconfig.yaml
`-w, --write`	Write the generated assertions directly to the config file	false
`-i, --instructions <text>`	Custom instructions for assertion generation
`-o, --output <path>`	Path to write the generated assertions	stdout
`--numAssertions <number>`	Number of assertions to generate	5
`--provider <provider>`	Provider to use for generating assertions	default grader
`--no-cache`	Do not read or write results to disk cache	false

For example, this command will modify your default config file (usually promptfooconfig.yaml) with new test cases:

promptfoo generate assertions -w

This command will generate pi and python assertions for a specific config and write them to a file, while following special instructions:

promptfoo generate assertions -c my_config.yaml -o new_tests.yaml -i 'I need assertions about pronunciation'

`promptfoo generate redteam`

Alias for promptfoo redteam generate.

`promptfoo redteam init`

Initialize a red teaming project.

Option	Description	Default
`[directory]`	Directory to initialize the project in	.
`--no-gui`	Do not open the browser UI

Example:

promptfoo redteam init my_project

:::danger Adversarial testing produces offensive, toxic, and harmful test inputs, and may cause your system to produce harmful outputs. :::

For more detail, see red team configuration.

`promptfoo redteam setup`

Start browser UI and open to red team setup.

Option	Description	Default
`[configDirectory]`	Directory containing configuration files
`-p, --port <number>`	Port number for the local server	15500

`promptfoo redteam run`

Run the complete red teaming process (init, generate, and evaluate).

Option	Description	Default
`-c, --config [path]`	Path to configuration file	promptfooconfig.yaml
`-o, --output [path]`	Path to output file for generated tests	redteam.yaml
`--no-cache`	Do not read or write results to disk cache	false
`-j, --max-concurrency <number>`	Maximum number of concurrent API calls
`--delay <number>`	Delay in milliseconds between API calls
`--remote`	Force remote inference wherever possible	false
`--force`	Force generation even if no changes are detected	false
`--no-progress-bar`	Do not show progress bar
`--filter-providers, --filter-targets <providers>`	Only run tests with these providers (regex match)
`-t, --target <id>`	Cloud provider target ID to run the scan on

`promptfoo redteam discover`

Runs the Target Discovery Agent against your application.

:::info

Only a configuration file or target can be specified

:::

Option	Description	Default
`-c, --config <path>`	Path to `promptfooconfig.yaml` configuration file.
`-t, --target <id>`	UUID of a target defined in Promptfoo Cloud to scan.

`promptfoo redteam generate`

Generate adversarial test cases to challenge your prompts and models.

Option	Description	Default
`-c, --config <path>`	Path to configuration file	promptfooconfig.yaml
`-o, --output <path>`	Path to write the generated test cases	redteam.yaml
`-w, --write`	Write the generated test cases directly to the config file	false
`--purpose <purpose>`	High-level description of the system's purpose	Inferred from config
`--provider <provider>`	Provider to use for generating adversarial tests
`--injectVar <varname>`	Override the `{{variable}}` that represents user input in the prompt	`prompt`
`--plugins <plugins>`	Comma-separated list of plugins to use	default
`--strategies <strategies>`	Comma-separated list of strategies to use	default
`-n, --num-tests <number>`	Number of test cases to generate per plugin
`--language <language>`	Specify the language for generated tests	English
`--no-cache`	Do not read or write results to disk cache	false
`-j, --max-concurrency <number>`	Maximum number of concurrent API calls
`--delay <number>`	Delay in milliseconds between plugin API calls
`--remote`	Force remote inference wherever possible	false
`--force`	Force generation even if no changes are detected	false
`--burp-escape-json`	Escape special characters in .burp output for JSON payloads	false

For example, let's suppose we have the following promptfooconfig.yaml:

prompts:
  - 'Act as a trip planner and help the user plan their trip'

providers:
  - openai:gpt-4.1-mini
  - openai:gpt-4.1

This command will generate adversarial test cases and write them to redteam.yaml.

promptfoo redteam generate

This command overrides the system purpose and the variable to inject adversarial user input:

promptfoo redteam generate --purpose 'Travel agent that helps users plan trips' --injectVar 'message'

`promptfoo redteam poison`

Generate poisoned documents for RAG testing.

Option	Description	Default
`documents`	Documents, directories, or text content to poison
`-g, --goal <goal>`	Goal/intended result of the poisoning
`-o, --output <path>`	Output YAML file path	`poisoned-config.yaml`
`-d, --output-dir <path>`	Directory to write individual poisoned documents	`poisoned-documents`

`promptfoo redteam eval`

Works the same as promptfoo eval, but defaults to loading redteam.yaml.

`promptfoo redteam report`

Start a browser UI and open the red teaming report.

Option	Description	Default
`[directory]`	Directory containing the red teaming configuration	.
`-p, --port <number>`	Port number for the server	15500

Example:

promptfoo redteam report -p 8080

`promptfoo redteam plugins`

List all available red team plugins.

Option	Description
`--ids-only`	Show only plugin IDs without descriptions
`--default`	Show only the default plugins

Specifying Command Line Options in Config

Many command line options can be specified directly in your promptfooconfig.yaml file using the commandLineOptions section. This is convenient for options you frequently use or want to set as defaults for your project.

Example:

prompts:
  - Write a funny tweet about {{topic}}
providers:
  - openai:o3-mini
tests:
  - file://test_cases.csv

# Command line options as defaults
commandLineOptions:
  maxConcurrency: 5
  verbose: true
  table: true
  share: false
  cache: true
  tableCellMaxLength: 500

With this configuration, you can simply run promptfoo eval without specifying these options on the command line. You can still override any of these settings by providing the corresponding flag when running the command.

ASCII-only outputs

To disable terminal colors for printed outputs, set FORCE_COLOR=0 (this is supported by the chalk library).

For the eval command, you may also want to disable the progress bar and table as well, because they use special characters:

FORCE_COLOR=0 promptfoo eval --no-progress-bar --no-table

Environment variables

These general-purpose environment variables are supported:

Name	Description	Default
`FORCE_COLOR`	Set to 0 to disable terminal colors for printed outputs
`PROMPTFOO_ASSERTIONS_MAX_CONCURRENCY`	How many assertions to run at a time	3
`PROMPTFOO_CACHE_ENABLED`	Enable LLM request/response caching	`false`
`PROMPTFOO_CONFIG_DIR`	Directory that stores eval history	`~/.promptfoo`
`PROMPTFOO_DISABLE_AJV_STRICT_MODE`	If set, disables AJV strict mode for JSON schema validation
`PROMPTFOO_DISABLE_CONVERSATION_VAR`	Prevents the `_conversation` variable from being set
`PROMPTFOO_DISABLE_ERROR_LOG`	Prevents error logs from being written to a file
`PROMPTFOO_DISABLE_JSON_AUTOESCAPE`	If set, disables smart variable substitution within JSON prompts
`PROMPTFOO_DISABLE_OBJECT_STRINGIFY`	Disable object stringification in templates. When false (default), objects are stringified to prevent `[object Object]` issues. When true, allows direct property access (e.g., `{{output.property}}`).	`false`
`PROMPTFOO_DISABLE_REF_PARSER`	Prevents JSON schema dereferencing
`PROMPTFOO_DISABLE_REDTEAM_REMOTE_GENERATION`	Disables remote generation for red team tests. When true, all test generation happens locally. Note: Cloud users will still use remote generation by default unless this is explicitly set to true.	`false`
`PROMPTFOO_DISABLE_TEMPLATE_ENV_VARS`	Disables OS environment variables in templates. When true, only config `env:` variables are available in templates.	`false` (true in self-hosted)
`PROMPTFOO_DISABLE_TEMPLATING`	Disables Nunjucks template processing	`false`
`PROMPTFOO_DISABLE_VAR_EXPANSION`	Prevents Array-type vars from being expanded into multiple test cases
`PROMPTFOO_FAILED_TEST_EXIT_CODE`	Override the exit code when there is at least 1 test case failure or when the pass rate is below PROMPTFOO_PASS_RATE_THRESHOLD	100
`PROMPTFOO_LOG_DIR`	Directory to write error logs	`.`
`PROMPTFOO_PASS_RATE_THRESHOLD`	Set a minimum pass rate threshold (as a percentage). If not set, defaults to 100% (no failures allowed)	100
`PROMPTFOO_REQUIRE_JSON_PROMPTS`	By default the chat completion provider will wrap non-JSON messages in a single user message. Setting this envar to true disables that behavior.
`PROMPTFOO_SHARE_CHUNK_SIZE`	Number of results to send in each chunk. This is used to estimate the size of the results and to determine the number of chunks to send.
`PROMPTFOO_EVAL_TIMEOUT_MS`	Timeout in milliseconds for each individual test case/provider API call. When reached, that specific test is marked as an error.
`PROMPTFOO_MAX_EVAL_TIME_MS`	Maximum total runtime in milliseconds for the entire evaluation process. When reached, all remaining tests are marked as errors and the eval ends.
`PROMPTFOO_STRIP_GRADING_RESULT`	Strip grading results from results to reduce memory usage	false
`PROMPTFOO_STRIP_METADATA`	Strip metadata from results to reduce memory usage	false
`PROMPTFOO_STRIP_PROMPT_TEXT`	Strip prompt text from results to reduce memory usage	false
`PROMPTFOO_STRIP_RESPONSE_OUTPUT`	Strip model response outputs from results to reduce memory usage	false
`PROMPTFOO_STRIP_TEST_VARS`	Strip test variables from results to reduce memory usage	false
`PROMPTFOO_SELF_HOSTED`	Enables self-hosted mode. When true, disables OS environment variables in templates (only config `env:` values available), disables telemetry, and modifies other behaviors for controlled environments	`false`

:::tip promptfoo will load environment variables from the .env in your current working directory. :::

:::tip For detailed information on using timeout features, including configuration examples and troubleshooting tips, see Timeout errors in the troubleshooting guide. :::

Tip!

Press p or to see the previous file or, n or to see the next file

nirbarazida / promptfoo mirror of https://github.com/promptfoo/promptfoo

command-line.md 34 KB Permalink History Raw

Command line

Common Options

promptfoo eval

promptfoo init [directory]

promptfoo view

promptfoo share [evalId]

promptfoo cache

promptfoo feedback <message>

promptfoo list

promptfoo show <id>

promptfoo delete <id>

promptfoo import <filepath>

promptfoo export <evalId>

promptfoo validate

promptfoo scan-model

promptfoo auth

promptfoo auth login

promptfoo auth logout

promptfoo auth whoami

promptfoo config

promptfoo config get email

promptfoo config set email <email>

promptfoo config unset email

promptfoo debug

promptfoo generate dataset

promptfoo generate assertions

promptfoo generate redteam

promptfoo redteam init

promptfoo redteam setup

promptfoo redteam run

promptfoo redteam discover

promptfoo redteam generate

promptfoo redteam poison

promptfoo redteam eval

promptfoo redteam report

promptfoo redteam plugins

Specifying Command Line Options in Config

ASCII-only outputs

Environment variables

Comments

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

nirbarazida
/
promptfoo
mirror of https://github.com/promptfoo/promptfoo

command-line.md 34 KB

Permalink History Raw

`promptfoo eval`

`promptfoo init [directory]`

`promptfoo view`

`promptfoo share [evalId]`

`promptfoo cache`

`promptfoo feedback <message>`

`promptfoo list`

`promptfoo show <id>`

`promptfoo delete <id>`

`promptfoo import <filepath>`

`promptfoo export <evalId>`

`promptfoo validate`

`promptfoo scan-model`

`promptfoo auth`

`promptfoo auth login`

`promptfoo auth logout`

`promptfoo auth whoami`

`promptfoo config`

`promptfoo config get email`

`promptfoo config set email <email>`

`promptfoo config unset email`

`promptfoo debug`

`promptfoo generate dataset`

`promptfoo generate assertions`

`promptfoo generate redteam`

`promptfoo redteam init`

`promptfoo redteam setup`

`promptfoo redteam run`

`promptfoo redteam discover`

`promptfoo redteam generate`

`promptfoo redteam poison`

`promptfoo redteam eval`

`promptfoo redteam report`

`promptfoo redteam plugins`