Getting Started with Promptfoo and llama.cpp (llama-cpp)

You can run this example with:

npx promptfoo@latest init --example llama-cpp

Install llama.cpp

To begin, install llama.cpp by following the instructions on their GitHub page.
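For reference, a typical CMake-based build looks roughly like the following. This is a sketch, not the canonical install procedure; platform-specific options (such as GPU backends) are covered in the llama.cpp README:

```shell
# Clone and build llama.cpp with CMake.
# See the project's README for platform-specific flags (Metal, CUDA, etc.).
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp
cmake -B build
cmake --build build --config Release

# The server binary is typically placed under build/bin/
ls build/bin/llama-server
```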

Starting the Server

To start the server, use the following command:

./llama-server -m your_model.gguf --port 8080

You can check if it's running by visiting http://localhost:8080.
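To check from the command line instead, the llama.cpp server exposes a health endpoint (assuming the server from the previous step is listening on port 8080):

```shell
# Returns a JSON status once the model has finished loading
curl http://localhost:8080/health
```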

Configuring Promptfoo

  1. Edit the prompts in promptfooconfig.yaml.

  2. Run the evaluation:

    npx promptfoo@latest eval
    
  3. View the results:

    npx promptfoo@latest view
    
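The steps above assume a promptfooconfig.yaml with the usual prompts, providers, and tests sections, roughly like the sketch below. The provider id here is a placeholder, not verified against the shipped example; check promptfoo's provider documentation for the exact id to use with a llama.cpp server:

```yaml
# Hypothetical sketch of a promptfooconfig.yaml for this setup.
prompts:
  - 'Summarize the following text in one sentence: {{text}}'

providers:
  - id: llama  # placeholder; see promptfoo docs for the llama.cpp provider id

tests:
  - vars:
      text: 'The quick brown fox jumps over the lazy dog.'
    assert:
      - type: contains
        value: fox
```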

Note on Supported Models

llama.cpp supports many models that can be converted to the GGUF format. We recommend downloading models from Hugging Face. Some models are gated, so you may need to authenticate with the Hugging Face CLI before downloading them.
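For example, with the huggingface_hub CLI (the repo id and filename below are illustrative placeholders, not real artifact names):

```shell
pip install -U "huggingface_hub[cli]"

# Needed only for gated models
huggingface-cli login

# Download a GGUF file; <repo-id> and <model-file> are placeholders
huggingface-cli download <repo-id> <model-file>.gguf --local-dir ./models
```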

Note on Prompt Formatting

Promptfoo does not reformat prompts for llama.cpp; they are passed to the server as-is. Refer to the documentation or model card for the specific model you are using to ensure your prompts match its expected template. We provide several formatting examples to illustrate different ways to format your prompts.
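As an illustration of what "passed as-is" means in practice, here is how a Llama-2-chat style prompt can be built by hand. This template applies only to Llama-2 chat models; other GGUF models expect different templates (e.g. ChatML), so always check the model card:

```python
def format_llama2_chat(system: str, user: str) -> str:
    """Build a Llama-2-chat style prompt string by hand.

    Only valid for Llama-2 chat models; other models use
    different templates, so consult the model card.
    """
    return f"<s>[INST] <<SYS>>\n{system}\n<</SYS>>\n\n{user} [/INST]"


prompt = format_llama2_chat(
    "You are a helpful assistant.",
    "What is the capital of France?",
)
print(prompt)
```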

Note on Caching

Because promptfoo cannot see which model llama.cpp is serving, it will not invalidate its cache when you swap models, so you may see stale results after changing the model. Run npx promptfoo@latest eval --no-cache to perform the evaluation without using the cache.
