
# bert-score (BERTScore Evaluation)

Use BERTScore to measure semantic similarity between LLM outputs and reference text.

```sh
npx promptfoo@latest init --example bert-score
```

## Setup

```sh
pip install -r requirements.txt
```

Note: The first run will download the BERT model (~1.4GB).
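
If you'd rather pay the download cost up front, a one-off warm-up call caches the model. This is an optional sketch, not part of the example:

```python
# warm_up.py -- hypothetical helper, not shipped with this example.
# bert_score downloads and caches the default English model (roberta-large)
# the first time score() is called, so one throwaway call pre-warms it.
from bert_score import score

score(["warm up"], ["warm up"], lang="en", verbose=True)
```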

## Usage

### Basic Example

```yaml
# promptfooconfig.yaml
tests:
  - vars:
      text: 'Hello world'
      reference: 'Hi there'
    assert:
      - type: python
        value: file://bertscore_check.py
        threshold: 0.7 # Pass if similarity > 70%
```

Run: `promptfoo eval`
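
The config delegates scoring to `bertscore_check.py`, which isn't reproduced here. A minimal sketch of what such a script could look like, assuming promptfoo's file-based Python assertion convention (a `get_assert(output, context)` entry point); the script shipped with the example may differ:

```python
# bertscore_check.py -- illustrative sketch; the actual file in this
# example may differ.
from bert_score import score

def get_assert(output, context):
    # promptfoo calls get_assert() with the LLM output; test case
    # variables are available under context['vars'].
    reference = context['vars']['reference']
    # score() returns (precision, recall, F1) tensors, one entry per candidate.
    _, _, f1 = score([output], [reference], lang='en', verbose=False)
    # A returned float is treated as a score and checked against the
    # threshold set in promptfooconfig.yaml.
    return f1.item()
```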

### Advanced Example

Compare against multiple valid references:

```yaml
# promptfooconfig-advanced.yaml
assert:
  - type: python
    value: |
      from bert_score import score
      references = [
          "First valid answer",
          "Second valid answer",
          "Third valid answer"
      ]
      scores = []
      for ref in references:
          _, _, F1 = score([output], [ref], lang='en', verbose=False)
          scores.append(F1.item())
      return max(scores)  # Use best match
```

Run: `promptfoo eval -c promptfooconfig-advanced.yaml`
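
Looping over references is easy to read, but every `score()` call re-runs the model. As a possible optimization, `bert_score` also accepts a nested list of references per candidate and keeps the best match itself; this hypothetical variant assumes the library's multi-reference support behaves as documented:

```yaml
# Hypothetical single-call variant of promptfooconfig-advanced.yaml
# (assumes bert_score's multi-reference support keeps the best match).
assert:
  - type: python
    value: |
      from bert_score import score
      references = [
          "First valid answer",
          "Second valid answer",
          "Third valid answer"
      ]
      # One candidate, many references: note the nested list.
      _, _, F1 = score([output], [references], lang='en', verbose=False)
      return F1.item()
```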

## How It Works

BERTScore returns a similarity score from 0 to 1:

- 0.9+ = Nearly identical meaning
- 0.7–0.9 = Similar meaning
- <0.7 = Different meaning
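
You can sanity-check these bands with a standalone run (a minimal sketch; exact numbers vary with the underlying model, roberta-large by default for `lang='en'`):

```python
from bert_score import score

# Candidate/reference pairs intended to span the bands above.
pairs = [
    ("The cat sat on the mat.", "The cat sat on the mat."),      # identical
    ("The cat sat on the mat.", "A cat was sitting on a mat."),  # similar
    ("The cat sat on the mat.", "Quarterly revenue rose 12%."),  # unrelated
]
for candidate, reference in pairs:
    _, _, f1 = score([candidate], [reference], lang='en', verbose=False)
    print(f"F1={f1.item():.3f}  {candidate!r} vs {reference!r}")
```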

## Learn more
