
promptfooconfig.comparison.yaml 1.1 KB

# yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
description: 'Compare reasoning capabilities across Mistral models'

# Models under comparison: two Magistral reasoning models and two general-purpose models.
providers:
  - mistral:magistral-medium-latest
  - mistral:magistral-small-latest
  - mistral:mistral-large-latest
  - mistral:mistral-small-latest

prompts:
  - 'Solve this step by step: {{problem}}'

tests:
  # 60 remote x $200 + 25 hybrid x $100 = $12,000 + $2,500 = $14,500
  - vars:
      problem: "A company has 100 employees. 60% work remotely, 25% work hybrid, and the rest work in office. If remote workers get a $200 stipend and hybrid workers get $100, what's the total monthly stipend cost?"
    assert:
      - type: llm-rubric
        value: 'Shows clear mathematical reasoning and arrives at the correct answer ($14,500)'
      - type: cost
        threshold: 0.10
  # 3 boxes x 4 apples = 12; 12 - 2 - 1 = 9
  - vars:
      problem: 'If I have 3 boxes with 4 apples each, and I eat 2 apples from the first box and 1 apple from the second box, how many apples do I have left in total?'
    assert:
      - type: llm-rubric
        value: 'Correctly calculates the remaining apples (9 apples total)'
      - type: contains
        value: '9'
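# Sketch of a stricter final check (an assumption, not part of the original config):
# the `contains` assertion above passes on any output that includes the character 9,
# so an answer such as "19" or "90" would also pass. Assuming promptfoo's `regex`
# assertion type, a word-boundary pattern would accept only a standalone 9.
# It could replace the `contains` entry in the second test's `assert` list:
#
#       - type: regex
#         value: '\b9\b'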