Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

promptfooconfig.reasoning.yaml 2.0 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
  1. # yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
  2. description: 'Advanced reasoning with Mistral Magistral models'
  3. providers:
  4. - id: mistral:magistral-medium-latest
  5. label: magistral-medium
  6. config:
  7. temperature: 0.7
  8. top_p: 0.95
  9. max_tokens: 40960
  10. - id: mistral:magistral-small-latest
  11. label: magistral-small
  12. config:
  13. temperature: 0.7
  14. top_p: 0.95
  15. max_tokens: 40960
  16. prompts:
  17. - 'Think through this problem step by step: {{problem}}'
  18. tests:
  19. - vars:
  20. problem: 'A farmer has chickens and rabbits in a pen. There are 30 heads and 74 legs total. How many chickens and how many rabbits are there?'
  21. assert:
  22. - type: contains
  23. value: 'step'
  24. - type: llm-rubric
  25. value: 'Correctly identifies that there are 16 chickens and 14 rabbits through systematic reasoning'
  26. - type: contains-any
  27. value: ['16 chickens', '14 rabbits', 'chickens: 16', 'rabbits: 14']
  28. - vars:
  29. problem: 'Three friends split a restaurant bill. Alice pays twice as much as Bob, and Charlie pays $15 more than Bob. If the total bill is $105, how much does each person pay?'
  30. assert:
  31. - type: contains
  32. value: 'equation'
  33. - type: llm-rubric
  34. value: 'Shows the algebraic setup and correctly calculates Bob: $22.50, Alice: $45, Charlie: $37.50'
  35. - type: contains-any
  36. value: ['22.50', '$22.50', '45', '$45', '37.50', '$37.50']
  37. - vars:
  38. problem: 'A water tank is being filled by two pipes and drained by one pipe. Pipe A fills at 10 gallons/minute, Pipe B fills at 15 gallons/minute, and the drain empties at 8 gallons/minute. If all pipes operate simultaneously and the tank starts empty, how long will it take to fill a 340-gallon tank?'
  39. assert:
  40. - type: contains
  41. value: 'rate'
  42. - type: llm-rubric
  43. value: 'Correctly calculates the net fill rate (17 gallons/minute) and determines it takes 20 minutes'
  44. - type: contains-any
  45. value: ['20 minutes', '20 min', '17 gallons/minute', '17 gal/min']
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...