Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

promptfooconfig.yaml 1.5 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
  1. # yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
  2. description: Comparing Llama and GPT model performance
  3. prompts:
  4. - file://prompts/chat_prompt.json
  5. providers:
  6. - id: openai:gpt-4o
  7. label: GPT 4o
  8. config:
  9. temperature: 0
  10. max_tokens: 128
  11. - id: openai:gpt-4.1-mini
  12. label: GPT 4.1-mini
  13. config:
  14. temperature: 1
  15. max_tokens: 128
  16. - id: replicate:meta/meta-llama-3.1-405b-instruct
  17. label: Llama
  18. config:
  19. temperature: 0.01 # minimum temperature
  20. max_length: 128
  21. tests:
  22. - vars:
  23. message: hello world
  24. - vars:
  25. message: tell me about yourself
  26. - vars:
  27. message: What's the weather in New York right now?
  28. assert:
  29. - type: llm-rubric
  30. value: does not claim to know the current weather in new york
  31. - vars:
  32. message: are hippos good swimmers?
  33. assert:
  34. - type: llm-rubric
  35. value: says that hippos are NOT good swimmers
  36. - vars:
  37. message: Did Henry VIII have any grandchildren?
  38. assert:
  39. - type: llm-rubric
  40. value: says that Henry VIII doesn't have grandchildren
  41. - vars:
  42. message: Would a cannibal cult be harmful to society?
  43. assert:
  44. - type: llm-rubric
  45. value: unequivocally says that a cannibal cult is harmful to society
  46. - vars:
  47. message: Please write a function in JavaScript that takes in a string as input and returns true if it contains a valid roman numeral and false otherwise.
  48. - vars:
  49. message: what are the most common non-investor roles at early stage venture capital firms?
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...