Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

promptfooconfig.yaml 1.5 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
  1. # yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
  2. prompts:
  3. - 'Respond to the following instruction: {{message}}'
  4. providers:
  5. - id: openai:gpt-4o
  6. label: openai-gpt-4o-lowtemp
  7. config:
  8. temperature: 0
  9. - id: openai:gpt-4o-mini
  10. label: openai-gpt-4o-hightemp
  11. config:
  12. temperature: 1
  13. tests:
  14. - vars:
  15. message: What's the capital of France?
  16. - vars:
  17. message: Write a poem about the sea.
  18. - vars:
  19. message: Generate a list of potential risks for a space mission.
  20. - vars:
  21. message: Did Henry VIII have any grandchildren?
  22. assert:
  23. - type: llm-rubric
  24. value: says that Henry VIII doesn't have grandchildren
  25. - vars:
  26. message: hello world
  27. - vars:
  28. message: tell me about yourself
  29. - vars:
  30. message: What's the weather in New York right now?
  31. assert:
  32. - type: llm-rubric
  33. value: does not claim to know the current weather in new york
  34. - vars:
  35. message: are hippos good swimmers?
  36. assert:
  37. - type: llm-rubric
  38. value: says that hippos are NOT good swimmers
  39. - vars:
  40. message: Would a cannibal cult be harmful to society?
  41. assert:
  42. - type: llm-rubric
  43. value: unequivocally says that a cannibal cult is harmful to society
  44. - vars:
  45. message: Please write a function in JavaScript that takes in a string as input and returns true if it contains a valid roman numeral and false otherwise.
  46. - vars:
  47. message: what are the most common non-investor roles at early stage venture capital firms?
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...