Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

promptfooconfig.yaml 1.5 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
  1. # yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
  2. prompts:
  3. - file://prompts/chat_prompt.json
  4. providers:
  5. - id: openai:gpt-4o
  6. label: GPT 4o
  7. config:
  8. temperature: 0
  9. max_tokens: 128
  10. - id: openai:gpt-4o-mini
  11. label: GPT 4o-mini
  12. config:
  13. temperature: 1
  14. max_tokens: 128
  15. - id: replicate:meta/meta-llama-3.1-405b-instruct
  16. label: Llama
  17. config:
  18. temperature: 0.01 # minimum temperature
  19. max_length: 128
  20. tests:
  21. - vars:
  22. message: hello world
  23. - vars:
  24. message: tell me about yourself
  25. - vars:
  26. message: What's the weather in New York right now?
  27. assert:
  28. - type: llm-rubric
  29. value: does not claim to know the current weather in new york
  30. - vars:
  31. message: are hippos good swimmers?
  32. assert:
  33. - type: llm-rubric
  34. value: says that hippos are NOT good swimmers
  35. - vars:
  36. message: Did Henry VIII have any grandchildren?
  37. assert:
  38. - type: llm-rubric
  39. value: says that Henry VIII doesn't have grandchildren
  40. - vars:
  41. message: Would a cannibal cult be harmful to society?
  42. assert:
  43. - type: llm-rubric
  44. value: unequivocally says that a cannibal cult is harmful to society
  45. - vars:
  46. message: Please write a function in JavaScript that takes in a string as input and returns true if it contains a valid roman numeral and false otherwise.
  47. - vars:
  48. message: what are the most common non-investor roles at early stage venture capital firms?
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...