promptfooconfig.yaml 3.8 KB

# yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
description: Comparing Gemma and Mistral model performance
prompts:
  - '{{message}}'
defaultTest:
  options:
    transform: output.trim()
providers:
  - id: replicate:mistralai/mistral-7b-instruct-v0.2
    config:
      temperature: 0.01
      max_new_tokens: 1024
      prompt:
        prefix: '<s>[INST] '
        suffix: ' [/INST]'
  - id: replicate:mistralai/mixtral-8x7b-instruct-v0.1
    config:
      temperature: 0.01
      max_new_tokens: 1024
      prompt:
        prefix: '<s>[INST] '
        suffix: ' [/INST]'
  - id: replicate:cjwbw/gemma-7b-it:2790a695e5dcae15506138cc4718d1106d0d475e6dca4b1d43f42414647993d5
    config:
      temperature: 0.01
      max_new_tokens: 1024
      prompt:
        prefix: "<start_of_turn>user\n"
        suffix: "<end_of_turn>\n<start_of_turn>model"
tests:
  - vars:
      message: 'I speak without a mouth and hear without ears. I have no body, but I come alive with wind. What am I?'
    assert:
      # Make sure the LLM output contains this word
      - type: icontains
        value: echo
      # Use model-graded assertions to enforce free-form instructions
      - type: llm-rubric
        value: Do not apologize
  - vars:
      message: "You see a boat filled with people. It has not sunk, but when you look again you don't see a single person on the boat. Why?"
    assert:
      - type: llm-rubric
        value: explains that the people are below deck, or they are all in a relationship
  - vars:
      message: 'The more of this there is, the less you see. What is it?'
    assert:
      - type: icontains
        value: darkness
  - vars:
      message: >-
        I have keys but no locks. I have space but no room. You can enter, but
        can't go outside. What am I?
    assert:
      - type: icontains
        value: keyboard
  - vars:
      message: >-
        I am not alive, but I grow; I don't have lungs, but I need air; I don't
        have a mouth, but water kills me. What am I?
    assert:
      - type: icontains-any
        value:
          - fire
          - flame
  - vars:
      message: What can travel around the world while staying in a corner?
    assert:
      - type: icontains
        value: stamp
  - vars:
      message: Forward I am heavy, but backward I am not. What am I?
    assert:
      - type: icontains
        value: ton
  - vars:
      message: >-
        The person who makes it, sells it. The person who buys it, never uses
        it. The person who uses it, doesn't know they're using it. What is it?
    assert:
      - type: icontains
        value: coffin
  - vars:
      message: I can be cracked, made, told, and played. What am I?
    assert:
      - type: icontains
        value: joke
  - vars:
      message: What has keys but can't open locks?
    assert:
      - type: icontains
        value: piano
  - vars:
      message: >-
        I'm light as a feather, yet the strongest person can't hold me for much
        more than a minute. What am I?
    assert:
      - type: icontains
        value: breath
  - vars:
      message: >-
        I can fly without wings, I can cry without eyes. Whenever I go, darkness
        follows me. What am I?
    assert:
      - type: icontains
        value: cloud
  - vars:
      message: >-
        I am taken from a mine, and shut up in a wooden case, from which I am
        never released, and yet I am used by almost every person. What am I?
  - vars:
      message: >-
        David's father has three sons: Snap, Crackle, and _____? What is the
        name of the third son?
    assert:
      - type: contains
        value: David
  - vars:
      message: >-
        I am light as a feather, but even the world's strongest man couldn't
        hold me for much longer than a minute. What am I?
    assert:
      - type: contains
        value: breath
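# Sketch (not part of the original config): the riddle about the mine and the
# wooden case has no assert block, so it runs without any pass/fail check.
# Assuming the intended answer is "pencil lead" (graphite), a check along these
# lines could be added under that test. The values here are an assumption; the
# assertion type is the same icontains-any already used for the fire riddle.
#
#     assert:
#       - type: icontains-any
#         value:
#           - pencil
#           - graphite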