promptfooconfig.yaml 3.1 KB

# yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
description: Google AI Studio Gemini model evaluation

prompts:
  - |
    Given this math puzzle: {{puzzle}}
    Please solve this step by step, showing your reasoning process.

providers:
  - id: google:gemini-2.5-pro
    config:
      generationConfig:
        temperature: 0.7
        maxOutputTokens: 4096
        thinkingConfig:
          thinkingBudget: 2048 # Enhanced thinking for complex reasoning
  - id: google:gemini-2.5-flash
    config:
      generationConfig:
        temperature: 0.7
        maxOutputTokens: 2048
        thinkingConfig:
          thinkingBudget: 1024 # Enhanced reasoning and thinking capabilities
  - id: google:gemini-2.5-flash-lite
    config:
      generationConfig:
        temperature: 0.7
        maxOutputTokens: 1024
        thinkingConfig:
          thinkingBudget: 512 # Most cost-efficient and fastest 2.5 model
  - google:gemini-2.0-flash-exp
  - google:gemini-2.0-flash-thinking-exp
  - id: google:gemini-2.5-pro
    config:
      temperature: 0.7
      maxOutputTokens: 1024
      topP: 0.9
      topK: 40
  # System instruction from file example
  - id: google:gemini-2.5-pro
    label: gemini-with-system-instruction-file
    config:
      temperature: 0.3
      maxOutputTokens: 1024
      systemInstruction: file://system-instruction.txt
  # Structured output example
  - id: google:gemini-2.5-pro
    config:
      generationConfig:
        temperature: 0
        maxOutputTokens: 1024
        response_mime_type: 'application/json'
        response_schema:
          type: 'object'
          properties:
            answer:
              type: 'string'
              description: 'The numerical or text answer'
            confidence:
              type: 'string'
              enum: ['high', 'medium', 'low']
            method_used:
              type: 'string'
              description: 'One-word description of solving method'
          required: ['answer', 'confidence', 'method_used']
  # Function calling example
  - id: google:gemini-2.5-pro
    config:
      tools:
        function_declarations:
          - name: 'verify_answer'
            description: 'Verify if the answer is correct'
            parameters:
              type: 'object'
              properties:
                answer:
                  type: 'string'
                  description: 'The proposed answer'
                explanation:
                  type: 'string'
                  description: 'Brief explanation'
              required: ['answer']
      tool_config:
        function_calling_config:
          mode: 'auto'

defaultTest:
  options:
    provider:
      text:
        id: google:gemini-2.5-flash
      embedding:
        id: google:embedding:text-embedding-004

tests:
  - vars:
      puzzle: 'If a train travels at 60 mph for 2.5 hours, then at 40 mph for 1.5 hours, what is the average speed for the entire journey?'
    assert:
      - type: contains
        value: 52.5
      - type: llm-rubric
        value: |
          The answer should be 52.5 mph.
      - type: similar
        value: 52.5 mph
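  # For reference, the expected value in the train test above follows from
  # total distance / total time: (60 * 2.5 + 40 * 1.5) / (2.5 + 1.5) = 210 / 4 = 52.5 mph.
  #
  # Illustrative extra test case (hypothetical, not part of the original file).
  # It is a sketch that reuses the same assertion pattern for a second puzzle.
  # Expected value: with width w and length 2w, the perimeter is 6w = 30,
  # so w = 5 and the area is 5 * 10 = 50.
  - vars:
      puzzle: 'A rectangle has a perimeter of 30 units and its length is twice its width. What is its area?'
    assert:
      - type: contains
        value: '50'
      - type: llm-rubric
        value: |
          The answer should be 50 square units.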