Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

promptfooconfig.multimodal.yaml 1.6 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
  1. # yaml-language-server: $schema=https://promptfoo.dev/config-schema.json
  2. description: Vision-language model testing
  3. prompts:
  4. - file://vision_prompt.json
  5. providers:
  6. - id: hyperbolic:Qwen/Qwen2.5-VL-7B-Instruct
  7. label: Qwen VL 7B
  8. config:
  9. temperature: 0.1
  10. max_tokens: 1024
  11. # Note: This model may not be available with your current API key
  12. # - id: hyperbolic:qwen/Qwen2.5-VL-72B-Instruct
  13. # label: Qwen VL 72B
  14. # config:
  15. # temperature: 0.1
  16. # max_tokens: 1024
  17. # - id: hyperbolic:mistralai/Pixtral-12B
  18. # label: Pixtral 12B
  19. # config:
  20. # temperature: 0.1
  21. # max_tokens: 1024
  22. tests:
  23. # Animal behavior analysis
  24. - vars:
  25. question: 'Describe this adorable creature and what they might be thinking about while munching. Be creative!'
  26. image_url: 'https://upload.wikimedia.org/wikipedia/commons/thumb/0/0f/Grosser_Panda.JPG/1280px-Grosser_Panda.JPG'
  27. assert:
  28. - type: contains-any
  29. value: ['panda', 'bear', 'bamboo', 'eating']
  30. - type: llm-rubric
  31. value: 'The response accurately describes the panda with creative personality insights'
  32. # Creative animal storytelling
  33. - vars:
  34. question: 'Tell me a short story about what this panda is planning for the weekend. Make it funny!'
  35. image_url: 'https://upload.wikimedia.org/wikipedia/commons/thumb/0/0f/Grosser_Panda.JPG/1280px-Grosser_Panda.JPG'
  36. assert:
  37. - type: contains-any
  38. value: ['panda', 'weekend', 'bamboo', 'plan']
  39. - type: llm-rubric
  40. value: 'The response creates an engaging and humorous story about the panda'
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...