Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel

promptfooconfig.yaml 2.4 KB

You have to be logged in to leave a comment. Sign In
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
  1. prompts:
  2. - 'Hello! How are you?'
  3. providers:
  4. - openai:gpt-4o-mini
  5. tests:
  6. - assert: # Calculate score for one criteria only
  7. - type: g-eval
  8. value: >-
  9. Coherence - the collective quality of all sentences. We align this
  10. dimension with the DUC quality question of structure and coherence
  11. whereby "the reply should be well-structured and well-organized. The
  12. reply should not just be a heap of related information, but should
  13. build from sentence to a coherent body of information about a topic."
  14. - assert:
  15. - type: g-eval
  16. value: >-
  17. Consistency - the factual alignment between the reply and the source.
  18. A factually consistent reply contains only statements that are
  19. entailed by the source document. Annotators were also asked to
  20. penalize replies that contained hallucinated facts.
  21. - assert:
  22. - type: g-eval
  23. value: >-
  24. Fluency - the quality of the reply in terms of grammar, spelling,
  25. punctuation, word choice, and sentence structure.
  26. - assert:
  27. - type: g-eval
  28. value: >-
  29. Relevance - selection of important content for the source. The reply
  30. should include only important information for the source document.
  31. Annotators were instructed to penalize replies which contained
  32. redundancies and excess information.
  33. - assert: # Calculate average score among all criterias
  34. - type: g-eval
  35. value:
  36. - Coherence - the collective quality of all sentences. We align this dimension with the DUC quality question of structure and coherence whereby "the reply should be well-structured and well-organized. The reply should not just be a heap of related information, but should build from sentence to a coherent body of information about a topic."
  37. - Consistency - the factual alignment between the reply and the source. A factually consistent reply contains only statements that are entailed by the source document. Annotators were also asked to penalize replies that contained hallucinated facts.
  38. - Fluency - the quality of the reply in terms of grammar, spelling, punctuation, word choice, and sentence structure.
  39. - Relevance - selection of important content for the source. The reply should include only important information for the source document. Annotators were instructed to penalize replies which contained redundancies and excess information.
Tip!

Press p or to see the previous file or, n or to see the next file

Comments

Loading...