Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Michael 618e46d958
docs(openai): remove gpt-4.5-preview references after API deprecation (#5005)
1 month ago
..
7d4b0c5a80
fix(cli): esm provider loading (#4915)
1 month ago
81122a40cd
test: improve type safety and resolve TypeScript errors (#1216)
1 year ago
267a00fe40
fix: grading in crescendo (#4960)
1 month ago
f3c79680d4
chore(cli): add support for 'help' argument to display command help (#4823)
1 month ago
ff6b4c0b96
feat: enable WAL mode for SQLite (#4104)
2 months ago
bde6060d64
feat: add traces to javascript, python asserts (#4745)
1 month ago
58fd112507
chore: integrate knip for unused code detection and clean up codebase (#4464)
1 month ago
cd944496d9
fix(telemetry): prevent PostHog initialization when telemetry is disabled (#4772)
1 month ago
02d4023164
feat: add Langfuse prompt label support with improved parsing (#4847)
1 month ago
776bc5a10d
fix: additional checking on llm-rubric response (#4954)
1 month ago
8bc3027cb9
chore: revert eval view ui improvements (#4969)
1 month ago
851e67d7cd
feat(prompts): preserve function names when using glob patterns (#4927)
1 month ago
618e46d958
docs(openai): remove gpt-4.5-preview references after API deprecation (#5005)
1 month ago
79aa4b779a
chore(internals): remove python script result data type debug log (#4807)
1 month ago
e094e52c39
fix(redteam): find plugin assertion in strategy providers (#4981)
1 month ago
52bc3c62d5
test: fix flaky server tests (#4968)
1 month ago
f90fe98182
chore: remove redundant test comments (#4183)
3 months ago
a1caaebcb6
refactor: split test case loading from synthesis (#3004)
6 months ago
6c51a9456f
feat: opentelemetry tracing support (#4600)
2 months ago
3e5d31a14f
feat(config): add support for loading defaultTest from external files (#4720)
1 month ago
5982f28fe2
fix(cli): --filter-failing not working with custom providers (#4911)
1 month ago
a9c90f612b
test: add unit test for src/validators/redteam.ts (#4803)
1 month ago
2411342d24
chore: Improve telemetry delivery (#4655)
1 month ago
47b4f15856
chore: expose `deleteFromCache` to evict cache keys after fetch by providers (#3009)
6 months ago
6b486a3954
test: configure default globalConfig mock and logger mock (#2915)
7 months ago
6d1d880fc0
fix(schema): remove duplicate 'bias' entry in config-schema.json (#4773)
1 month ago
84201e54bf
chore(redteam): add centralized REDTEAM_DEFAULTS and maxConcurrency support (#4656)
2 months ago
026caa8577
chore: improve __metadata warning message and test coverage (#4842)
1 month ago
ff6b4c0b96
feat: enable WAL mode for SQLite (#4104)
2 months ago
34896542a2
feat: add maximum evaluation time limit with PROMPTFOO_MAX_EVAL_TIME_MS (#4322)
2 months ago
7d4b0c5a80
fix(cli): esm provider loading (#4915)
1 month ago
3e5d31a14f
feat(config): add support for loading defaultTest from external files (#4720)
1 month ago
02d4023164
feat: add Langfuse prompt label support with improved parsing (#4847)
1 month ago
58fd112507
chore: integrate knip for unused code detection and clean up codebase (#4464)
1 month ago
5dfa39ca8e
refactor: centralize readline utilities to fix Jest open handle issues (#4219)
3 months ago
58fd112507
chore: integrate knip for unused code detection and clean up codebase (#4464)
1 month ago
5f8300a269
test: add integrity check for generated-constants.ts (#4753)
1 month ago
2411342d24
chore: Improve telemetry delivery (#4655)
1 month ago
dd8e5b25ab
fix(google-sheets): replace hardcoded range with dynamic approach (#4822)
1 month ago
a08fc4e4da
feat(eval): track assertion tokens in token usage (#3551)
4 months ago
3e5d31a14f
feat(config): add support for loading defaultTest from external files (#4720)
1 month ago
58fd112507
chore: integrate knip for unused code detection and clean up codebase (#4464)
1 month ago
131c36dd4c
chore: add global env-file option to all commands recursively (#3969)
3 months ago
2411342d24
chore: Improve telemetry delivery (#4655)
1 month ago
ec4240a754
feat(redteam): Cloud-based plugin severity overrides (#4348)
2 months ago
8494c43824
chore: replace node-fetch with native fetch API (#1968)
10 months ago
6a4b257262
chore(providers/sagemaker): Improves validation of user-provided config (#4809)
1 month ago
7162bc3a9e
fix(sharing): Fix file outputs when sharing (#4698)
2 months ago
a938e4b52c
feat: optionally time out eval steps (#3765)
3 months ago
5aba1c3214
chore(telemetry): Identify to PostHog whether user is also cloud user (#4782)
1 month ago
6b486a3954
test: configure default globalConfig mock and logger mock (#2915)
7 months ago

Comments

Loading...