Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Guangshuo Zang fb349c048f
feat(moderation): add guardrail checks and logging for moderation (#2624)
7 months ago
..
81122a40cd
test: improve type safety and resolve TypeScript errors (#1216)
1 year ago
fb349c048f
feat(moderation): add guardrail checks and logging for moderation (#2624)
7 months ago
23716ff4bf
feat: add --filter-errors-only parameter to `eval` (#2539)
7 months ago
8afa0ffe8b
chore: separate errors from assert failures (#2214)
8 months ago
47d53196a9
chore: suppress logger output in huggingface dataset tests
8 months ago
5bd6b98215
chore(tests): improve misc test setup and teardown (#2579)
7 months ago
cf69af04bd
fix(prompts): restore behavior that delays yaml parsing until after variable substitution (#2383)
8 months ago
bb148dc661
chore: bump groq-sdk from 0.11.0 to 0.12.0 (#2642)
7 months ago
d5b1130e26
ci(tests): separate unit and integration tests in CI pipeline (#1849)
10 months ago
2ac78ff2d2
feat(redteam): add Likert-based jailbreak strategy (#2614)
7 months ago
2315a3b950
chore: Add type for provider test response (#2567)
7 months ago
5bd6b98215
chore(tests): improve misc test setup and teardown (#2579)
7 months ago
5c4d389ce7
feat: Cloud Login (#1719)
11 months ago
5bd6b98215
chore(tests): improve misc test setup and teardown (#2579)
7 months ago
0665ec88d5
refactor: simplify node version check (#1794)
11 months ago
8e6f4eed10
chore(providers): support file:// syntax for Python providers (#1748)
11 months ago
d1a88804da
refactor(providers): centralize cost calculation logic (#1679)
11 months ago
547fcf4b8d
chore(redteam): improve iterative provider with test case grader (#2552)
7 months ago
1627d418bb
revert: refactor(evaluator): enhance variable resolution and prompt rendering" (#2386)
8 months ago
b64998a8ae
feat(fetch): add support for custom SSL certificates (#2591)
7 months ago
5c4d389ce7
feat: Cloud Login (#1719)
11 months ago
0ab5bdb2f2
feat(googleSheets): Add sheet identifier to Google Sheets URL for saving eval results (#2348)
8 months ago
73ba22af85
chore: Revert "chore(redteam): expose redteam run command and auto-share remote results" (#2613)
7 months ago
7ede615dab
fix(moderation): handle empty output to avoid false positives (#2508)
8 months ago
7ca7640a90
fix(cli): recommend npx if necessary (#2325)
8 months ago
8494c43824
chore: replace node-fetch with native fetch API (#1968)
10 months ago
cf129f5a82
Revert "feat: chunk results during share to handle large evals" (#2399)
8 months ago
5c4d389ce7
feat: Cloud Login (#1719)
11 months ago
a295d06008
feat: import tests from js/ts (#2635)
7 months ago
425556f034
fix(test): handle redteam config validation in TestSuiteConfigSchema
9 months ago
a693785072
chore(deps): update @swc/core to version 1.7.1 (#1285)
1 year ago

Comments

Loading...