promptfoo-evals

Write, refine, run, and QA promptfoo evaluation suites: promptfooconfig.yaml, prompts, providers, vars, tests, assertions, model-graded rubrics, transforms, datasets, exports, and CI gates. Use for non-redteam eval coverage, regression tests, or new eval matrices. Do not use for adversarial redteam plugin or strategy setup.
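
As a rough illustration of the pieces named above (prompts, providers, vars, tests, assertions, and a model-graded rubric), a minimal promptfooconfig.yaml might look like the sketch below. It is not taken from the skill itself, since this listing carries no body; the description, prompt text, provider ID, variable name, and assertion values are all assumptions chosen for illustration.

```yaml
# promptfooconfig.yaml (illustrative sketch; not part of the skill's source)
description: Summarization regression suite  # hypothetical suite name

prompts:
  # {{text}} is filled in from each test's vars
  - "Summarize the following text in one sentence: {{text}}"

providers:
  - openai:gpt-4o-mini  # assumed provider; substitute whichever you use

tests:
  - vars:
      text: "Promptfoo runs prompts against providers and checks the outputs with assertions."
    assert:
      # Deterministic check: output must mention the tool name (case-insensitive)
      - type: icontains
        value: promptfoo
      # Model-graded check: a grader model scores the output against this rubric
      - type: llm-rubric
        value: Is a single sentence that accurately summarizes the input
```

With a file like this in the working directory, the suite can typically be run with `npx promptfoo@latest eval`.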

Safety Notice

This listing is imported from SkillsMP metadata and should be treated as untrusted until upstream source review is completed.

Install skill "promptfoo-evals" with this command: npx skills add abhishek0010/skillsmp-abhishek0010-abhishek0010-promptfoo-evals

No markdown body

This source entry does not include full markdown content beyond metadata.

Source Transparency

This detail page is rendered from real SKILL.md content. Trust labels are metadata-based hints, not a safety guarantee.
