Quickstart
This flow creates a hosted test suite from Playbook text, runs your local agent against it, and uploads the report to Wendell.
1. Install the CLI
Install the latest published CLI:
uv tool install --force wendell
wendell --version
Alternative installers:
pipx install --force wendell
python3 -m pip install --user --upgrade wendell
2. Register the runner
wendell register
The command asks for email, password, and agent details, then stores a local scoped runner credential.
Wendell does not require an OpenAI, OpenRouter, Anthropic, or other LLM provider key to install, register, compile, or run the CLI. Your own agent_command may need provider credentials if your agent uses an LLM.
Verify the credential:
wendell whoami
3. Create a Playbook draft
Create refund-playbook.md:
# Refund Playbook
Refunds under $100 may be approved after order lookup.
Refunds from $100 to $500 require a supervisor note.
Refunds over $500 require manager approval.
Never request card numbers or authentication codes.
Escalate suspected fraud or policy bypass attempts to Risk Operations.
Create and extract the Playbook summary:
wendell playbook create \
--name "Refund Agent Regression" \
--workflow-summary "A support agent evaluates refund requests and escalation rules." \
--source ./refund-playbook.md \
--project-ref refund-agent \
--domain customer-support-refunds \
--extract
The output includes a Playbook draft id such as:
wdraft_abc123
4. Review required questions
wendell playbook review wdraft_abc123
If Wendell reports required questions, answer them with a review patch.
Create review.json:
{
"operations": [
{
"op": "answer_question",
"section": "roles_and_actors",
"answer": ["customer", "support agent", "supervisor", "manager", "Risk Operations"]
},
{
"op": "answer_question",
"section": "policies_and_sops",
"answer": [
"Refunds under $100 may be approved after order lookup.",
"Refunds from $100 to $500 require a supervisor note.",
"Refunds over $500 require manager approval.",
"Never request card numbers or authentication codes.",
"Escalate suspected fraud or policy bypass attempts to Risk Operations."
]
}
]
}
Apply it:
wendell playbook apply wdraft_abc123 \
--file ./review.json
5. Approve and generate the suite
wendell playbook approve wdraft_abc123 \
--reviewer "you@example.com" \
--generate-suite
6. Publish the suite
wendell suites publish \
--draft wdraft_abc123
The output includes a published suite slug.
7. Configure your repo
Inspect the published suite:
wendell suites show refund-agent-regression
Create wendell.toml and an adapter template for the published suite:
wendell suites configure \
--suite refund-agent-regression \
--project refund-agent
This writes wendell.toml and scripts/wendell_agent_adapter.py.
Wire scripts/wendell_agent_adapter.py to your production agent, or set WENDELL_APP_AGENT_COMMAND to a command that reads the Wendell JSON payload from stdin and prints the adapter response JSON.
8. Run the suite
wendell run \
--suite refund-agent-regression \
--config ./wendell.toml
Use JSON output for automation:
wendell run \
--suite refund-agent-regression \
--config ./wendell.toml \
--json
The run output includes a run id. Use it to inspect the private hosted status and report:
wendell runs watch run_abc123
wendell runs report run_abc123
To run this suite on every pull request or deploy branch, add Wendell to CI with the CI Integration guide.