guildhouse-mcp/tasks/examples/capstone-cleanup.toml
Tyler King dffa821c38 feat: Guildhouse MCP server + multi-agent governed dev pipeline
MCP server connecting Claude Code to the Guildhouse ecosystem.

Tools (23):
  Forgejo: repos, files, branches, PRs, CI status
  Chronicle: events, epochs, verify, emit
  GSAP proxy: request_ac, check_operation, get_posture, delegate
  Capstone: agents, tenants
  Tasks: list, get, create, update_status, submit_for_review, submit_review

Multi-agent topology:
  Lead Agent — plans work, delegates, reviews proposals
  Worker Agents — implement in parallel, submit PRs with confidence
  Reviewer Agent — QA's worker output, submits review confidence
  Confidence Gate — auto-merge/flag/propose/reject based on combined score

Confidence thresholds (configurable):
  >= 85: auto-merge (no human needed)
  >= 70: flag-merge (merged, flagged for post-review)
  >= 50: proposal (Tyler reviews)
  < 50: reject (worker revises)

Reviewer can override: 'reject' always rejects, 'request_changes'
caps at propose.

Task definitions (TOML) with phased prompts, success criteria,
and delegation scopes.

Every tool call emits Chronicle MCP_TOOL_CALL (0x3020).
Every gate decision emits Chronicle events (0x4010-0x4013).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-06 03:24:01 -04:00

20 lines
776 B
TOML

[task]
id = "capstone-test-cleanup"
repo = "tking/capstone"
description = "Fix pre-existing Keycloak JWT auth test failures"
branch = "fix/auth-tests"
delegation_scope = "code:write,test:run"
chronicle_source = "did:web:guildhouse.dev/agent/worker-b"
[phases.1]
description = "Diagnose failing auth tests"
prompt = "Run pytest on apps/accounts/tests/ and identify the 4 pre-existing Keycloak JWT failures. Document root cause."
success_criteria = "Root cause documented for each failure"
estimated_confidence = 80
[phases.2]
description = "Fix test fixtures"
prompt = "Update test fixtures to match current Keycloak JWT format. Do not change production code."
depends_on = 1
success_criteria = "All 4 previously-failing tests pass, no regressions"
estimated_confidence = 70