🤖🤖🤖 Update quality-playbook skill to v1.4.0 + add agent #1402

Open
andrewstellman wants to merge 13 commits into github:staged from andrewstellman:quality-playbook-v1.4.0
andrewstellman (Contributor) commented Apr 15, 2026

Summary

  • Updates quality-playbook skill from v1.3.x to v1.5.6

Changes from 1.3.x to 1.4.2:

  • Adds agents/quality-playbook.agent.md — checks skill installation, explains phases, handles "keep going" / "help" / "what happened"
  • Six-phase interactive architecture with end-of-phase messages
  • Four iteration strategies (gap, unfiltered, parity, adversarial)
  • TDD red-green verification for every confirmed bug
  • Quality gate script for mechanical artifact validation
  • Moved ITERATION.md into references/ (installed via wildcard copy)

Updated to v1.5.6.
Changes since v1.4.2:

  • SKILL.md modernized: the skills bundle now includes phase_prompts/, agents/, and bin/citation_verifier.py per the v1.5.6 install contract; quality_gate.sh was retired in v1.4.5 and replaced by quality_gate.py.
  • references/ expanded from 12 to 16 files.
  • The top-level agents/quality-playbook.agent.md now installs from awesome-copilot's own skills/quality-playbook/ folder instead of pointing users to the upstream Quality Playbook repo, so adopters who discover the skill via awesome-copilot can install it without leaving the marketplace. Manual cp recipes are provided for all four supported AI tools (Copilot, Cursor, Claude Code, Continue). A pointer to the upstream repo is preserved as an alternative for users who want the full script-driven install UX.

Empirically validated against Cursor, Claude Code, and GitHub Copilot install flows on 2026-05-08.

See: https://www.oreilly.com/radar/ai-is-writing-our-code-faster-than-we-can-verify-it/

Copilot AI review requested due to automatic review settings April 15, 2026 14:34
github-actions Bot (Contributor) commented Apr 15, 2026

🔍 Skill Validator Results

⛔ Findings need attention

| Scope | Checked |
| --- | ---: |
| Skills | 145 |
| Agents | 1 |
| Total | 146 |

| Severity | Count |
| --- | ---: |
| ❌ Errors | 21 |
| ⚠️ Warnings | 136 |
| ℹ️ Advisories | 0 |

Summary

Level Finding
[geofeed-tuner] File reference './run/data/report-data.json' is 2 directories deep — maximum is 1 level from SKILL.md.
[geofeed-tuner] File reference 'assets/example/01-user-input-rfc8805-feed.csv' is 2 directories deep — maximum is 1 level from SKILL.md.
[geofeed-tuner] File reference './run/data/comments.json' is 2 directories deep — maximum is 1 level from SKILL.md.
[geofeed-tuner] File reference './run/data/report-data.json' is 2 directories deep — maximum is 1 level from SKILL.md.
[geofeed-tuner] File reference './run/data/mcp-server-payload.json' is 2 directories deep — maximum is 1 level from SKILL.md.
[geofeed-tuner] File reference './run/data/mcp-server-payload.json' is 2 directories deep — maximum is 1 level from SKILL.md.
[geofeed-tuner] File reference './run/data/report-data.json' is 2 directories deep — maximum is 1 level from SKILL.md.
[geofeed-tuner] File reference './run/data/report-data.json' is 2 directories deep — maximum is 1 level from SKILL.md.
Duplicate skill name 'azure-resource-health-diagnose' found in 'plugins/devops-oncall/skills/azure-resource-health-diagnose' (first seen in 'plugins/azure-cloud-development/skills/azure-resource-health-diagnose')
Duplicate skill name 'openapi-to-application-code' found in 'plugins/openapi-to-application-go/skills/openapi-to-application-code' (first seen in 'plugins/openapi-to-application-csharp-dotnet/skills/openapi-to-application-code')
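
The depth findings above can be reproduced with a short check. This is a minimal sketch and not the validator's actual implementation: the function names `reference_depth` and `is_too_deep` are hypothetical, and it assumes that "depth" means the number of directory components in a relative reference, with a leading `./` ignored. That reading is consistent with the messages above, where both `./run/data/report-data.json` and `assets/example/01-user-input-rfc8805-feed.csv` are reported as 2 directories deep.

```python
from pathlib import PurePosixPath

# Limit taken from the validator message: "maximum is 1 level from SKILL.md".
MAX_DEPTH = 1


def reference_depth(ref: str) -> int:
    """Count the directory components in a relative file reference.

    Hypothetical helper: assumes depth = number of path parts minus the file name.
    PurePosixPath collapses a leading "./", so it does not count as a level.
    """
    parts = PurePosixPath(ref).parts
    return max(len(parts) - 1, 0)


def is_too_deep(ref: str, max_depth: int = MAX_DEPTH) -> bool:
    """Return True when a reference sits deeper than the allowed limit."""
    return reference_depth(ref) > max_depth


if __name__ == "__main__":
    for ref in (
        "./run/data/report-data.json",
        "assets/example/01-user-input-rfc8805-feed.csv",
        "references/ITERATION.md",
    ):
        status = "too deep" if is_too_deep(ref) else "ok"
        print(f"{ref}: depth {reference_depth(ref)} ({status})")
```

Under these assumptions, a reference such as `references/ITERATION.md` stays within the limit, while the geofeed-tuner references would need to move up one directory level to pass.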
Full validator output

```text
Found 145 skill(s)
[quality-playbook] 📊 quality-playbook: 66,332 BPE tokens [chars/4: 73,092] (comprehensive ✗), 128 sections, 44 code blocks
[quality-playbook] ⚠ Skill is 66,332 BPE tokens (chars/4 estimate: 73,092) — "comprehensive" skills hurt performance by 2.9pp on average. Consider splitting into 2–3 focused skills.
[acreadiness-assess] 📊 acreadiness-assess: 808 BPE tokens [chars/4: 839] (detailed ✓), 3 sections, 1 code blocks
[acreadiness-generate-instructions] 📊 acreadiness-generate-instructions: 2,129 BPE tokens [chars/4: 2,104] (detailed ✓), 9 sections, 2 code blocks
[acreadiness-policy] 📊 acreadiness-policy: 1,005 BPE tokens [chars/4: 940] (detailed ✓), 11 sections, 4 code blocks
[ai-team-orchestration] 📊 ai-team-orchestration: 1,436 BPE tokens [chars/4: 1,400] (detailed ✓), 13 sections, 5 code blocks
[arize-ai-provider-integration] 📊 arize-ai-provider-integration: 2,632 BPE tokens [chars/4: 2,548] (standard ~), 29 sections, 16 code blocks
[arize-ai-provider-integration] ⚠ Skill is 2,632 BPE tokens (chars/4 estimate: 2,548) — approaching "comprehensive" range where gains diminish.
[arize-ai-provider-integration] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[arize-annotation] 📊 arize-annotation: 2,517 BPE tokens [chars/4: 2,689] (standard ~), 27 sections, 15 code blocks
[arize-annotation] ⚠ Skill is 2,517 BPE tokens (chars/4 estimate: 2,689) — approaching "comprehensive" range where gains diminish.
[arize-annotation] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[arize-dataset] 📊 arize-dataset: 3,818 BPE tokens [chars/4: 3,809] (standard ~), 51 sections, 16 code blocks
[arize-dataset] ⚠ Skill is 3,818 BPE tokens (chars/4 estimate: 3,809) — approaching "comprehensive" range where gains diminish.
[arize-evaluator] 📊 arize-evaluator: 7,814 BPE tokens [chars/4: 8,033] (comprehensive ✗), 59 sections, 28 code blocks
[arize-evaluator] ⚠ Skill is 7,814 BPE tokens (chars/4 estimate: 8,033) — "comprehensive" skills hurt performance by 2.9pp on average. Consider splitting into 2–3 focused skills.
[arize-experiment] 📊 arize-experiment: 4,589 BPE tokens [chars/4: 4,610] (standard ~), 34 sections, 20 code blocks
[arize-experiment] ⚠ Skill is 4,589 BPE tokens (chars/4 estimate: 4,610) — approaching "comprehensive" range where gains diminish.
[arize-instrumentation] 📊 arize-instrumentation: 4,935 BPE tokens [chars/4: 5,061] (standard ~), 19 sections, 3 code blocks
[arize-instrumentation] ⚠ Skill is 4,935 BPE tokens (chars/4 estimate: 5,061) — approaching "comprehensive" range where gains diminish.
[arize-link] 📊 arize-link: 1,203 BPE tokens [chars/4: 1,088] (detailed ✓), 9 sections, 6 code blocks
[arize-prompt-optimization] 📊 arize-prompt-optimization: 4,451 BPE tokens [chars/4: 4,762] (standard ~), 58 sections, 19 code blocks
[arize-prompt-optimization] ⚠ Skill is 4,451 BPE tokens (chars/4 estimate: 4,762) — approaching "comprehensive" range where gains diminish.
[arize-trace] 📊 arize-trace: 5,873 BPE tokens [chars/4: 5,831] (comprehensive ✗), 43 sections, 10 code blocks
[arize-trace] ⚠ Skill is 5,873 BPE tokens (chars/4 estimate: 5,831) — "comprehensive" skills hurt performance by 2.9pp on average. Consider splitting into 2–3 focused skills.
[automate-this] 📊 automate-this: 2,869 BPE tokens [chars/4: 3,107] (standard ~), 16 sections, 8 code blocks
[automate-this] ⚠ Skill is 2,869 BPE tokens (chars/4 estimate: 3,107) — approaching "comprehensive" range where gains diminish.
[suggest-awesome-github-copilot-agents] 📊 suggest-awesome-github-copilot-agents: 1,467 BPE tokens [chars/4: 1,979] (detailed ✓), 9 sections, 0 code blocks
[suggest-awesome-github-copilot-agents] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[suggest-awesome-github-copilot-instructions] 📊 suggest-awesome-github-copilot-instructions: 1,499 BPE tokens [chars/4: 1,904] (detailed ✓), 11 sections, 1 code blocks
[suggest-awesome-github-copilot-skills] 📊 suggest-awesome-github-copilot-skills: 1,725 BPE tokens [chars/4: 2,018] (detailed ✓), 11 sections, 1 code blocks
[az-cost-optimize] 📊 az-cost-optimize: 2,956 BPE tokens [chars/4: 3,218] (standard ~), 12 sections, 7 code blocks
[az-cost-optimize] ⚠ Skill is 2,956 BPE tokens (chars/4 estimate: 3,218) — approaching "comprehensive" range where gains diminish.
[azure-pricing] 📊 azure-pricing: 2,315 BPE tokens [chars/4: 2,371] (detailed ✓), 23 sections, 4 code blocks
[azure-resource-health-diagnose] 📊 azure-resource-health-diagnose: 2,492 BPE tokens [chars/4: 2,859] (detailed ✓), 12 sections, 8 code blocks
[import-infrastructure-as-code] 📊 import-infrastructure-as-code: 4,135 BPE tokens [chars/4: 4,680] (standard ~), 30 sections, 11 code blocks
[import-infrastructure-as-code] ⚠ Skill is 4,135 BPE tokens (chars/4 estimate: 4,680) — approaching "comprehensive" range where gains diminish.
[remember-interactive-programming] 📊 remember-interactive-programming: 175 BPE tokens [chars/4: 221] (compact ✓), 0 sections, 0 code blocks
[remember-interactive-programming] ⚠ Skill is only 175 BPE tokens (chars/4 estimate: 221) — may be too sparse to provide actionable guidance.
[remember-interactive-programming] ⚠ No section headers — agents navigate structured documents better.
[remember-interactive-programming] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[remember-interactive-programming] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[content-management-systems] 📊 content-management-systems: 1,040 BPE tokens [chars/4: 1,324] (detailed ✓), 13 sections, 0 code blocks
[content-management-systems] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[markdown-to-html] 📊 markdown-to-html: 6,711 BPE tokens [chars/4: 6,151] (comprehensive ✗), 100 sections, 50 code blocks
[markdown-to-html] ⚠ Skill is 6,711 BPE tokens (chars/4 estimate: 6,151) — "comprehensive" skills hurt performance by 2.9pp on average. Consider splitting into 2–3 focused skills.
[markdown-to-html] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[quasi-coder] 📊 quasi-coder: 3,116 BPE tokens [chars/4: 3,907] (standard ~), 37 sections, 11 code blocks
[quasi-coder] ⚠ Skill is 3,116 BPE tokens (chars/4 estimate: 3,907) — approaching "comprehensive" range where gains diminish.
[web-coder] 📊 web-coder: 4,784 BPE tokens [chars/4: 5,245] (standard ~), 47 sections, 10 code blocks
[web-coder] ⚠ Skill is 4,784 BPE tokens (chars/4 estimate: 5,245) — approaching "comprehensive" range where gains diminish.
[context-map] 📊 context-map: 249 BPE tokens [chars/4: 287] (compact ✓), 10 sections, 1 code blocks
[refactor-plan] 📊 refactor-plan: 356 BPE tokens [chars/4: 355] (compact ✓), 15 sections, 1 code blocks
[what-context-needed] 📊 what-context-needed: 209 BPE tokens [chars/4: 219] (compact ✓), 9 sections, 1 code blocks
[integrate-context-matic] 📊 integrate-context-matic: 1,496 BPE tokens [chars/4: 1,574] (detailed ✓), 12 sections, 0 code blocks
[integrate-context-matic] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[integrate-context-matic] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[onboard-context-matic] 📊 onboard-context-matic: 3,136 BPE tokens [chars/4: 3,361] (standard ~), 13 sections, 12 code blocks
[onboard-context-matic] ⚠ Skill is 3,136 BPE tokens (chars/4 estimate: 3,361) — approaching "comprehensive" range where gains diminish.
[copilot-sdk] 📊 copilot-sdk: 5,749 BPE tokens [chars/4: 6,042] (comprehensive ✗), 63 sections, 41 code blocks
[copilot-sdk] ⚠ Skill is 5,749 BPE tokens (chars/4 estimate: 6,042) — "comprehensive" skills hurt performance by 2.9pp on average. Consider splitting into 2–3 focused skills.
[aspnet-minimal-api-openapi] 📊 aspnet-minimal-api-openapi: 343 BPE tokens [chars/4: 439] (compact ✓), 5 sections, 0 code blocks
[aspnet-minimal-api-openapi] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[aspnet-minimal-api-openapi] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[csharp-async] 📊 csharp-async: 379 BPE tokens [chars/4: 466] (compact ✓), 7 sections, 0 code blocks
[csharp-async] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[csharp-async] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[csharp-mstest] 📊 csharp-mstest: 3,249 BPE tokens [chars/4: 3,458] (standard ~), 31 sections, 22 code blocks
[csharp-mstest] ⚠ Skill is 3,249 BPE tokens (chars/4 estimate: 3,458) — approaching "comprehensive" range where gains diminish.
[csharp-nunit] 📊 csharp-nunit: 636 BPE tokens [chars/4: 711] (detailed ✓), 8 sections, 0 code blocks
[csharp-nunit] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[csharp-nunit] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[csharp-tunit] 📊 csharp-tunit: 1,164 BPE tokens [chars/4: 1,287] (detailed ✓), 10 sections, 0 code blocks
[csharp-tunit] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[csharp-tunit] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[csharp-xunit] 📊 csharp-xunit: 599 BPE tokens [chars/4: 677] (detailed ✓), 8 sections, 0 code blocks
[csharp-xunit] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[csharp-xunit] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[dotnet-best-practices] 📊 dotnet-best-practices: 602 BPE tokens [chars/4: 808] (detailed ✓), 12 sections, 0 code blocks
[dotnet-best-practices] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[dotnet-best-practices] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[dotnet-upgrade] 📊 dotnet-upgrade: 1,354 BPE tokens [chars/4: 1,768] (detailed ✓), 1 sections, 0 code blocks
[dotnet-upgrade] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[dotnet-upgrade] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[csharp-mcp-server-generator] 📊 csharp-mcp-server-generator: 459 BPE tokens [chars/4: 560] (detailed ✓), 9 sections, 0 code blocks
[csharp-mcp-server-generator] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[postgresql-code-review] 📊 postgresql-code-review: 1,646 BPE tokens [chars/4: 1,847] (detailed ✓), 21 sections, 8 code blocks
[postgresql-optimization] 📊 postgresql-optimization: 2,651 BPE tokens [chars/4: 2,834] (standard ~), 36 sections, 20 code blocks
[postgresql-optimization] ⚠ Skill is 2,651 BPE tokens (chars/4 estimate: 2,834) — approaching "comprehensive" range where gains diminish.
[postgresql-optimization] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[sql-code-review] 📊 sql-code-review: 2,112 BPE tokens [chars/4: 2,220] (detailed ✓), 36 sections, 15 code blocks
[sql-optimization] 📊 sql-optimization: 2,212 BPE tokens [chars/4: 2,298] (detailed ✓), 28 sections, 14 code blocks
[dataverse-python-advanced-patterns] 📊 dataverse-python-advanced-patterns: 242 BPE tokens [chars/4: 291] (compact ✓), 0 sections, 0 code blocks
[dataverse-python-advanced-patterns] ⚠ No section headers — agents navigate structured documents better.
[dataverse-python-advanced-patterns] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[dataverse-python-production-code] 📊 dataverse-python-production-code: 813 BPE tokens [chars/4: 913] (detailed ✓), 9 sections, 3 code blocks
[dataverse-python-quickstart] 📊 dataverse-python-quickstart: 133 BPE tokens [chars/4: 163] (compact ✓), 0 sections, 0 code blocks
[dataverse-python-quickstart] ⚠ Skill is only 133 BPE tokens (chars/4 estimate: 163) — may be too sparse to provide actionable guidance.
[dataverse-python-quickstart] ⚠ No section headers — agents navigate structured documents better.
[dataverse-python-quickstart] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[dataverse-python-quickstart] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[dataverse-python-usecase-builder] 📊 dataverse-python-usecase-builder: 1,538 BPE tokens [chars/4: 1,684] (detailed ✓), 38 sections, 5 code blocks
[azure-resource-health-diagnose] 📊 azure-resource-health-diagnose: 2,492 BPE tokens [chars/4: 2,859] (detailed ✓), 12 sections, 8 code blocks
[multi-stage-dockerfile] 📊 multi-stage-dockerfile: 443 BPE tokens [chars/4: 577] (detailed ✓), 5 sections, 0 code blocks
[multi-stage-dockerfile] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[multi-stage-dockerfile] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[doublecheck] 📊 doublecheck: 3,254 BPE tokens [chars/4: 3,973] (standard ~), 26 sections, 3 code blocks
[doublecheck] ⚠ Skill is 3,254 BPE tokens (chars/4 estimate: 3,973) — approaching "comprehensive" range where gains diminish.
[from-the-other-side-vega] 📊 from-the-other-side-vega: 820 BPE tokens [chars/4: 894] (detailed ✓), 9 sections, 0 code blocks
[from-the-other-side-vega] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[from-the-other-side-vega] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[eyeball] 📊 eyeball: 1,486 BPE tokens [chars/4: 1,637] (detailed ✓), 13 sections, 11 code blocks
[geofeed-tuner] 📊 geofeed-tuner: 11,662 BPE tokens [chars/4: 12,215] (comprehensive ✗), 35 sections, 15 code blocks
❌ [geofeed-tuner] File reference './run/data/report-data.json' is 2 directories deep — maximum is 1 level from SKILL.md.
❌ [geofeed-tuner] File reference 'assets/example/01-user-input-rfc8805-feed.csv' is 2 directories deep — maximum is 1 level from SKILL.md.
❌ [geofeed-tuner] File reference './run/data/comments.json' is 2 directories deep — maximum is 1 level from SKILL.md.
❌ [geofeed-tuner] File reference './run/data/report-data.json' is 2 directories deep — maximum is 1 level from SKILL.md.
❌ [geofeed-tuner] File reference './run/data/mcp-server-payload.json' is 2 directories deep — maximum is 1 level from SKILL.md.
❌ [geofeed-tuner] File reference './run/data/mcp-server-payload.json' is 2 directories deep — maximum is 1 level from SKILL.md.
❌ [geofeed-tuner] File reference './run/data/report-data.json' is 2 directories deep — maximum is 1 level from SKILL.md.
❌ [geofeed-tuner] File reference './run/data/report-data.json' is 2 directories deep — maximum is 1 level from SKILL.md.
[geofeed-tuner] ⚠ Skill is 11,662 BPE tokens (chars/4 estimate: 12,215) — "comprehensive" skills hurt performance by 2.9pp on average. Consider splitting into 2–3 focused skills.
[flowstudio-power-automate-build] 📊 flowstudio-power-automate-build: 5,040 BPE tokens [chars/4: 5,168] (comprehensive ✗), 46 sections, 17 code blocks
[flowstudio-power-automate-build] ⚠ Skill is 5,040 BPE tokens (chars/4 estimate: 5,168) — "comprehensive" skills hurt performance by 2.9pp on average. Consider splitting into 2–3 focused skills.
[flowstudio-power-automate-build] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[flowstudio-power-automate-debug] 📊 flowstudio-power-automate-debug: 4,320 BPE tokens [chars/4: 4,264] (standard ~), 65 sections, 15 code blocks
[flowstudio-power-automate-debug] ⚠ Skill is 4,320 BPE tokens (chars/4 estimate: 4,264) — approaching "comprehensive" range where gains diminish.
[flowstudio-power-automate-governance] 📊 flowstudio-power-automate-governance: 5,414 BPE tokens [chars/4: 5,786] (comprehensive ✗), 17 sections, 11 code blocks
[flowstudio-power-automate-governance] ⚠ Skill is 5,414 BPE tokens (chars/4 estimate: 5,786) — "comprehensive" skills hurt performance by 2.9pp on average. Consider splitting into 2–3 focused skills.
[flowstudio-power-automate-mcp] 📊 flowstudio-power-automate-mcp: 3,243 BPE tokens [chars/4: 3,176] (standard ~), 19 sections, 9 code blocks
[flowstudio-power-automate-mcp] ⚠ Skill is 3,243 BPE tokens (chars/4 estimate: 3,176) — approaching "comprehensive" range where gains diminish.
[flowstudio-power-automate-monitoring] 📊 flowstudio-power-automate-monitoring: 3,809 BPE tokens [chars/4: 3,609] (standard ~), 25 sections, 14 code blocks
[flowstudio-power-automate-monitoring] ⚠ Skill is 3,809 BPE tokens (chars/4 estimate: 3,609) — approaching "comprehensive" range where gains diminish.
[playwright-explore-website] 📊 playwright-explore-website: 140 BPE tokens [chars/4: 171] (compact ✓), 2 sections, 0 code blocks
[playwright-explore-website] ⚠ Skill is only 140 BPE tokens (chars/4 estimate: 171) — may be too sparse to provide actionable guidance.
[playwright-explore-website] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[playwright-generate-test] 📊 playwright-generate-test: 178 BPE tokens [chars/4: 220] (compact ✓), 2 sections, 0 code blocks
[playwright-generate-test] ⚠ Skill is only 178 BPE tokens (chars/4 estimate: 220) — may be too sparse to provide actionable guidance.
[playwright-generate-test] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[playwright-generate-test] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[go-mcp-server-generator] 📊 go-mcp-server-generator: 1,740 BPE tokens [chars/4: 1,759] (detailed ✓), 21 sections, 8 code blocks
[create-spring-boot-java-project] 📊 create-spring-boot-java-project: 1,092 BPE tokens [chars/4: 1,138] (detailed ✓), 17 sections, 11 code blocks
[create-spring-boot-java-project] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[java-docs] 📊 java-docs: 316 BPE tokens [chars/4: 360] (compact ✓), 1 sections, 0 code blocks
[java-docs] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[java-docs] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[java-junit] 📊 java-junit: 695 BPE tokens [chars/4: 746] (detailed ✓), 8 sections, 0 code blocks
[java-junit] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[java-junit] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[java-springboot] 📊 java-springboot: 855 BPE tokens [chars/4: 1,034] (detailed ✓), 10 sections, 0 code blocks
[java-springboot] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[java-springboot] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[java-mcp-server-generator] 📊 java-mcp-server-generator: 4,637 BPE tokens [chars/4: 5,506] (standard ~), 28 sections, 19 code blocks
[java-mcp-server-generator] ⚠ Skill is 4,637 BPE tokens (chars/4 estimate: 5,506) — approaching "comprehensive" range where gains diminish.
[kotlin-mcp-server-generator] 📊 kotlin-mcp-server-generator: 2,456 BPE tokens [chars/4: 2,672] (detailed ✓), 29 sections, 13 code blocks
[mcp-create-adaptive-cards] 📊 mcp-create-adaptive-cards: 2,949 BPE tokens [chars/4: 3,136] (standard ~), 33 sections, 22 code blocks
[mcp-create-adaptive-cards] ⚠ Skill is 2,949 BPE tokens (chars/4 estimate: 3,136) — approaching "comprehensive" range where gains diminish.
[mcp-create-declarative-agent] 📊 mcp-create-declarative-agent: 1,978 BPE tokens [chars/4: 2,036] (detailed ✓), 30 sections, 12 code blocks
[mcp-deploy-manage-agents] 📊 mcp-deploy-manage-agents: 2,036 BPE tokens [chars/4: 2,471] (detailed ✓), 49 sections, 2 code blocks
[napkin] 📊 napkin: 1,649 BPE tokens [chars/4: 1,784] (detailed ✓), 12 sections, 4 code blocks
[noob-mode] 📊 noob-mode: 2,604 BPE tokens [chars/4: 2,765] (standard ~), 13 sections, 9 code blocks
[noob-mode] ⚠ Skill is 2,604 BPE tokens (chars/4 estimate: 2,765) — approaching "comprehensive" range where gains diminish.
[openapi-to-application-code] 📊 openapi-to-application-code: 931 BPE tokens [chars/4: 1,144] (detailed ✓), 11 sections, 1 code blocks
[openapi-to-application-code] 📊 openapi-to-application-code: 931 BPE tokens [chars/4: 1,144] (detailed ✓), 11 sections, 1 code blocks
[openapi-to-application-code] 📊 openapi-to-application-code: 931 BPE tokens [chars/4: 1,144] (detailed ✓), 11 sections, 1 code blocks
[openapi-to-application-code] 📊 openapi-to-application-code: 931 BPE tokens [chars/4: 1,144] (detailed ✓), 11 sections, 1 code blocks
[openapi-to-application-code] 📊 openapi-to-application-code: 931 BPE tokens [chars/4: 1,144] (detailed ✓), 11 sections, 1 code blocks
[creating-oracle-to-postgres-master-migration-plan] 📊 creating-oracle-to-postgres-master-migration-plan: 755 BPE tokens [chars/4: 790] (detailed ✓), 6 sections, 2 code blocks
[creating-oracle-to-postgres-migration-bug-report] 📊 creating-oracle-to-postgres-migration-bug-report: 473 BPE tokens [chars/4: 567] (detailed ✓), 6 sections, 0 code blocks
[creating-oracle-to-postgres-migration-bug-report] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[creating-oracle-to-postgres-migration-bug-report] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[creating-oracle-to-postgres-migration-integration-tests] 📊 creating-oracle-to-postgres-migration-integration-tests: 605 BPE tokens [chars/4: 760] (detailed ✓), 4 sections, 1 code blocks
[creating-oracle-to-postgres-migration-integration-tests] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[migrating-oracle-to-postgres-stored-procedures] 📊 migrating-oracle-to-postgres-stored-procedures: 493 BPE tokens [chars/4: 576] (detailed ✓), 2 sections, 1 code blocks
[migrating-oracle-to-postgres-stored-procedures] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[planning-oracle-to-postgres-migration-integration-testing] 📊 planning-oracle-to-postgres-migration-integration-testing: 438 BPE tokens [chars/4: 547] (detailed ✓), 4 sections, 1 code blocks
[planning-oracle-to-postgres-migration-integration-testing] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[reviewing-oracle-to-postgres-migration] 📊 reviewing-oracle-to-postgres-migration: 605 BPE tokens [chars/4: 762] (detailed ✓), 5 sections, 2 code blocks
[scaffolding-oracle-to-postgres-migration-test-project] 📊 scaffolding-oracle-to-postgres-migration-test-project: 527 BPE tokens [chars/4: 632] (detailed ✓), 3 sections, 1 code blocks
[scaffolding-oracle-to-postgres-migration-test-project] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[sponsor-finder] 📊 sponsor-finder: 3,085 BPE tokens [chars/4: 2,991] (standard ~), 25 sections, 6 code blocks
[sponsor-finder] ⚠ Skill is 3,085 BPE tokens (chars/4 estimate: 2,991) — approaching "comprehensive" range where gains diminish.
[phoenix-cli] 📊 phoenix-cli: 2,778 BPE tokens [chars/4: 2,764] (standard ~), 19 sections, 15 code blocks
[phoenix-cli] ⚠ Skill is 2,778 BPE tokens (chars/4 estimate: 2,764) — approaching "comprehensive" range where gains diminish.
[phoenix-cli] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[phoenix-evals] 📊 phoenix-evals: 1,089 BPE tokens [chars/4: 1,126] (detailed ✓), 5 sections, 0 code blocks
[phoenix-evals] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[phoenix-evals] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[phoenix-tracing] 📊 phoenix-tracing: 1,488 BPE tokens [chars/4: 1,690] (detailed ✓), 17 sections, 1 code blocks
[php-mcp-server-generator] 📊 php-mcp-server-generator: 2,723 BPE tokens [chars/4: 2,697] (standard ~), 37 sections, 25 code blocks
[php-mcp-server-generator] ⚠ Skill is 2,723 BPE tokens (chars/4 estimate: 2,697) — approaching "comprehensive" range where gains diminish.
[polyglot-test-agent] 📊 polyglot-test-agent: 1,611 BPE tokens [chars/4: 1,676] (detailed ✓), 22 sections, 5 code blocks
[power-apps-code-app-scaffold] 📊 power-apps-code-app-scaffold: 1,424 BPE tokens [chars/4: 1,786] (detailed ✓), 18 sections, 2 code blocks
[power-bi-dax-optimization] 📊 power-bi-dax-optimization: 1,114 BPE tokens [chars/4: 1,390] (detailed ✓), 18 sections, 6 code blocks
[power-bi-model-design-review] 📊 power-bi-model-design-review: 2,126 BPE tokens [chars/4: 2,782] (detailed ✓), 28 sections, 17 code blocks
[power-bi-performance-troubleshooting] 📊 power-bi-performance-troubleshooting: 2,078 BPE tokens [chars/4: 2,591] (detailed ✓), 31 sections, 19 code blocks
[power-bi-report-design-consultation] 📊 power-bi-report-design-consultation: 2,357 BPE tokens [chars/4: 2,723] (detailed ✓), 22 sections, 13 code blocks
[power-platform-architect] 📊 power-platform-architect: 3,104 BPE tokens [chars/4: 3,609] (standard ~), 12 sections, 1 code blocks
[power-platform-architect] ⚠ Skill is 3,104 BPE tokens (chars/4 estimate: 3,609) — approaching "comprehensive" range where gains diminish.
[mcp-copilot-studio-server-generator] 📊 mcp-copilot-studio-server-generator: 942 BPE tokens [chars/4: 1,063] (detailed ✓), 6 sections, 2 code blocks
[power-platform-mcp-connector-suite] 📊 power-platform-mcp-connector-suite: 1,348 BPE tokens [chars/4: 1,590] (detailed ✓), 19 sections, 1 code blocks
[power-platform-mcp-connector-suite] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[drawio] 📊 drawio: 920 BPE tokens [chars/4: 805] (detailed ✓), 13 sections, 3 code blocks
[md-to-docx] 📊 md-to-docx: 676 BPE tokens [chars/4: 688] (detailed ✓), 9 sections, 3 code blocks
[md-to-docx] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[breakdown-epic-arch] 📊 breakdown-epic-arch: 601 BPE tokens [chars/4: 712] (detailed ✓), 12 sections, 0 code blocks
[breakdown-epic-arch] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[breakdown-epic-arch] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[breakdown-epic-pm] 📊 breakdown-epic-pm: 509 BPE tokens [chars/4: 575] (detailed ✓), 13 sections, 0 code blocks
[breakdown-epic-pm] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[breakdown-epic-pm] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[breakdown-feature-implementation] 📊 breakdown-feature-implementation: 966 BPE tokens [chars/4: 1,189] (detailed ✓), 9 sections, 2 code blocks
[breakdown-feature-implementation] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[breakdown-feature-prd] 📊 breakdown-feature-prd: 557 BPE tokens [chars/4: 620] (detailed ✓), 13 sections, 0 code blocks
[breakdown-feature-prd] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[breakdown-feature-prd] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[create-github-issues-feature-from-implementation-plan] 📊 create-github-issues-feature-from-implementation-plan: 191 BPE tokens [chars/4: 237] (compact ✓), 4 sections, 0 code blocks
[create-github-issues-feature-from-implementation-plan] ⚠ Skill is only 191 BPE tokens (chars/4 estimate: 237) — may be too sparse to provide actionable guidance.
[create-github-issues-feature-from-implementation-plan] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[create-implementation-plan] 📊 create-implementation-plan: 1,390 BPE tokens [chars/4: 1,575] (detailed ✓), 22 sections, 1 code blocks
[create-implementation-plan] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[create-technical-spike] 📊 create-technical-spike: 1,358 BPE tokens [chars/4: 1,643] (detailed ✓), 33 sections, 1 code blocks
[update-implementation-plan] 📊 update-implementation-plan: 1,400 BPE tokens [chars/4: 1,595] (detailed ✓), 22 sections, 1 code blocks
[update-implementation-plan] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[python-mcp-server-generator] 📊 python-mcp-server-generator: 871 BPE tokens [chars/4: 990] (detailed ✓), 13 sections, 0 code blocks
[python-mcp-server-generator] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[react-audit-grep-patterns] 📊 react-audit-grep-patterns: 475 BPE tokens [chars/4: 461] (detailed ✓), 13 sections, 1 code blocks
[react-audit-grep-patterns] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[react18-batching-patterns] 📊 react18-batching-patterns: 561 BPE tokens [chars/4: 621] (detailed ✓), 4 sections, 1 code blocks
[react18-batching-patterns] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[react18-dep-compatibility] 📊 react18-dep-compatibility: 1,388 BPE tokens [chars/4: 1,124] (detailed ✓), 10 sections, 1 code blocks
[react18-dep-compatibility] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[react18-enzyme-to-rtl] 📊 react18-enzyme-to-rtl: 870 BPE tokens [chars/4: 917] (detailed ✓), 6 sections, 3 code blocks
[react18-legacy-context] 📊 react18-legacy-context: 561 BPE tokens [chars/4: 620] (detailed ✓), 8 sections, 2 code blocks
[react18-lifecycle-patterns] 📊 react18-lifecycle-patterns: 768 BPE tokens [chars/4: 910] (detailed ✓), 7 sections, 1 code blocks
[react18-lifecycle-patterns] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[react18-string-refs] 📊 react18-string-refs: 549 BPE tokens [chars/4: 527] (detailed ✓), 6 sections, 1 code blocks
[react19-concurrent-patterns] 📊 react19-concurrent-patterns: 681 BPE tokens [chars/4: 722] (detailed ✓), 9 sections, 5 code blocks
[react19-source-patterns] 📊 react19-source-patterns: 462 BPE tokens [chars/4: 489] (detailed ✓), 4 sections, 1 code blocks
[react19-source-patterns] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[react19-test-patterns] 📊 react19-test-patterns: 815 BPE tokens [chars/4: 790] (detailed ✓), 7 sections, 5 code blocks
[roundup] 📊 roundup: 2,219 BPE tokens [chars/4: 2,591] (detailed ✓), 28 sections, 0 code blocks
[roundup] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[roundup] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[roundup-setup] 📊 roundup-setup: 3,202 BPE tokens [chars/4: 3,669] (standard ~), 18 sections, 1 code blocks
[roundup-setup] ⚠ Skill is 3,202 BPE tokens (chars/4 estimate: 3,669) — approaching "comprehensive" range where gains diminish.
[ruby-mcp-server-generator] 📊 ruby-mcp-server-generator: 3,459 BPE tokens [chars/4: 3,631] (standard ~), 37 sections, 24 code blocks
[ruby-mcp-server-generator] ⚠ Skill is 3,459 BPE tokens (chars/4 estimate: 3,631) — approaching "comprehensive" range where gains diminish.
[rust-mcp-server-generator] 📊 rust-mcp-server-generator: 3,088 BPE tokens [chars/4: 3,190] (standard ~), 31 sections, 24 code blocks
[rust-mcp-server-generator] ⚠ Skill is 3,088 BPE tokens (chars/4 estimate: 3,190) — approaching "comprehensive" range where gains diminish.
[salesforce-apex-quality] 📊 salesforce-apex-quality: 1,682 BPE tokens [chars/4: 1,820] (detailed ✓), 14 sections, 4 code blocks
[salesforce-apex-quality] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[salesforce-component-standards] 📊 salesforce-component-standards: 2,300 BPE tokens [chars/4: 2,448] (detailed ✓), 20 sections, 4 code blocks
[salesforce-component-standards] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[salesforce-flow-design] 📊 salesforce-flow-design: 1,491 BPE tokens [chars/4: 1,740] (detailed ✓), 15 sections, 5 code blocks
[salesforce-flow-design] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[ai-prompt-engineering-safety-review] 📊 ai-prompt-engineering-safety-review: 2,185 BPE tokens [chars/4: 2,540] (detailed ✓), 19 sections, 0 code blocks
[ai-prompt-engineering-safety-review] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[structured-autonomy-generate] 📊 structured-autonomy-generate: 921 BPE tokens [chars/4: 1,033] (detailed ✓), 10 sections, 3 code blocks
[structured-autonomy-implement] 📊 structured-autonomy-implement: 209 BPE tokens [chars/4: 272] (compact ✓), 0 sections, 0 code blocks
[structured-autonomy-implement] ⚠ No section headers — agents navigate structured documents better.
[structured-autonomy-implement] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[structured-autonomy-implement] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[structured-autonomy-plan] 📊 structured-autonomy-plan: 677 BPE tokens [chars/4: 759] (detailed ✓), 9 sections, 1 code blocks
[swift-mcp-server-generator] 📊 swift-mcp-server-generator: 3,781 BPE tokens [chars/4: 4,374] (standard ~), 30 sections, 18 code blocks
[swift-mcp-server-generator] ⚠ Skill is 3,781 BPE tokens (chars/4 estimate: 4,374) — approaching "comprehensive" range where gains diminish.
[create-technical-spike] 📊 create-technical-spike: 1,358 BPE tokens [chars/4: 1,643] (detailed ✓), 33 sections, 1 code blocks
[ai-prompt-engineering-safety-review] 📊 ai-prompt-engineering-safety-review: 2,185 BPE tokens [chars/4: 2,540] (detailed ✓), 19 sections, 0 code blocks
[ai-prompt-engineering-safety-review] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[csharp-nunit] 📊 csharp-nunit: 636 BPE tokens [chars/4: 711] (detailed ✓), 8 sections, 0 code blocks
[csharp-nunit] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[csharp-nunit] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[java-junit] 📊 java-junit: 695 BPE tokens [chars/4: 746] (detailed ✓), 8 sections, 0 code blocks
[java-junit] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[java-junit] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[playwright-explore-website] 📊 playwright-explore-website: 140 BPE tokens [chars/4: 171] (compact ✓), 2 sections, 0 code blocks
[playwright-explore-website] ⚠ Skill is only 140 BPE tokens (chars/4 estimate: 171) — may be too sparse to provide actionable guidance.
[playwright-explore-website] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[playwright-generate-test] 📊 playwright-generate-test: 178 BPE tokens [chars/4: 220] (compact ✓), 2 sections, 0 code blocks
[playwright-generate-test] ⚠ Skill is only 178 BPE tokens (chars/4 estimate: 220) — may be too sparse to provide actionable guidance.
[playwright-generate-test] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[playwright-generate-test] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[typescript-mcp-server-generator] 📊 typescript-mcp-server-generator: 759 BPE tokens [chars/4: 925] (detailed ✓), 12 sections, 0 code blocks
[typescript-mcp-server-generator] ⚠ No code blocks — agents perform better with concrete snippets and commands.
[typespec-api-operations] 📊 typespec-api-operations: 2,327 BPE tokens [chars/4: 2,287] (detailed ✓), 35 sections, 16 code blocks
[typespec-api-operations] ⚠ No numbered workflow steps — agents follow sequenced procedures more reliably.
[typespec-create-agent] 📊 typespec-create-agent: 672 BPE tokens [chars/4: 737] (detailed ✓), 5 sections, 1 code blocks
[typespec-create-api-plugin] 📊 typespec-create-api-plugin: 979 BPE tokens [chars/4: 1,013] (detailed ✓), 11 sections, 9 code blocks
[winui3-migration-guide] 📊 winui3-migration-guide: 2,196 BPE tokens [chars/4: 2,226] (detailed ✓), 18 sections, 13 code blocks
Skill spec conformance failures — fix the errors above.
❌ Duplicate skill name 'azure-resource-health-diagnose' found in 'plugins/devops-oncall/skills/azure-resource-health-diagnose' (first seen in 'plugins/azure-cloud-development/skills/azure-resource-health-diagnose')
❌ Duplicate skill name 'openapi-to-application-code' found in 'plugins/openapi-to-application-go/skills/openapi-to-application-code' (first seen in 'plugins/openapi-to-application-csharp-dotnet/skills/openapi-to-application-code')
❌ Duplicate skill name 'openapi-to-application-code' found in 'plugins/openapi-to-application-java-spring-boot/skills/openapi-to-application-code' (first seen in 'plugins/openapi-to-application-csharp-dotnet/skills/openapi-to-application-code')
❌ Duplicate skill name 'openapi-to-application-code' found in 'plugins/openapi-to-application-nodejs-nestjs/skills/openapi-to-application-code' (first seen in 'plugins/openapi-to-application-csharp-dotnet/skills/openapi-to-application-code')
❌ Duplicate skill name 'openapi-to-application-code' found in 'plugins/openapi-to-application-python-fastapi/skills/openapi-to-application-code' (first seen in 'plugins/openapi-to-application-csharp-dotnet/skills/openapi-to-application-code')
❌ Duplicate skill name 'create-technical-spike' found in 'plugins/technical-spike/skills/create-technical-spike' (first seen in 'plugins/project-planning/skills/create-technical-spike')
❌ Duplicate skill name 'ai-prompt-engineering-safety-review' found in 'plugins/testing-automation/skills/ai-prompt-engineering-safety-review' (first seen in 'plugins/security-best-practices/skills/ai-prompt-engineering-safety-review')
❌ Duplicate skill name 'csharp-nunit' found in 'plugins/testing-automation/skills/csharp-nunit' (first seen in 'plugins/csharp-dotnet-development/skills/csharp-nunit')
❌ Duplicate skill name 'java-junit' found in 'plugins/testing-automation/skills/java-junit' (first seen in 'plugins/java-development/skills/java-junit')
❌ Duplicate skill name 'playwright-explore-website' found in 'plugins/testing-automation/skills/playwright-explore-website' (first seen in 'plugins/frontend-web-dev/skills/playwright-explore-website')
❌ Duplicate skill name 'playwright-generate-test' found in 'plugins/testing-automation/skills/playwright-generate-test' (first seen in 'plugins/frontend-web-dev/skills/playwright-generate-test')
Found 1 agent(s)
❌ [agent:Quality Playbook] Agent name 'Quality Playbook' does not match filename 'quality-playbook.agent.md' (expected 'Quality Playbook.agent.md').
❌ [agent:Quality Playbook] Agent name 'Quality Playbook' contains invalid characters — must be lowercase alphanumeric and hyphens only.
Validated 1 agent(s)

Agent spec conformance failures — fix the errors above.

</details>
> **Note:** The validator returned a non-zero exit code. Please review the findings above before merge.
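The two error classes in the validator output (a skill name reused across plugins, and an agent name that is neither lowercase-hyphenated nor matching its filename) can be sketched as below. This is only an illustration reconstructed from the error text, not the validator's actual code: `find_duplicate_skill_names` and `agent_name_errors` are hypothetical helpers, the regex is an assumption, and treating the folder name as the skill name is also an assumption.

```python
import re
from pathlib import PurePosixPath

# Assumed rule, inferred from the error text:
# "must be lowercase alphanumeric and hyphens only".
NAME_RE = re.compile(r"^[a-z0-9-]+$")


def find_duplicate_skill_names(skill_dirs):
    """Return (name, duplicate_path, first_seen_path) for each repeated name."""
    first_seen = {}
    duplicates = []
    for path in skill_dirs:
        # Assumption: the skill name equals the last path component.
        name = PurePosixPath(path).name
        if name in first_seen:
            duplicates.append((name, path, first_seen[name]))
        else:
            first_seen[name] = path
    return duplicates


def agent_name_errors(agent_name, filename):
    """Return the list of naming problems for one agent file."""
    errors = []
    if not NAME_RE.match(agent_name):
        errors.append("invalid characters")
    if filename != f"{agent_name}.agent.md":
        errors.append("name does not match filename")
    return errors
```

Under these assumptions, declaring the agent name as `quality-playbook` would satisfy both agent checks for `quality-playbook.agent.md`.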

@andrewstellman andrewstellman force-pushed the quality-playbook-v1.4.0 branch from a2b9013 to 57eae8d April 15, 2026 14:42
Contributor

Copilot AI left a comment

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

andrewstellman and others added 6 commits April 15, 2026 11:03
- Recheck mode: say "recheck" after fixing bugs to verify fixes without
  re-running the full pipeline (2-10 min vs 60-90 min)
- Fixed 19 bugs from bootstrap self-audit: eval injection in
  quality_gate.sh, bash 3.2 empty array crashes, required artifacts
  downgraded to WARN, json_key_count false positives, missing artifact
  checks, documentation inconsistencies
- quality_gate.sh: integration-results.json validation depth parity,
  #### heading detection, functional test alternative name patterns

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
25 bug fixes from Sonnet 4.6 self-audit (3 HIGH, 8 MEDIUM, 14 LOW):
- Nullglob-safe artifact detection (find replaces ls-glob)
- Severity-prefixed bug ID support (BUG-H1, BUG-M3, BUG-L6)
- TDD sidecar-to-log cross-validation
- Recheck-results.json gate validation
- Phase 5 entry gate, SEED_CHECKS.md in artifact contract table
- Run metadata JSON spec for multi-model comparison
- Canonical enum fixes in quality_gate.sh and references

All 25 fixes verified by Sonnet recheck (25/25 FIXED).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
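The severity-prefixed bug IDs mentioned in this commit (BUG-H1, BUG-M3, BUG-L6) could be recognized alongside plain numeric IDs roughly as follows. This is a sketch under assumptions: the exact pattern the gate accepts is not shown in this PR, so the regex and the helper name `parse_bug_id` are hypothetical.

```python
import re

# Assumed format: "BUG-" + optional severity letter (H, M or L) + digits.
BUG_ID_RE = re.compile(r"^BUG-(?P<severity>[HML])?(?P<number>\d+)$")


def parse_bug_id(bug_id):
    """Return (severity, number), or None if the ID is not recognized.

    severity is "H", "M", "L", or None for unprefixed IDs.
    """
    match = BUG_ID_RE.match(bug_id)
    if match is None:
        return None
    return match.group("severity"), int(match.group("number"))
```

Making the severity letter optional keeps unprefixed IDs valid, so older artifacts would not need to be renumbered.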
Comment thread skills/quality-playbook/quality_gate.sh Outdated
Contributor

This should live within the scripts folder per the skills spec

Comment thread skills/quality-playbook/quality_gate.sh Outdated
Contributor

This file shouldn't have to do directory walking to find the SKILL.md file; it should always be ../ from the location of the bash file.
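The fixed-relative-path approach suggested here can be sketched in Python (the gate was later rewritten as quality_gate.py). It assumes the script sits one folder below the skill root, for example in a `scripts/` directory; `skill_md_for` is a hypothetical helper, not code from this PR.

```python
from pathlib import Path


def skill_md_for(script_path):
    """Return the SKILL.md path one level above the script's folder.

    No directory walking: the location is derived purely from where
    the script itself lives.
    """
    script_dir = Path(script_path).parent
    return script_dir.parent / "SKILL.md"
```

Inside the script itself, the call would typically be `skill_md_for(Path(__file__).resolve())`, so the result does not depend on the current working directory.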

Comment thread agents/quality-playbook.agent.md Outdated

**If the skill is not installed**, tell the user:

> The quality playbook skill isn't installed in this repository yet. Install it from the [quality-playbook repository](https://github.com/andrewstellman/quality-playbook):
Contributor

What is the reason for "installing" this repo? Generally, we try to avoid having agents/skills clone external repos as that results in opacity in the security supply chain of what you install.

@andrewstellman
Contributor Author

andrewstellman commented Apr 28, 2026 via email

@aaronpowell
Contributor

Fine to iterate on this PR. We squash merge, so "messy" history isn't really a problem.

github-actions Bot and others added 2 commits May 8, 2026 06:23
- SKILL.md updated to v1.5.6 (2738 lines, was 2322 at v1.4.2). Banner
  + version stamps + JSON schema examples all bumped.
- references/ expanded to 16 files (was 12). New: challenge_gate.md,
  code-only-mode.md, orchestrator_protocol.md, run_state_schema.md.
  Existing files updated to current QPB content.
- phase_prompts/ added (9 files). New v1.5.x directory carrying
  externalized phase prompts the runner uses.
- agents/ added inside the skill bundle (3 files: quality-playbook.agent.md,
  quality-playbook-claude.agent.md, calibration_orchestrator.md). The
  v1.5.6 install bundle includes these so README's Step 4
  'claude --agent agents/...' invocation resolves from the install
  destination.
- bin/citation_verifier.py added — required by quality_gate.py for
  byte-equality citation verification (per QPB BUG-005 closure).
- quality_gate.sh retired in v1.4.5; replaced with quality_gate.py
  (Python rewrite with json.load + 108-test unit suite at upstream).
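One way a parsed approach can avoid the kind of `json_key_count` false positives mentioned in the earlier commit is shown below. This is a sketch under assumptions: the cause illustrated here (a key name that also appears as a string value) is a guess at one plausible failure mode, and both function names are hypothetical rather than taken from quality_gate.py.

```python
import json


def count_key_textual(text, key):
    """Naive text match: counts every quoted occurrence of the key name."""
    return text.count(f'"{key}"')


def count_key_parsed(text, key):
    """Parse the JSON and count only real object keys, at any depth."""

    def walk(node):
        if isinstance(node, dict):
            return sum(int(k == key) + walk(v) for k, v in node.items())
        if isinstance(node, list):
            return sum(walk(item) for item in node)
        return 0

    return walk(json.loads(text))
```

For a document such as `{"status": "verdict", "verdict": "pass"}`, the textual count also matches the string value, while the parsed count sees only the one real key.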

agents/quality-playbook.agent.md (top-level awesome-copilot agent file)
install instructions rewritten to install from awesome-copilot's own
skills/quality-playbook/ folder rather than from the andrewstellman/
quality-playbook upstream — adopters discovering the skill via
awesome-copilot can install without leaving the marketplace. Locations
list expanded to all four supported tools (Copilot, Cursor, Claude
Code, Continue). Pointer to upstream preserved as alternative for
users who want the full script-driven install UX (auto-detect, marker
directory creation, smoke checks).

Closes github#1402.
@github-actions github-actions Bot added agent (PR touches agents), new-submission (PR adds at least one new contribution), skills (PR touches skills) labels May 8, 2026
The skills index table row for quality-playbook was still listing the
v1.4.2 bundled assets (quality_gate.sh + 12 references files). v1.5.6
ships 31 files: SKILL.md, quality_gate.py, bin/citation_verifier.py,
3 agents, 9 phase_prompts, 16 references, plus LICENSE.txt. Updated
the row so the index reflects what's actually in skills/quality-playbook/.
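A quick way to check an index row against the folder contents is to count files per top-level entry of the skill directory. The sketch below is only an illustration; `count_bundle_files` is a hypothetical helper and not part of the skill or of the repository's tooling.

```python
from collections import Counter
from pathlib import Path


def count_bundle_files(skill_dir):
    """Count regular files under skill_dir, grouped by top-level entry.

    Files directly in skill_dir are grouped under ".".
    """
    root = Path(skill_dir)
    counts = Counter()
    for path in root.rglob("*"):
        if path.is_file():
            parts = path.relative_to(root).parts
            group = parts[0] if len(parts) > 1 else "."
            counts[group] += 1
    return counts
```

Calling `count_bundle_files("skills/quality-playbook")` and summing the values gives the total to compare with the number stated in the index row.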
Brings in new awesome-copilot main branch additions (skills + plugins
landed since the PR branch last synced) ahead of QPB v1.5.6 PR review.
Conflicts in plugins/ files (flowstudio-power-automate, gem-team) and
docs/README.plugins.md resolved by taking origin/main's versions —
these conflicts are unrelated to the QPB skill update; the PR has no
opinion on plugin content.
@andrewstellman andrewstellman requested a review from dvelton as a code owner May 9, 2026 01:22
@github-actions github-actions Bot added branched-main (PR appears to include plugin files materialized from main) and removed skills (PR touches skills), agent (PR touches agents), new-submission (PR adds at least one new contribution) labels May 9, 2026
…ook to v1.5.6

Resolved 39 conflicts by taking upstream's version (no QPB-specific
changes were in conflict — all conflicts were unrelated cookbook,
plugin, website, and docs files that drifted on upstream).

Synced skills/quality-playbook/ to QPB v1.5.6 (31 bundled assets):
agents/, phase_prompts/, bin/citation_verifier.py, quality_gate.py,
plus 16 references files. Updated docs/README.skills.md row with
v1.5.6 description and full bundled-assets list.
@github-actions github-actions Bot added skill-check-error (Skill validator reported errors), skill-check-warning (Skill validator reported warnings) labels May 9, 2026

Labels

branched-main (PR appears to include plugin files materialized from main), skill-check-error (Skill validator reported errors), skill-check-warning (Skill validator reported warnings)

3 participants