NEW Browse AI tools across categories — updated daily. See what's new →
Official Testing & Quality

quality-playbook

Run a complete quality engineering audit on any codebase. Derives behavioral requirements from the code, generates spec-traced functional tests, runs a three-pass code review with regression tests, executes a multi-model spec audit (Council of Three), and produces a consolidated bug report with TDD-verified patches. Finds the 35% of real defects that structural code review alone cannot catch. Works with any language. Trigger on 'quality playbook', 'spec audit', 'Council of Three', 'fitness-to...

This skill ships only metadata — no inline instructions. See the source repo for details.

Install this skill

1

One command (all agents)

Runs the npx skills CLI which auto-detects every AI coding agent you have installed (Claude Code, Cursor, Codex, OpenCode, Windsurf, Copilot, and 51 more).

npx skills add https://github.com/github/awesome-copilot
Alternative: shorthand form
npx skills add github/awesome-copilot
2

Install to a specific agent

Pick the agent you use. The CLI writes the skill to that agent's standard skill directory.

npx skills add github/awesome-copilot --agent claude-code
npx skills add github/awesome-copilot --agent cursor
npx skills add github/awesome-copilot --agent codex
npx skills add github/awesome-copilot --agent opencode
npx skills add github/awesome-copilot --agent github-copilot
npx skills add github/awesome-copilot --agent windsurf
3

Manual install (no CLI)

Prefer to skip the CLI? Clone the repo and drop the skill folder into your agent's skills directory.

git clone https://github.com/github/awesome-copilot.git
cp -r awesome-copilot ~/.claude/skills/

For other agents, replace ~/.claude/skills/ with their skill directory — see the full list.

4

Use it

Once installed, ask your agent to "use the quality-playbook skill" or describe what you want (e.g. "Run a complete quality engineering audit on any codebase. Derives behavioral req"). Most agents auto-discover the skill from its SKILL.md description — no slash command needed.

Requires: Node.js 18+ for npx skills. Skill files are MIT-style permissive by default — check the source repo for the actual license.

SKILL.md source

---
name: quality-playbook
description: Run a complete quality engineering audit on any codebase. Derives behavioral requirements from the code, generates spec-traced functional tests, runs a three-pass code review with regression tests, ex
---

Related skills 6

agent-browser

★ Featured Official

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction. Also use...

vercel-labs 297k
Testing & Quality

grill-me

★ Featured

Interview the user relentlessly about a plan or design until reaching shared understanding, resolving each branch of the decision tree. Use when user wants to stress-test a plan, get grilled on their design, or mentions "grill me".

mattpocock 192k
Testing & Quality

grill-with-docs

★ Featured

Grilling session that challenges your plan against the existing domain model, sharpens terminology, and updates documentation (CONTEXT.md, ADRs) inline as decisions crystallise. Use when user wants to stress-test a plan against their project's language and documented decisions.

mattpocock 138k
Testing & Quality

minimal-run-and-audit

★ Featured

RigorPilot trusted execution and reporting skill for README-first deep learning repo reproduction. Use when the task is specifically to capture or normalize evidence from the selected smoke test or documented inference or evaluation command and write standardized `repro_outputs/` files, including patch notes when repository files changed. Do not use for training execution, initial repo intake, generic environment setup, paper lookup, target selection, hidden scientific-meaning changes, or end...

lllllllama 127k
Testing & Quality

polish

★ Featured

Performs a final quality pass fixing alignment, spacing, consistency, and micro-detail issues before shipping. Use when the user mentions polish, finishing touches, pre-launch review, something looks off, or wants to go from good to great.

pbakaus 86k
Testing & Quality

critique

★ Featured

Evaluate design from a UX perspective, assessing visual hierarchy, information architecture, emotional resonance, cognitive load, and overall quality with quantitative scoring, persona-based testing, automated anti-pattern detection, and actionable feedback. Use when the user asks to review, critique, evaluate, or give feedback on a design or component.

pbakaus 83k
Testing & Quality