Testing & Quality

Test Skill With Subagents

Test any Claude skill using RED-GREEN-REFACTOR cycle with subagent pressure testing to verify the skill resists agent rationalization and bypass attempts

AuthorNeoLabHQ

Version1.0.0

LicenseMIT

Token count~681

UpdatedJun 5, 2026

Install

Quick install

via npx skills · works with 57+ agents

npx skills add https://github.com/NeoLabHQ/context-engineering-kit/tree/master/plugins/customaize-agent/skills/test-skill

Or pick agent:

npx skills add NeoLabHQ/context-engineering-kit --skill "Test Skill with Subagents" --agent claude-code

npx skills add NeoLabHQ/context-engineering-kit --skill "Test Skill with Subagents" --agent cursor

npx skills add NeoLabHQ/context-engineering-kit --skill "Test Skill with Subagents" --agent codex

npx skills add NeoLabHQ/context-engineering-kit --skill "Test Skill with Subagents" --agent opencode

npx skills add NeoLabHQ/context-engineering-kit --skill "Test Skill with Subagents" --agent github-copilot

npx skills add NeoLabHQ/context-engineering-kit --skill "Test Skill with Subagents" --agent windsurf

More install options

Shorthand — useful for multi-skill repos:

npx skills add NeoLabHQ/context-engineering-kit --skill "Test Skill with Subagents"

Manual — clone the repo and drop the folder into your agent's skills directory:

git clone https://github.com/NeoLabHQ/context-engineering-kit.git

cp -r context-engineering-kit/plugins/customaize-agent/skills/test-skill ~/.claude/skills/

How to use: Once installed, ask your agent to "use the Test Skill with Subagents skill" or describe what you want (e.g. "Test any Claude skill using RED-GREEN-REFACTOR cycle with subagent pressure test"). Requires Node.js 18+.

Test Skill with Subagents

Test any Claude skill using RED-GREEN-REFACTOR cycle with subagent pressure testing to verify the skill resists agent rationalization and bypass attempts

What is it?
Testing skills is just TDD applied to process documentation.

You run scenarios without the skill (RED - watch agent fail), write skill addressing those failures (GREEN - watch agent comply), then close loopholes (REFACTOR - stay compliant).

Core principle: If you didn't watch an agent fail without the skill, you don't know if the skill prevents the right failures.

REQUIRED BACKGROUND: You MUST understand superpowers:test-driven-development before using this skill. That skill defines the fundamental RED-GREEN-REFACTOR cycle. This skill provides skill-specific test formats (pressure scenarios, rationalization tables).

Complete worked example: See examples/CLAUDE_MD_TESTING.md for a full test campaign testing CLAUDE.md documentation variants.

How to use it?

`IMPORTANT: This is a real scenario. You must choose and act.
Don't ask hypothetical questions - make the actual decision.

You have access to: [skill-being-tested]
`

Make agent believe it's real work, not a quiz.

Key Features

Test any Claude skill using RED-GREEN-REFACTOR cycle with subagent pressure testing to verify the skill resists agent rationalization and bypass attempts
Seamless integration with Claude's development workflow
Comprehensive guidelines and best practices for test skill with subagentsView on GitHub

GitHub Stats

StarsForksLast UpdateAuthorNeoLabHQLicenseGPL-3.0Version1.0.0

Features

Related Skills

Test Any Prompt

Universal prompt testing methodology using RED-GREEN-REFACTOR cycle with subagents, supporting A/B testing and regression testing for commands, hooks, skills, and production prompts

433NeoLabHQAI & MLTesting00

Review Local Changes

Multi-agent code review system for uncommitted changes with 6 specialized reviewer roles (security, bug, quality, contract, testing, history), confidence scoring and false positive filtering

433NeoLabHQDeveloper ToolsTesting00

Review Pull Request

Multi-agent PR review system with specialized reviewers, inline comments, and automatic PR description generation using GitHub CLI

433NeoLabHQDeveloper ToolsTesting00

---

Source: https://github.com/NeoLabHQ/context-engineering-kit/tree/master/plugins/customaize-agent/skills/test-skill
Author: NeoLabHQ
License: https://www.gnu.org/licenses/gpl-3.0.html
GitHub Stars: 433
Tags: skill-testing, tdd, pressure-testing, subagent, quality-assurance

SKILL.md source

---
name: Test Skill with Subagents
description: Test any Claude skill using RED-GREEN-REFACTOR cycle with subagent pressure testing to verify the skill resists agent rationalization and bypass attempts
---

# Test Skill with Subagents

Test any Claude skill using RED-GREEN-REFACTOR cycle with subagent pressure testing to verify the skill resists agent rationalization and bypass attempts

What is it?
Testing skills is just TDD applied to process documentation.

You run scenarios without the skill (RED - watch agent fail), write skill addressing those failures (GREEN - watch agent comply), then close loopholes (REFACTOR - stay compliant).

Core principle: If you didn't watch an agent fail without the skill, you don't know if the skill prevents the right failures.

REQUIRED BACKGROUND: You MUST understand superpowers:test-driven-development before using this skill. That skill defines the fundamental RED-GREEN-REFACTOR cycle. This skill provides skill-specific test formats (pressure scenarios, rationalization tables).

Complete worked example: See examples/CLAUDE_MD_TESTING.md for a full test campaign testing CLAUDE.md documentation variants.

## How to use it?

```
`IMPORTANT: This is a real scenario. You must choose and act.
Don't ask hypothetical questions - make the actual decision.

You have access to: [skill-being-tested]
`
```

Make agent believe it's real work, not a quiz.

## Key Features

* Test any Claude skill using RED-GREEN-REFACTOR cycle with subagent pressure testing to verify the skill resists agent rationalization and bypass attempts
* Seamless integration with Claude's development workflow
* Comprehensive guidelines and best practices for test skill with subagentsView on GitHub

### GitHub Stats
StarsForksLast UpdateAuthorNeoLabHQLicenseGPL-3.0Version1.0.0

### Categories
AI & MLTesting

### Tags
skill-testingtddpressure-testingsubagentquality-assurance

### Features

## Related Skills
More from AI & ML

### Test Any Prompt
Universal prompt testing methodology using RED-GREEN-REFACTOR cycle with subagents, supporting A/B testing and regression testing for commands, hooks, skills, and production prompts

433NeoLabHQAI & MLTesting00

### Review Local Changes
Multi-agent code review system for uncommitted changes with 6 specialized reviewer roles (security, bug, quality, contract, testing, history), confidence scoring and false positive filtering

433NeoLabHQDeveloper ToolsTesting00

### Review Pull Request
Multi-agent PR review system with specialized reviewers, inline comments, and automatic PR description generation using GitHub CLI

433NeoLabHQDeveloper ToolsTesting00

---

**Source**: https://github.com/NeoLabHQ/context-engineering-kit/tree/master/plugins/customaize-agent/skills/test-skill
**Author**: NeoLabHQ
**License**: https://www.gnu.org/licenses/gpl-3.0.html
**GitHub Stars**: 433
**Tags**: skill-testing, tdd, pressure-testing, subagent, quality-assurance

Testing & Quality