NEW Browse AI tools across categories — updated daily. See what's new →

Autoresearch

Autonomous iteration loop: modify, verify, keep/discard against any metric

Version1.0.0
LicenseMIT
Token count~594
UpdatedJun 5, 2026

Install

Quick install

via npx skills · works with 57+ agents
npx skills add https://github.com/uditgoenka/autoresearch
Or pick agent:
npx skills add uditgoenka/autoresearch --agent claude-code
npx skills add uditgoenka/autoresearch --agent cursor
npx skills add uditgoenka/autoresearch --agent codex
npx skills add uditgoenka/autoresearch --agent opencode
npx skills add uditgoenka/autoresearch --agent github-copilot
npx skills add uditgoenka/autoresearch --agent windsurf
More install options

Shorthand — useful for multi-skill repos:

npx skills add uditgoenka/autoresearch

Manual — clone the repo and drop the folder into your agent's skills directory:

git clone https://github.com/uditgoenka/autoresearch.git
cp -r autoresearch ~/.claude/skills/
How to use: Once installed, ask your agent to "use the Autoresearch skill" or describe what you want (e.g. "Autonomous iteration loop: modify, verify, keep/discard against any metric"). Requires Node.js 18+.

Autoresearch

Autonomous iteration loop: modify, verify, keep/discard against any metric

---
name: autoresearch
description: "Autonomous iteration loop: modify, verify, keep/discard against any metric"
version: 2.1.0
---

Autoresearch — Autonomous Goal-directed Iteration

Safety Invariants (all subcommands)

  • Never push, publish, or deploy without explicit user approval.
  • Bounded by default. Override with Iterations: unlimited.
  • All results logged to autoresearch/{subcommand}-{YYMMDD}-{HHMM}/ directory.
  • Chain handoff via handoff.json. Evals reads *-results.tsv.

Subcommands

| Command | Does | Default Iterations |
|---|---|---|
| $autoresearch | Iterate against a metric: modify → verify → keep/discard | 25 |
| $autoresearch plan | Convert a goal into validated Scope, Metric, Verify config | N/A |
| $autoresearch debug | Hunt bugs: hypothesize → test → falsify → repeat | 15 |
| $autoresearch fix | Crush errors one-by-one until zero remain | 20 |
| $autoresearch security | STRIDE + OWASP audit with red-team personas | 15 |
| $autoresearch ship | Ship through 8 phases: checklist → dry-run → deploy → verify | N/A |
| $autoresearch scenario | Generate edge cases across 12 dimensions | 20 |
| $autoresearch predict | 5 expert personas debate before implementation | N/A |
| $autoresearch learn | Scout codebase → generate docs → validate → fix loop | 10 |
| $autoresearch reason | Adversarial debate with blind judges until convergence | 8 |
| $autoresearch probe | 8 personas interrogate requirements until saturation | 15 |
| $autoresearch improve | Research ICP challenges, discover improvements, generate PRDs | 15 |
| $autoresearch evals | Analyze iteration results: trends, plateaus, regressions | N/A |

Universal Flags

| Flag | Applies To | Purpose |
|---|---|---|
| Iterations: N | All looping | Set iteration count |
| Iterations: unlimited | All looping | Opt-in unbounded |
| --evals | All looping | Mid-loop checkpoints + final summary |
| --evals-interval N | All looping | Override checkpoint frequency |
| --chain <targets> | All | Sequential handoff after completion |
| --<subcommand> | All | Shorthand for --chain <subcommand> |

---

Source: https://github.com/uditgoenka/autoresearch
Author: uditgoenka
Discovered via: skillsdirectory.com
Genre: ai-agents

SKILL.md source

---
name: Autoresearch
description: Autonomous iteration loop: modify, verify, keep/discard against any metric
---

# Autoresearch

Autonomous iteration loop: modify, verify, keep/discard against any metric

---
name: autoresearch
description: "Autonomous iteration loop: modify, verify, keep/discard against any metric"
version: 2.1.0
---

# Autoresearch — Autonomous Goal-directed Iteration

## Safety Invariants (all subcommands)
- Never push, publish, or deploy without explicit user approval.
- Bounded by default. Override with `Iterations: unlimited`.
- All results logged to `autoresearch/{subcommand}-{YYMMDD}-{HHMM}/` directory.
- Chain handoff via `handoff.json`. Evals reads `*-results.tsv`.

## Subcommands

| Command | Does | Default Iterations |
|---|---|---|
| `$autoresearch` | Iterate against a metric: modify → verify → keep/discard | 25 |
| `$autoresearch plan` | Convert a goal into validated Scope, Metric, Verify config | N/A |
| `$autoresearch debug` | Hunt bugs: hypothesize → test → falsify → repeat | 15 |
| `$autoresearch fix` | Crush errors one-by-one until zero remain | 20 |
| `$autoresearch security` | STRIDE + OWASP audit with red-team personas | 15 |
| `$autoresearch ship` | Ship through 8 phases: checklist → dry-run → deploy → verify | N/A |
| `$autoresearch scenario` | Generate edge cases across 12 dimensions | 20 |
| `$autoresearch predict` | 5 expert personas debate before implementation | N/A |
| `$autoresearch learn` | Scout codebase → generate docs → validate → fix loop | 10 |
| `$autoresearch reason` | Adversarial debate with blind judges until convergence | 8 |
| `$autoresearch probe` | 8 personas interrogate requirements until saturation | 15 |
| `$autoresearch improve` | Research ICP challenges, discover improvements, generate PRDs | 15 |
| `$autoresearch evals` | Analyze iteration results: trends, plateaus, regressions | N/A |

## Universal Flags

| Flag | Applies To | Purpose |
|---|---|---|
| `Iterations: N` | All looping | Set iteration count |
| `Iterations: unlimited` | All looping | Opt-in unbounded |
| `--evals` | All looping | Mid-loop checkpoints + final summary |
| `--evals-interval N` | All looping | Override checkpoint frequency |
| `--chain <targets>` | All | Sequential handoff after completion |
| `--<subcommand>` | All | Shorthand for `--chain <subcommand>` |


---

**Source**: https://github.com/uditgoenka/autoresearch
**Author**: uditgoenka
**Discovered via**: skillsdirectory.com
**Genre**: ai-agents

Related skills 6

running-claude-code-via-litellm-copilot

★ Featured

Use when routing Claude Code through a local LiteLLM proxy to GitHub Copilot, reducing direct Anthropic spend, configuring ANTHROPIC_BASE_URL or ANTHROPIC_MODEL overrides, or troubleshooting Copilot proxy setup failures such as model-not-found, no localhost traffic, or GitHub 401/403 auth errors.

xixu-me 155k
AI & ML

skills-cli

★ Featured

Use when users ask to discover, install, list, check, update, remove, back up, restore, sync, or initialize Agent Skills, mention `bunx skills`, `npx skills`, `skills.sh`, or `skills-lock.json`, ask "find a skill for X", or want help extending agent capabilities with installable skills.

xixu-me 155k
AI & ML

repo-intake-and-plan

★ Featured

Narrow RigorPilot helper for README-first deep learning repo reproduction. Use when the task is specifically to scan a repository, read the README and common project files, extract documented commands, classify inference, evaluation, and training candidates, and return the smallest trustworthy reproduction plan to the main orchestrator. Do not use for environment setup, asset download, command execution, final reporting, paper lookup, or end-to-end orchestration.

lllllllama 127k
AI & ML

image-to-video

★ Featured

Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning ite...

agentspace-so 121k
AI & ML

video-edit

★ Featured

Edit existing video on RunComfy — this skill is a smart router that matches the user's intent to the right edit model in the RunComfy catalog. Picks Wan 2.7 Edit-Video (general restyle / background swap / packaging swap, identity + motion preservation), Kling 2.6 Pro Motion Control (transfer precise motion from a reference video to a target character), or Lucy Edit Restyle (lightweight identity-stable restyle / outfit swap). Bundles each model's documented prompting patterns so the skill gets...

agentspace-so 121k
AI & ML

nano-banana-2

★ Featured

Generate images with Google Nano Banana 2 (Gemini-family flash-tier text-to-image) on RunComfy — bundled with the model's documented prompting patterns so the skill gets sharper output than naive prompting against the same model. Documents Nano Banana 2's strengths (rapid iteration, in-image typography rendering, predictable framing, optional web-grounded context), the resolution-tier pricing, the safety-tolerance dial, and when to route to Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream in...

agentspace-so 121k
AI & ML