AI & ML

Autoresearch

Autonomous iteration loop: modify, verify, keep/discard against any metric

Author: uditgoenka

Version: 1.0.0

License: MIT

Token count: ~594

Updated: Jun 5, 2026

Install

Quick install

via npx skills · works with 57+ agents

npx skills add https://github.com/uditgoenka/autoresearch

Or pick agent:

npx skills add uditgoenka/autoresearch --agent claude-code

npx skills add uditgoenka/autoresearch --agent cursor

npx skills add uditgoenka/autoresearch --agent codex

npx skills add uditgoenka/autoresearch --agent opencode

npx skills add uditgoenka/autoresearch --agent github-copilot

npx skills add uditgoenka/autoresearch --agent windsurf

More install options

Shorthand — useful for multi-skill repos:

npx skills add uditgoenka/autoresearch

Manual — clone the repo and drop the folder into your agent's skills directory:

git clone https://github.com/uditgoenka/autoresearch.git

cp -r autoresearch ~/.claude/skills/

How to use: Once installed, ask your agent to "use the Autoresearch skill" or describe what you want (e.g. "Autonomous iteration loop: modify, verify, keep/discard against any metric"). Requires Node.js 18+.

---
name: autoresearch
description: "Autonomous iteration loop: modify, verify, keep/discard against any metric"
version: 2.1.0
---

Autoresearch — Autonomous Goal-directed Iteration

Safety Invariants (all subcommands)

Never push, publish, or deploy without explicit user approval.
Bounded by default. Override with Iterations: unlimited.
All results logged to autoresearch/{subcommand}-{YYMMDD}-{HHMM}/ directory.
Chain handoff via handoff.json. Evals reads *-results.tsv.
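
The log directory pattern above can be sketched in Python. This is a minimal illustration, not the skill's own code: the helper name `run_dir` is hypothetical, and the sketch assumes `YYMMDD` means a two-digit year and that the timestamp is the run's start time.

```python
from datetime import datetime
from pathlib import Path


def run_dir(subcommand: str, now: datetime) -> Path:
    """Build autoresearch/{subcommand}-{YYMMDD}-{HHMM} for a run started at `now`.

    Hypothetical helper: the skill only documents the directory pattern,
    not how it is produced.
    """
    stamp = now.strftime("%y%m%d-%H%M")  # two-digit year, month, day, then hour and minute
    return Path("autoresearch") / f"{subcommand}-{stamp}"


example = run_dir("debug", datetime(2026, 6, 5, 14, 30))
print(example.as_posix())  # autoresearch/debug-260605-1430
```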

Subcommands

| Command | Does | Default Iterations |
|---|---|---|
| $autoresearch | Iterate against a metric: modify → verify → keep/discard | 25 |
| $autoresearch plan | Convert a goal into validated Scope, Metric, Verify config | N/A |
| $autoresearch debug | Hunt bugs: hypothesize → test → falsify → repeat | 15 |
| $autoresearch fix | Crush errors one-by-one until zero remain | 20 |
| $autoresearch security | STRIDE + OWASP audit with red-team personas | 15 |
| $autoresearch ship | Ship through 8 phases: checklist → dry-run → deploy → verify | N/A |
| $autoresearch scenario | Generate edge cases across 12 dimensions | 20 |
| $autoresearch predict | 5 expert personas debate before implementation | N/A |
| $autoresearch learn | Scout codebase → generate docs → validate → fix loop | 10 |
| $autoresearch reason | Adversarial debate with blind judges until convergence | 8 |
| $autoresearch probe | 8 personas interrogate requirements until saturation | 15 |
| $autoresearch improve | Research ICP challenges, discover improvements, generate PRDs | 15 |
| $autoresearch evals | Analyze iteration results: trends, plateaus, regressions | N/A |
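
The modify → verify → keep/discard cycle behind the base `$autoresearch` command can be pictured as a bounded hill-climbing loop. The sketch below is an assumption about the general shape of such a loop, not the skill's implementation: `iterate`, `modify`, and `verify` are hypothetical names, the toy metric is invented for illustration, and "higher is better" is an assumed convention. Only the default cap of 25 comes from the table above.

```python
import random
from typing import Callable, List, Tuple, TypeVar

S = TypeVar("S")


def iterate(
    state: S,
    modify: Callable[[S], S],
    verify: Callable[[S], float],
    iterations: int = 25,  # default bound listed for $autoresearch
) -> Tuple[S, float, List[str]]:
    """Keep a candidate only when the metric improves; otherwise discard it."""
    best, best_score = state, verify(state)
    log: List[str] = []
    for i in range(1, iterations + 1):
        candidate = modify(best)   # modify
        score = verify(candidate)  # verify against the metric
        if score > best_score:     # keep
            best, best_score = candidate, score
            log.append(f"{i}\t{score:.4f}\tkeep")
        else:                      # discard
            log.append(f"{i}\t{score:.4f}\tdiscard")
    return best, best_score, log


# Toy problem (illustrative only): move two numbers toward 3.0.
rng = random.Random(0)


def toy_modify(xs: List[float]) -> List[float]:
    out = list(xs)
    out[rng.randrange(len(out))] += rng.uniform(-1.0, 1.0)
    return out


def toy_verify(xs: List[float]) -> float:
    return -sum((x - 3.0) ** 2 for x in xs)


start = [0.0, 0.0]
best, best_score, log = iterate(start, toy_modify, toy_verify)
print(f"start={toy_verify(start):.4f} best={best_score:.4f} kept={sum(l.endswith('keep') for l in log)}")
```

Because a candidate replaces the current best only on strict improvement, the final score can never be worse than the starting score, which is the property that makes discarding safe.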

Universal Flags

| Flag | Applies To | Purpose |
|---|---|---|
| Iterations: N | All looping | Set iteration count |
| Iterations: unlimited | All looping | Opt-in unbounded |
| --evals | All looping | Mid-loop checkpoints + final summary |
| --evals-interval N | All looping | Override checkpoint frequency |
| --chain <targets> | All | Sequential handoff after completion |
| --<subcommand> | All | Shorthand for --chain <subcommand> |
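
One way to read the flag table is as a small normalization step before a run starts. The sketch below is hypothetical: `parse_iterations` and `expand_chain` are illustrative names, and the skill does not document how an agent actually parses these flags. It also assumes that `--evals` always means the checkpoint flag rather than shorthand for `--chain evals`, since the table lists both meanings, and that each shorthand expands to its own `--chain` entry.

```python
import re
from typing import List, Optional

# Subcommands from the table above; "evals" is left out because --evals
# is also a standalone flag (assumption made for this sketch).
CHAINABLE = {
    "plan", "debug", "fix", "security", "ship", "scenario",
    "predict", "learn", "reason", "probe", "improve",
}


def parse_iterations(text: str, default: int) -> Optional[int]:
    """Return the iteration cap, or None when the user opted into unlimited."""
    match = re.search(r"Iterations:\s*(unlimited|\d+)", text)
    if match is None:
        return default  # bounded by default
    value = match.group(1)
    return None if value == "unlimited" else int(value)


def expand_chain(args: List[str]) -> List[str]:
    """Rewrite --<subcommand> shorthand as --chain <subcommand>."""
    out: List[str] = []
    for arg in args:
        name = arg[2:] if arg.startswith("--") else ""
        if name in CHAINABLE:
            out += ["--chain", name]
        else:
            out.append(arg)
    return out


print(parse_iterations("Iterations: 40", default=25))         # 40
print(parse_iterations("Iterations: unlimited", default=25))  # None
print(parse_iterations("no override given", default=25))      # 25
print(expand_chain(["--fix", "--evals"]))                     # ['--chain', 'fix', '--evals']
```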

---

Source: https://github.com/uditgoenka/autoresearch
Author: uditgoenka
Discovered via: skillsdirectory.com
Genre: ai-agents

SKILL.md source

---
name: Autoresearch
description: "Autonomous iteration loop: modify, verify, keep/discard against any metric"
---

# Autoresearch

Autonomous iteration loop: modify, verify, keep/discard against any metric

---
name: autoresearch
description: "Autonomous iteration loop: modify, verify, keep/discard against any metric"
version: 2.1.0
---

# Autoresearch — Autonomous Goal-directed Iteration

## Safety Invariants (all subcommands)
- Never push, publish, or deploy without explicit user approval.
- Bounded by default. Override with `Iterations: unlimited`.
- All results logged to `autoresearch/{subcommand}-{YYMMDD}-{HHMM}/` directory.
- Chain handoff via `handoff.json`. Evals reads `*-results.tsv`.

## Subcommands

| Command | Does | Default Iterations |
|---|---|---|
| `$autoresearch` | Iterate against a metric: modify → verify → keep/discard | 25 |
| `$autoresearch plan` | Convert a goal into validated Scope, Metric, Verify config | N/A |
| `$autoresearch debug` | Hunt bugs: hypothesize → test → falsify → repeat | 15 |
| `$autoresearch fix` | Crush errors one-by-one until zero remain | 20 |
| `$autoresearch security` | STRIDE + OWASP audit with red-team personas | 15 |
| `$autoresearch ship` | Ship through 8 phases: checklist → dry-run → deploy → verify | N/A |
| `$autoresearch scenario` | Generate edge cases across 12 dimensions | 20 |
| `$autoresearch predict` | 5 expert personas debate before implementation | N/A |
| `$autoresearch learn` | Scout codebase → generate docs → validate → fix loop | 10 |
| `$autoresearch reason` | Adversarial debate with blind judges until convergence | 8 |
| `$autoresearch probe` | 8 personas interrogate requirements until saturation | 15 |
| `$autoresearch improve` | Research ICP challenges, discover improvements, generate PRDs | 15 |
| `$autoresearch evals` | Analyze iteration results: trends, plateaus, regressions | N/A |

## Universal Flags

| Flag | Applies To | Purpose |
|---|---|---|
| `Iterations: N` | All looping | Set iteration count |
| `Iterations: unlimited` | All looping | Opt-in unbounded |
| `--evals` | All looping | Mid-loop checkpoints + final summary |
| `--evals-interval N` | All looping | Override checkpoint frequency |
| `--chain <targets>` | All | Sequential handoff after completion |
| `--<subcommand>` | All | Shorthand for `--chain <subcommand>` |


---

**Source**: https://github.com/uditgoenka/autoresearch
**Author**: uditgoenka
**Discovered via**: skillsdirectory.com
**Genre**: ai-agents
