Import Summarizer
Convert and summarize reference materials (.docx, .pdf, .pptx, .html, .txt, .md) into context-budget-friendly indexed summaries. Use this skill when the user asks to "import a document", "convert a...
Convert and summarize reference materials (.docx, .pdf, .pptx, .html, .txt, .md) into context-budget-friendly indexed summaries. Use this skill when the user asks to "import a document", "convert a PDF", "read a .docx file", "summarize a reference", "process reference materials", or when any CKW agent needs to convert non-markdown files to readable text and generate summaries for the reference index.
Install
Quick install
npx skills add https://github.com/RDEL-Group/compound-knowledge-worknpx skills add RDEL-Group/compound-knowledge-work --agent claude-codenpx skills add RDEL-Group/compound-knowledge-work --agent cursornpx skills add RDEL-Group/compound-knowledge-work --agent codexnpx skills add RDEL-Group/compound-knowledge-work --agent opencodenpx skills add RDEL-Group/compound-knowledge-work --agent github-copilotnpx skills add RDEL-Group/compound-knowledge-work --agent windsurfMore install options
Shorthand — useful for multi-skill repos:
npx skills add RDEL-Group/compound-knowledge-workManual — clone the repo and drop the folder into your agent's skills directory:
git clone https://github.com/RDEL-Group/compound-knowledge-work.gitcp -r compound-knowledge-work ~/.claude/skills/Import Summarizer
Convert and summarize reference materials (.docx, .pdf, .pptx, .html, .txt, .md) into context-budget-friendly indexed summaries. Use this skill when the user asks to "import a document", "convert a PDF", "read a .docx file", "summarize a reference", "process reference materials", or when any CKW agent needs to convert non-markdown files to readable text and generate summaries for the reference index.
---
name: import-summarizer
description: >
Convert and summarize reference materials (.docx, .pdf, .pptx, .html, .txt, .md)
into context-budget-friendly indexed summaries. Use this skill when the user asks
to "import a document", "convert a PDF", "read a .docx file", "summarize a reference",
"process reference materials", or when any CKW agent needs to convert non-markdown
files to readable text and generate summaries for the reference index.
compatibility: macOS (textutil), pandoc, or python3 with python-docx/PyPDF2
---
Import Summarizer
Convert and process reference materials into indexed, context-budget-friendly summaries. This is the gateway for all reference materials entering a CKW project.
When to Use
/ckw:new-project --from-prdneeds to read a PRD document/ckw:import-referenceprocesses reference materials- Any agent needs to convert a non-markdown document to readable text
Document Conversion
Supported formats
| Format | macOS (preferred) | Cross-platform fallback | Last resort |
|--------|-------------------|------------------------|-------------|
| .docx | textutil -convert txt -stdout | pandoc -t markdown | python3 with python-docx |
| .pdf | textutil -convert txt -stdout | pandoc -t markdown | python3 with PyPDF2 |
| .pptx | textutil -convert txt -stdout | pandoc -t markdown | python3 with python-pptx |
| .txt | Direct read | Direct read | Direct read |
| .md | Direct read | Direct read | Direct read |
| .html | textutil -convert txt -stdout | pandoc -t markdown | Strip tags with sed |
Convert the document
Execute scripts/convert_document.sh <filepath> for document conversion. The script uses a cascading fallback strategy: textutil (macOS) → pandoc → Python libraries.
Detect the file type from its extension. For .md and .txt, read directly. For all other supported formats, run the conversion script. If no converter is available, tell the user what to install.
Summarization
After converting to readable text, generate a summary index file.
Input
- Converted text content
- Original file path and metadata (size, type, date)
Output
Save toreference/.index/{filename}.md using the template in assets/summary-template.md.
Rules
- Preserve specifics — Names, dates, dollar amounts, percentages, technical specs must be exact
- Flag structure — Note if the document has tables, appendices, scoring rubrics, or forms
- Estimate tokens — Use
word_count * 1.3as token estimate in the YAML frontmatter - Map sections — Map major sections so the context-loader can pull specific parts
- Don't interpret — Summarize what the document says, not what it means for the project. Interpretation is the planner's job.
Batch Mode
When processing multiple files (e.g., during /ckw:adopt-project):
Process each file sequentially. After all files, present a summary:
Imported 4 reference files:
Satellite_PRD_FY2026.docx (~4,500 tokens) — Product requirements
Competitor_Analysis.pdf (~2,100 tokens) — Market research
Brand_Guidelines.docx (~1,800 tokens) — Voice and tone
Past_Proposal_Win.pdf (~6,200 tokens) — Reference example
Total reference budget: ~14,600 tokens
Error Handling
- No converter available — Tell the user what to install: "Install pandoc (
brew install pandoc) or run on macOS where textutil is built in." - Garbled output (common with complex PDFs) — Warn the user and suggest pasting the content manually
- Very large file (>50,000 tokens estimated) — Warn about context budget impact and ask the user to identify which sections are most relevant
---
Source: https://github.com/RDEL-Group/compound-knowledge-work
Author: RDEL-Group
Discovered via: skillsdirectory.com
Genre: ai-agents
SKILL.md source
---
name: Import Summarizer
description: Convert and summarize reference materials (.docx, .pdf, .pptx, .html, .txt, .md) into context-budget-friendly indexed summaries. Use this skill when the user asks to "import a document", "convert a...
---
# Import Summarizer
Convert and summarize reference materials (.docx, .pdf, .pptx, .html, .txt, .md) into context-budget-friendly indexed summaries. Use this skill when the user asks to "import a document", "convert a PDF", "read a .docx file", "summarize a reference", "process reference materials", or when any CKW agent needs to convert non-markdown files to readable text and generate summaries for the reference index.
---
name: import-summarizer
description: >
Convert and summarize reference materials (.docx, .pdf, .pptx, .html, .txt, .md)
into context-budget-friendly indexed summaries. Use this skill when the user asks
to "import a document", "convert a PDF", "read a .docx file", "summarize a reference",
"process reference materials", or when any CKW agent needs to convert non-markdown
files to readable text and generate summaries for the reference index.
compatibility: macOS (textutil), pandoc, or python3 with python-docx/PyPDF2
---
# Import Summarizer
Convert and process reference materials into indexed, context-budget-friendly summaries. This is the gateway for all reference materials entering a CKW project.
## When to Use
- `/ckw:new-project --from-prd` needs to read a PRD document
- `/ckw:import-reference` processes reference materials
- Any agent needs to convert a non-markdown document to readable text
## Document Conversion
### Supported formats
| Format | macOS (preferred) | Cross-platform fallback | Last resort |
|--------|-------------------|------------------------|-------------|
| .docx | `textutil -convert txt -stdout` | `pandoc -t markdown` | `python3` with python-docx |
| .pdf | `textutil -convert txt -stdout` | `pandoc -t markdown` | `python3` with PyPDF2 |
| .pptx | `textutil -convert txt -stdout` | `pandoc -t markdown` | `python3` with python-pptx |
| .txt | Direct read | Direct read | Direct read |
| .md | Direct read | Direct read | Direct read |
| .html | `textutil -convert txt -stdout` | `pandoc -t markdown` | Strip tags with sed |
### Convert the document
Execute `scripts/convert_document.sh <filepath>` for document conversion. The script uses a cascading fallback strategy: textutil (macOS) → pandoc → Python libraries.
Detect the file type from its extension. For `.md` and `.txt`, read directly. For all other supported formats, run the conversion script. If no converter is available, tell the user what to install.
## Summarization
After converting to readable text, generate a summary index file.
### Input
- Converted text content
- Original file path and metadata (size, type, date)
### Output
Save to `reference/.index/{filename}.md` using the template in `assets/summary-template.md`.
### Rules
1. **Preserve specifics** — Names, dates, dollar amounts, percentages, technical specs must be exact
2. **Flag structure** — Note if the document has tables, appendices, scoring rubrics, or forms
3. **Estimate tokens** — Use `word_count * 1.3` as token estimate in the YAML frontmatter
4. **Map sections** — Map major sections so the context-loader can pull specific parts
5. **Don't interpret** — Summarize what the document says, not what it means for the project. Interpretation is the planner's job.
## Batch Mode
When processing multiple files (e.g., during `/ckw:adopt-project`):
Process each file sequentially. After all files, present a summary:
```
Imported 4 reference files:
Satellite_PRD_FY2026.docx (~4,500 tokens) — Product requirements
Competitor_Analysis.pdf (~2,100 tokens) — Market research
Brand_Guidelines.docx (~1,800 tokens) — Voice and tone
Past_Proposal_Win.pdf (~6,200 tokens) — Reference example
Total reference budget: ~14,600 tokens
```
## Error Handling
- **No converter available** — Tell the user what to install: "Install pandoc (`brew install pandoc`) or run on macOS where textutil is built in."
- **Garbled output** (common with complex PDFs) — Warn the user and suggest pasting the content manually
- **Very large file** (>50,000 tokens estimated) — Warn about context budget impact and ask the user to identify which sections are most relevant
---
**Source**: https://github.com/RDEL-Group/compound-knowledge-work
**Author**: RDEL-Group
**Discovered via**: skillsdirectory.com
**Genre**: ai-agents
Related skills 6
running-claude-code-via-litellm-copilot
Use when routing Claude Code through a local LiteLLM proxy to GitHub Copilot, reducing direct Anthropic spend, configuring ANTHROPIC_BASE_URL or ANTHROPIC_MODEL overrides, or troubleshooting Copilot proxy setup failures such as model-not-found, no localhost traffic, or GitHub 401/403 auth errors.
skills-cli
Use when users ask to discover, install, list, check, update, remove, back up, restore, sync, or initialize Agent Skills, mention `bunx skills`, `npx skills`, `skills.sh`, or `skills-lock.json`, ask "find a skill for X", or want help extending agent capabilities with installable skills.
repo-intake-and-plan
Narrow RigorPilot helper for README-first deep learning repo reproduction. Use when the task is specifically to scan a repository, read the README and common project files, extract documented commands, classify inference, evaluation, and training candidates, and return the smallest trustworthy reproduction plan to the main orchestrator. Do not use for environment setup, asset download, command execution, final reporting, paper lookup, or end-to-end orchestration.
image-to-video
Animate any still image on RunComfy — this skill is a smart router that matches the user's intent to the right i2v model in the RunComfy catalog. Picks HappyHorse 1.0 I2V (Arena #1, native audio, identity preservation) for general animations, Wan 2.7 with `audio_url` for custom-voiceover lip-sync, or Seedance 2.0 Pro for multi-modal animation from image + reference video + reference audio. Bundles each model's documented prompting patterns so the caller gets sharper output without burning ite...
video-edit
Edit existing video on RunComfy — this skill is a smart router that matches the user's intent to the right edit model in the RunComfy catalog. Picks Wan 2.7 Edit-Video (general restyle / background swap / packaging swap, identity + motion preservation), Kling 2.6 Pro Motion Control (transfer precise motion from a reference video to a target character), or Lucy Edit Restyle (lightweight identity-stable restyle / outfit swap). Bundles each model's documented prompting patterns so the skill gets...
nano-banana-2
Generate images with Google Nano Banana 2 (Gemini-family flash-tier text-to-image) on RunComfy — bundled with the model's documented prompting patterns so the skill gets sharper output than naive prompting against the same model. Documents Nano Banana 2's strengths (rapid iteration, in-image typography rendering, predictable framing, optional web-grounded context), the resolution-tier pricing, the safety-tolerance dial, and when to route to Nano Banana Pro / GPT Image 2 / Flux 2 / Seedream in...