Uni Image
Unified multi-platform AI image generation. Supports Volcengine Seedream, Alibaba Qwen Image, and Google Gemini (Nano Banana). Switch between models with a dropdown selector.
Install
Quick install
npx skills add https://github.com/sangjiexun/openclaw-skill-UniImagenpx skills add sangjiexun/openclaw-skill-UniImage --agent claude-codenpx skills add sangjiexun/openclaw-skill-UniImage --agent cursornpx skills add sangjiexun/openclaw-skill-UniImage --agent codexnpx skills add sangjiexun/openclaw-skill-UniImage --agent opencodenpx skills add sangjiexun/openclaw-skill-UniImage --agent github-copilotnpx skills add sangjiexun/openclaw-skill-UniImage --agent windsurfMore install options
Shorthand — useful for multi-skill repos:
npx skills add sangjiexun/openclaw-skill-UniImageManual — clone the repo and drop the folder into your agent's skills directory:
git clone https://github.com/sangjiexun/openclaw-skill-UniImage.gitcp -r openclaw-skill-UniImage ~/.claude/skills/uni-image
Unified multi-platform AI image generation. Supports Volcengine Seedream, Alibaba Qwen Image, and Google Gemini (Nano Banana). Switch between models with a dropdown selector.
---
name: uni-image
description: Unified multi-platform AI image generation. Supports Volcengine Seedream, Alibaba Qwen Image, and Google Gemini (Nano Banana). Switch between models with a dropdown selector.
homepage: https://github.com/sangjiexun/openclaw-skill-UniImage
metadata:
{
"openclaw":
{
"emoji": "🖼️",
"requires":
{
"env":
[
"ARK_API_KEY",
"DASHSCOPE_IMAGE_KEY",
"GOOGLE_API_KEY",
],
},
"primaryEnv": "ARK_API_KEY",
},
}
---
UniImage — 统一多平台 AI 图片生成
Multi-platform AI image generation skill for OpenClaw. Supports switching between providers via a dropdown model selector on the paint page.
Supported Platforms
| Model ID | Display Name | Provider | Description |
|----------|-------------|----------|-------------|
| doubao-seedream-5-0-260128 | Seedream 5.0 | Volcengine Ark | 多角色超强一致性,中文处理能力极强 |
| qwen-image-plus | Qwen Image | Alibaba DashScope | 单图一致性强,适合中文相关的多图处理场景 |
| gemini-3-pro-image-preview | 香蕉 Pro | Google Gemini | 最强修图模型,适合电商和专业设计 |
| gemini-3.1-flash-image-preview | 香蕉 V2 | Google Gemini | 最新香蕉模型,极致速度和超高性价比 |
Architecture
Renderer (paint page)
↓ fetch(/v1/images/generations)
↓ [fetch interceptor rewrites URL + model]
UniImage Proxy (port 18800)
├── Volcengine Ark API (Seedream)
├── DashScope API (Qwen Image, async poll)
└── Google Gemini API (Nano Banana)
Files
uni-image-proxy.js— HTTP proxy server with multi-provider routinguni-image-inject.js— Renderer injection script (fetch wrapper + model selector UI)SKILL.md— This skill manifest
Configuration
Configure API keys via the 🔑 button on the paint page, or set environment variables:
# Volcengine Ark (Seedream)
ARK_API_KEY=your-volcengine-key
# Alibaba DashScope (Qwen Image)
DASHSCOPE_IMAGE_KEY=your-dashscope-key
# Google AI (Nano Banana / Gemini)
GOOGLE_API_KEY=your-google-api-key
Keys are stored in ~/.openclaw-dev/uni-image-config.json.
Usage
- Navigate to the 绘画助手 (Paint) page
- Select a model from the 模型平台 dropdown
- Click 🔑 to configure the API key for the selected provider
- Enter a prompt and generate images
Run (CLI)
# Seedream (default)
node uni-image-proxy.js &
curl -X POST http://127.0.0.1:18800/v1/images/generations \
-H "Content-Type: application/json" \
-d '{"model":"doubao-seedream-5-0-260128","prompt":"a cute cat","size":"1024x1024"}'
# Qwen Image
curl -X POST http://127.0.0.1:18800/v1/images/generations \
-H "Content-Type: application/json" \
-d '{"model":"qwen-image-plus","prompt":"a cute cat","size":"1024*1024"}'
# Google Gemini (Nano Banana)
curl -X POST http://127.0.0.1:18800/v1/images/generations \
-H "Content-Type: application/json" \
-d '{"model":"gemini-3.1-flash-image-preview","prompt":"a cute cat"}'
---
Source: https://github.com/sangjiexun/openclaw-skill-UniImage
Author: sangjiexun
Discovered via: skillsdirectory.com
Genre: code-quality
SKILL.md source
---
name: uni-image
description: Unified multi-platform AI image generation. Supports Volcengine Seedream, Alibaba Qwen Image, and Google Gemini (Nano Banana). Switch between models with a dropdown selector.
---
# uni-image
Unified multi-platform AI image generation. Supports Volcengine Seedream, Alibaba Qwen Image, and Google Gemini (Nano Banana). Switch between models with a dropdown selector.
---
name: uni-image
description: Unified multi-platform AI image generation. Supports Volcengine Seedream, Alibaba Qwen Image, and Google Gemini (Nano Banana). Switch between models with a dropdown selector.
homepage: https://github.com/sangjiexun/openclaw-skill-UniImage
metadata:
{
"openclaw":
{
"emoji": "🖼️",
"requires":
{
"env":
[
"ARK_API_KEY",
"DASHSCOPE_IMAGE_KEY",
"GOOGLE_API_KEY",
],
},
"primaryEnv": "ARK_API_KEY",
},
}
---
# UniImage — 统一多平台 AI 图片生成
Multi-platform AI image generation skill for OpenClaw. Supports switching between providers via a dropdown model selector on the paint page.
## Supported Platforms
| Model ID | Display Name | Provider | Description |
|----------|-------------|----------|-------------|
| `doubao-seedream-5-0-260128` | Seedream 5.0 | Volcengine Ark | 多角色超强一致性,中文处理能力极强 |
| `qwen-image-plus` | Qwen Image | Alibaba DashScope | 单图一致性强,适合中文相关的多图处理场景 |
| `gemini-3-pro-image-preview` | 香蕉 Pro | Google Gemini | 最强修图模型,适合电商和专业设计 |
| `gemini-3.1-flash-image-preview` | 香蕉 V2 | Google Gemini | 最新香蕉模型,极致速度和超高性价比 |
## Architecture
```
Renderer (paint page)
↓ fetch(/v1/images/generations)
↓ [fetch interceptor rewrites URL + model]
UniImage Proxy (port 18800)
├── Volcengine Ark API (Seedream)
├── DashScope API (Qwen Image, async poll)
└── Google Gemini API (Nano Banana)
```
## Files
- `uni-image-proxy.js` — HTTP proxy server with multi-provider routing
- `uni-image-inject.js` — Renderer injection script (fetch wrapper + model selector UI)
- `SKILL.md` — This skill manifest
## Configuration
Configure API keys via the 🔑 button on the paint page, or set environment variables:
```bash
# Volcengine Ark (Seedream)
ARK_API_KEY=your-volcengine-key
# Alibaba DashScope (Qwen Image)
DASHSCOPE_IMAGE_KEY=your-dashscope-key
# Google AI (Nano Banana / Gemini)
GOOGLE_API_KEY=your-google-api-key
```
Keys are stored in `~/.openclaw-dev/uni-image-config.json`.
## Usage
1. Navigate to the **绘画助手** (Paint) page
2. Select a model from the **模型平台** dropdown
3. Click **🔑** to configure the API key for the selected provider
4. Enter a prompt and generate images
## Run (CLI)
```bash
# Seedream (default)
node uni-image-proxy.js &
curl -X POST http://127.0.0.1:18800/v1/images/generations \
-H "Content-Type: application/json" \
-d '{"model":"doubao-seedream-5-0-260128","prompt":"a cute cat","size":"1024x1024"}'
# Qwen Image
curl -X POST http://127.0.0.1:18800/v1/images/generations \
-H "Content-Type: application/json" \
-d '{"model":"qwen-image-plus","prompt":"a cute cat","size":"1024*1024"}'
# Google Gemini (Nano Banana)
curl -X POST http://127.0.0.1:18800/v1/images/generations \
-H "Content-Type: application/json" \
-d '{"model":"gemini-3.1-flash-image-preview","prompt":"a cute cat"}'
```
---
**Source**: https://github.com/sangjiexun/openclaw-skill-UniImage
**Author**: sangjiexun
**Discovered via**: skillsdirectory.com
**Genre**: code-quality
Related skills 6
caveman
Ultra-compressed communication mode. Cuts token usage ~75% by speaking like caveman while keeping full technical accuracy. Supports intensity levels: lite, full (default), ultra, wenyan-lite, wenyan-full, wenyan-ultra. Use when user says "caveman mode", "talk like caveman", "use caveman", "less tokens", "be brief", or invokes /caveman. Also auto-triggers when token efficiency is requested.
secure-linux-web-hosting
Use when setting up, hardening, or reviewing a cloud server for self-hosting, including DNS, SSH, firewalls, Nginx, static-site hosting, reverse-proxying an app, HTTPS with Let's Encrypt or ACME clients, safe HTTP-to-HTTPS redirects, or optional post-launch network tuning such as BBR.
readme-i18n
Use when the user wants to translate a repository README, make a repo multilingual, localize docs, add a language switcher, internationalize the README, or update localized README variants in a GitHub-style repository.
lark-shared
Use when first setting up lark-cli, running auth login, switching user/bot identity (--as), handling permission denied or scope errors, needing to update lark-cli, or seeing _notice in JSON output.
improve-codebase-architecture
Find deepening opportunities in a codebase, informed by the domain language in CONTEXT.md and the decisions in docs/adr/. Use when the user wants to improve architecture, find refactoring opportunities, consolidate tightly-coupled modules, or make a codebase more testable and AI-navigable.
paper-context-resolver
Optional RigorPilot helper for README-first deep learning repo reproduction. Use only when the README and repository files leave a narrow reproduction-critical gap and the task is to resolve a specific paper detail such as dataset split, preprocessing, evaluation protocol, checkpoint mapping, or runtime assumption from primary paper sources while recording conflicts. Do not use for general paper summary, repo scanning, environment setup, command execution, title-only paper lookup, or replacin...