NEW Browse AI tools across categories — updated daily. See what's new →
Official Mobile

huggingface-vision-trainer

Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DETR, YOLOS), image classification (timm models — MobileNetV3, MobileViT, ResNet, ViT/DINOv3 — plus any Transformers classifier), and SAM/SAM2 segmentation using Hugging Face Transformers on Hugging Face Jobs cloud GPUs. Covers COCO-format dataset preparation, Albumentations augmentation, mAP/mAR evaluation, accuracy metrics, SAM segmentation with bbox/point prompts, DiceCE loss, hardware selection, cost estimation,...

This skill ships only metadata — no inline instructions. See the source repo for details.

Install this skill

1

One command (all agents)

Runs the npx skills CLI which auto-detects every AI coding agent you have installed (Claude Code, Cursor, Codex, OpenCode, Windsurf, Copilot, and 51 more).

npx skills add https://github.com/huggingface/skills/tree/HEAD/skills/huggingface-vision-trainer
Alternative: shorthand form
npx skills add huggingface/skills --skill huggingface-vision-trainer
2

Install to a specific agent

Pick the agent you use. The CLI writes the skill to that agent's standard skill directory.

npx skills add huggingface/skills --skill huggingface-vision-trainer --agent claude-code
npx skills add huggingface/skills --skill huggingface-vision-trainer --agent cursor
npx skills add huggingface/skills --skill huggingface-vision-trainer --agent codex
npx skills add huggingface/skills --skill huggingface-vision-trainer --agent opencode
npx skills add huggingface/skills --skill huggingface-vision-trainer --agent github-copilot
npx skills add huggingface/skills --skill huggingface-vision-trainer --agent windsurf
3

Manual install (no CLI)

Prefer to skip the CLI? Clone the repo and drop the skill folder into your agent's skills directory.

git clone https://github.com/huggingface/skills.git
cp -r skills/skills/huggingface-vision-trainer ~/.claude/skills/

For other agents, replace ~/.claude/skills/ with their skill directory — see the full list.

4

Use it

Once installed, ask your agent to "use the huggingface-vision-trainer skill" or describe what you want (e.g. "Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DE"). Most agents auto-discover the skill from its SKILL.md description — no slash command needed.

Requires: Node.js 18+ for npx skills. Skill files are MIT-style permissive by default — check the source repo for the actual license.

SKILL.md source

---
name: huggingface-vision-trainer
description: Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DETR, YOLOS), image classification (timm models — MobileNetV3, MobileViT, ResNet, ViT/DINOv3 — plus any Transformers class
---

Related skills 6