Official Mobile

huggingface-vision-trainer

Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DETR, YOLOS), image classification (timm models — MobileNetV3, MobileViT, ResNet, ViT/DINOv3 — plus any Transformers classifier), and SAM/SAM2 segmentation using Hugging Face Transformers on Hugging Face Jobs cloud GPUs. Covers COCO-format dataset preparation, Albumentations augmentation, mAP/mAR evaluation, accuracy metrics, SAM segmentation with bbox/point prompts, DiceCE loss, hardware selection, cost estimation,...

This skill ships only metadata — no inline instructions. See the source repo for details.

Install this skill

One command (all agents)

Runs the npx skills CLI which auto-detects every AI coding agent you have installed (Claude Code, Cursor, Codex, OpenCode, Windsurf, Copilot, and 51 more).

npx skills add https://github.com/huggingface/skills/tree/HEAD/skills/huggingface-vision-trainer

Alternative: shorthand form

npx skills add huggingface/skills --skill huggingface-vision-trainer

Install to a specific agent

Pick the agent you use. The CLI writes the skill to that agent's standard skill directory.

npx skills add huggingface/skills --skill huggingface-vision-trainer --agent claude-code

npx skills add huggingface/skills --skill huggingface-vision-trainer --agent cursor

npx skills add huggingface/skills --skill huggingface-vision-trainer --agent codex

npx skills add huggingface/skills --skill huggingface-vision-trainer --agent opencode

npx skills add huggingface/skills --skill huggingface-vision-trainer --agent github-copilot

npx skills add huggingface/skills --skill huggingface-vision-trainer --agent windsurf

Manual install (no CLI)

Prefer to skip the CLI? Clone the repo and drop the skill folder into your agent's skills directory.

git clone https://github.com/huggingface/skills.git

cp -r skills/skills/huggingface-vision-trainer ~/.claude/skills/

For other agents, replace ~/.claude/skills/ with their skill directory — see the full list.

Use it

Once installed, ask your agent to "use the huggingface-vision-trainer skill" or describe what you want (e.g. "Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DE"). Most agents auto-discover the skill from its SKILL.md description — no slash command needed.

Requires: Node.js 18+ for npx skills. Skill files are MIT-style permissive by default — check the source repo for the actual license.

SKILL.md source

---
name: huggingface-vision-trainer
description: Trains and fine-tunes vision models for object detection (D-FINE, RT-DETR v2, DETR, YOLOS), image classification (timm models — MobileNetV3, MobileViT, ResNet, ViT/DINOv3 — plus any Transformers class
---

Related skills 6

vercel-react-native-skills

★ Featured Official

React Native and Expo best practices for building performant mobile apps. Use when building React Native components, optimizing list performance, implementing animations, or working with native modules. Triggers on tasks involving React Native, Expo, mobile performance, or native platform APIs.

vercel-labs 124k

Mobile

adapt

★ Featured

Adapt designs to work across different screen sizes, devices, contexts, or platforms. Implements breakpoints, fluid layouts, and touch targets. Use when the user mentions responsive design, mobile layouts, breakpoints, viewport adaptation, or cross-device compatibility.

pbakaus 82k

Mobile