
MiMo-V2-Omni Multimodal

By Xiaomi

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...

OpenRouter rank: #93 (0.00% share)
Quality Index: 43
Context window: 262K tokens
Input price: $0.40 / 1M tokens
Output price: $2.00 / 1M tokens
Modalities: text, audio, image, video
External ID: xiaomi/mimo-v2-omni
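
As a rough illustration of what the listed prices mean per request, the sketch below multiplies token counts by the per-million rates shown above. The function name `estimate_cost` and the example token counts are hypothetical, and the calculation assumes simple linear per-token billing with no caching discounts, tiering, or separate rates per modality, none of which this listing specifies.

```python
# Prices copied from the listing (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.40
OUTPUT_PRICE_PER_M = 2.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 100,000 input tokens and 5,000 output tokens.
    print(f"${estimate_cost(100_000, 5_000):.4f}")
```

Because output tokens are priced at five times the input rate, long generations can dominate the bill even when the prompt is much larger than the response.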

Rank over time

Lower is better. Trend lines from each ranking source where this model has appeared.

Score over time

ELO from Arena, quality index from Artificial Analysis, or token-share % from OpenRouter.
