MiMo-V2-Omni Multimodal

By Xiaomi

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability - visual grounding, multi-step...

OpenRouter rank

#93

0.00% share

Quality Index

Context window

262K tokens

Input price

$0.40 / 1M tok

Output price

$2.00 / 1M tok

Modalities

textaudioimagevideo

External ID

xiaomi/mimo-v2-omni

Rank over time

Lower is better. Trend lines from each ranking source where this model has appeared.

Score over time

ELO from Arena, quality index from Artificial Analysis, or token-share % from OpenRouter.

Back to all rankings