# MiMo-V2.5 Voice

> Bilingual ASR for dialects, code-switching, and songs

MiMo-V2.5-ASR is an 8B open-source speech recognition model from Xiaomi that transcribes Mandarin, English, eight Chinese dialects, code-switched speech, and song lyrics. It is aimed at ML engineers, researchers, and developers building real-world voice applications, and it targets production-grade accuracy in challenging audio conditions.

- Website: https://platform.xiaomimimo.com/docs/usage-guide/speech-synthesis-v2.5
- Categories: Audio
- Verified: no
- Canonical: https://toolhaus.ai/product/mimo-v2-5-voice
