<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Xiaomi MiMo Archives - Gizmochina</title>
	<atom:link href="https://www.gizmochina.com/tag/xiaomi-mimo/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.gizmochina.com/tag/xiaomi-mimo/</link>
	<description>Latest Tech News, Product Reviews and Deals</description>
	<lastBuildDate>Tue, 09 Jun 2026 01:46:22 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=5.9.9</generator>
	<item>
		<title>Xiaomi MiMo-V2.5-Pro gets UltraSpeed Mode, breaks 1,000 tokens/second speed on standard GPUs</title>
		<link>https://www.gizmochina.com/2026/06/09/xiaomi-mimo-v2-5-pro-ultraspeed-mode-1000-tokens-per-second/</link>
		
		<dc:creator><![CDATA[Rajesh Regmi]]></dc:creator>
		<pubDate>Tue, 09 Jun 2026 01:45:31 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Xiaomi]]></category>
		<category><![CDATA[Xiaomi MiMo]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=741345</guid>

					<description><![CDATA[<img width="300" height="200" src="https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-300x200.jpg?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Xiaomi MiMo 7 billion parameter LLM" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-300x200.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-768x512.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-696x464.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-630x420.jpg 630w, https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM.jpg 900w" sizes="(max-width: 300px) 100vw, 300px" /><p>Xiaomi&#8216;s large language model family, MiMo, has officially launched UltraSpeed mode for MiMo-V2.5-Pro. Developed jointly with TileRT, the 1-trillion-parameter model can run on general-purpose GPUs while breaking the 1,000 tokens-per-second generation barrier. Xiaomi says this milestone is possible through the &#8220;ultimate co-design&#8221; of the model and its underlying system. To put that in perspective, MiMo-V2-Flash, [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2026/06/09/xiaomi-mimo-v2-5-pro-ultraspeed-mode-1000-tokens-per-second/">Xiaomi MiMo-V2.5-Pro gets UltraSpeed Mode, breaks 1,000 tokens/second speed on standard GPUs</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="200" src="https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-300x200.jpg?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Xiaomi MiMo 7 billion parameter LLM" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-300x200.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-768x512.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-696x464.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM-630x420.jpg 630w, https://www.gizmochina.com/wp-content/uploads/2025/05/Xiaomi-MiMo-7-billion-parameter-LLM.jpg 900w" sizes="(max-width: 300px) 100vw, 300px" />
<p><a href="https://www.gizmochina.com/category/xiaomi/" target="_blank" rel="noreferrer noopener">Xiaomi</a>&#8216;s large language model family, MiMo, has officially launched UltraSpeed mode for MiMo-V2.5-Pro. Developed jointly with TileRT, the 1-trillion-parameter model can run on general-purpose GPUs while breaking the 1,000 tokens-per-second generation barrier.</p>



<p>Xiaomi says this milestone is possible through the &#8220;ultimate co-design&#8221; of the model and its underlying system.</p>



<figure class="wp-block-image size-large is-resized"><img loading="lazy" src="https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode-1024x551.webp?x10805" alt="Xiaomi MiMo-V2.5-Pro UltraSpeed Mode" class="wp-image-741346" width="578" height="311" srcset="https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode-1024x551.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode-300x162.webp 300w, https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode-768x414.webp 768w, https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode-1536x827.webp 1536w, https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode-696x375.webp 696w, https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode-1068x575.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode-780x420.webp 780w, https://www.gizmochina.com/wp-content/uploads/2026/06/Xiaomi-MiMo-V2.5-Pro-UltraSpeed-Mode.webp 1920w" sizes="(max-width: 578px) 100vw, 578px" /><figcaption>Make a Snake game in 10 seconds</figcaption></figure>



<p>To put that in perspective,<a href="https://www.gizmochina.com/2025/12/18/xiaomi-mimo-v2-flash-most-interesting-things-about-it/" target="_blank" rel="noreferrer noopener"> MiMo-V2-Flash</a>, an earlier model in the family, was already generating responses at 150 tokens per second when it launched in December 2025. It translates to roughly 110 words per second, meaning the AI is generating text faster than the fastest human can read or speak. </p>



<p>The new UltraSpeed mode pushes that ceiling much higher, with Xiaomi claiming roughly 10 times faster output than standard MiMo-V2.5-Pro API access.</p>



<h2><strong>Xiaomi MiMo-V2.5-Pro ​​UltraSpeed ​​mode is more expensive to use</strong></h2>



<p>That speed-up comes at a cost. Literally. The MiMo-V2.5-Pro-UltraSpeed API is priced at 3x the standard rate. For reference, the regular MiMo-V2.5-Pro charges 0.025 yuan per million tokens on a cache hit, 3 yuan on a cache miss for input, and 6 yuan per million tokens for output.&nbsp;</p>



<p>Meanwhile, Xiaomi says the UltraSpeed mode is a &#8220;3x price increase” but offers a “10x output experience.&#8221; Note that the Token Plan is not supported for UltraSpeed; this is API trial access only.</p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="640" src="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-1024x640.png?x10805" alt="" class="wp-image-719785" srcset="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-1024x640.png 1024w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-300x188.png 300w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-768x480.png 768w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-696x435.png 696w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-1068x668.png 1068w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-672x420.png 672w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo.png 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>



<p>Due to the constrained supply of high-speed inference resources, Xiaomi is running an application-based trial from June 9 to June 23, 2026. There&#8217;s no guaranteed approval timeline or success rate, and Xiaomi says it will prioritize enterprises and professional developers with genuine business needs.</p>



<p>Those who get the approval will get a two-week free Chat experience, with some guardrails to keep things fair: a maximum of 10 queue entries per account per day, sessions capped at 30 minutes, and an automatic resource release if idle for more than 5 minutes.</p>



<p><a href="https://www.gizmochina.com/2026/03/19/xiaomi-unveils-mimo-v2-pro-its-flagship-llm-with-over-1tb-of-parameters/" target="_blank" rel="noreferrer noopener">MiMo-V2.5-Pro itself launched in April 2026</a> as part of Xiaomi&#8217;s growing model family, which now spans text, voice, and multimodal capabilities. </p>



<p>For more daily updates, please visit our&nbsp;<a href="https://www.gizmochina.com/news/" target="_blank" rel="noreferrer noopener"><strong>News Section</strong></a>.</p>



<p><strong>Stay ahead in tech!</strong> Join our <a href="https://t.me/gizmochinaofficial" target="_blank" rel="noreferrer noopener">Telegram community</a> and <a href="https://gizmochina.beehiiv.com/subscribe" target="_blank" rel="noreferrer noopener">sign up for our daily newsletter</a> of <em>top stories!</em> <img src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>



<p>I<a href="https://ultraspeed.xiaomimimo.com/#/" target="_blank" rel="noreferrer noopener">Via</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2026/06/09/xiaomi-mimo-v2-5-pro-ultraspeed-mode-1000-tokens-per-second/">Xiaomi MiMo-V2.5-Pro gets UltraSpeed Mode, breaks 1,000 tokens/second speed on standard GPUs</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/

Object Caching 32/35 objects using Redis
Page Caching using Disk: Enhanced 
Content Delivery Network Full Site Delivery via cloudflare
Database Caching 15/21 queries in 0.005 seconds using Redis
Fragment Caching 2/3 fragments using Redis

Served from: www.gizmochina.com @ 2026-06-09 01:53:35 by W3 Total Cache
-->