<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>LLM Archives - Gizmochina</title>
	<atom:link href="https://www.gizmochina.com/tag/llm/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.gizmochina.com/tag/llm/</link>
	<description>Latest Tech News, Product Reviews and Deals</description>
	<lastBuildDate>Thu, 18 Dec 2025 06:22:56 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=5.9.9</generator>
	<item>
		<title>Xiaomi MiMo-V2-Flash LLM Just Dropped: These Are the Most Interesting Things About It</title>
		<link>https://www.gizmochina.com/2025/12/18/xiaomi-mimo-v2-flash-most-interesting-things-about-it/</link>
		
		<dc:creator><![CDATA[Soumyakanti]]></dc:creator>
		<pubDate>Thu, 18 Dec 2025 06:22:10 +0000</pubDate>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[AI News]]></category>
		<category><![CDATA[Featured]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Top Stories]]></category>
		<category><![CDATA[Launch]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[Xiaomi]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=719782</guid>

					<description><![CDATA[<img width="300" height="181" src="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-300x181.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Xiaomi" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-300x181.webp 300w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-1024x619.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-768x464.webp 768w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-696x421.webp 696w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-1068x645.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-695x420.webp 695w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi.webp 1248w" sizes="(max-width: 300px) 100vw, 300px" /><p>Xiaomi has unveiled its most advanced open-source large language model to date, called MiMo-V2-Flash, as part of its expanding push into foundation AI. The new model focuses on high-speed performance and an efficient architecture, with strong capabilities in reasoning and code generation. Xiaomi positions MiMo-V2-Flash as a direct competitor to leading models such as DeepSeek [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2025/12/18/xiaomi-mimo-v2-flash-most-interesting-things-about-it/">Xiaomi MiMo-V2-Flash LLM Just Dropped: These Are the Most Interesting Things About It</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="181" src="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-300x181.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Xiaomi" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-300x181.webp 300w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-1024x619.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-768x464.webp 768w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-696x421.webp 696w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-1068x645.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-695x420.webp 695w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi.webp 1248w" sizes="(max-width: 300px) 100vw, 300px" />
<p><a href="https://www.gizmochina.com/category/xiaomi/" target="_blank" rel="noreferrer noopener">Xiaomi</a> has unveiled its most advanced open-source large language model to date, called MiMo-V2-Flash, as part of its expanding push into foundation AI. The new model focuses on high-speed performance and an efficient architecture, with strong capabilities in reasoning and code generation.</p>



<p>Xiaomi positions MiMo-V2-Flash as a direct competitor to leading models such as DeepSeek V3.2 and Claude 4.5 Sonnet. Let’s take a closer look at how the model works, its key features, and how to access it.</p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="640" src="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-1024x640.png?x10805" alt="Xiaomi MiMo" class="wp-image-719785" srcset="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-1024x640.png 1024w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-300x188.png 300w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-768x480.png 768w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-696x435.png 696w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-1068x668.png 1068w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-672x420.png 672w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo.png 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>



<h3><strong>Purpose-Built for Speed and Agents</strong></h3>



<p>MiMo-V2-Flash is a Mixture-of-Experts (MoE) model with 309 billion total parameters and 15 billion active parameters. The model is purpose-built for AI agent scenarios and multi-turn interactions that require fast inference.</p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="626" src="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-1024x626.png?x10805" alt="Xiaomi MiMo-V2-Flash" class="wp-image-719784" srcset="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-1024x626.png 1024w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-300x183.png 300w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-768x469.png 768w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-696x425.png 696w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-1068x652.png 1068w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-687x420.png 687w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1.png 1226w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>



<p>Xiaomi uses a 1:5 hybrid attention architecture, which combines Global Attention and Sliding Window Attention (SWA) with a 128-token window. The native context length is 32,000 tokens, and the model is trained with support for up to 256,000 tokens.</p>



<p>This design helps MiMo-V2-Flash maintain high efficiency while scaling across long-context tasks. Xiaomi claims it delivers output faster than several leading models, including DeepSeek and Claude, while maintaining lower operational costs.</p>



<h3><strong>Benchmark Performance and Pricing</strong></h3>



<p>Benchmark results show MiMo-V2-Flash performing at the top tier across various domains. The model ranks in the top two among open-source models in reasoning tasks such as AIME 2025 and GPQA-Diamond.</p>



<p>In software engineering benchmarks like SWE-Bench Verified and SWE-Bench Multilingual, it outperforms other open-source models and reaches levels comparable to GPT-5 and Claude 4.5 Sonnet.</p>



<p>Xiaomi has priced the API at $0.1 per million input tokens and $0.3 per million output tokens. The API is currently available for free for a limited time. According to the company, MiMo-V2-Flash generates responses at 150 tokens per second, while maintaining only 2.5% of Claude’s inference cost.</p>



<h3><strong>Technical Innovations Inside</strong></h3>



<p>The architecture includes Multi-Token Prediction (MTP), which allows the model to generate multiple tokens in parallel and verify them before output. This method increases decoding throughput without increasing attention or memory overhead. Xiaomi reports that with a three-layer MTP, the model reaches 2.0 to 2.6 times speed improvement compared to standard decoding.</p>



<div class="wp-block-image"><figure class="aligncenter size-full"><img loading="lazy" width="715" height="304" src="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1.jpg?x10805" alt="Xiaomi MiMo-V2-Flash" class="wp-image-719783" srcset="https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1.jpg 715w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-300x128.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2025/12/Xiaomi-MiMo-V2-Flash-1-696x296.jpg 696w" sizes="(max-width: 715px) 100vw, 715px" /></figure></div>



<p>Xiaomi also introduced a new post-training method called Multi-Teacher Online Policy Distillation (MOPD). The technique uses multiple teacher models to guide the student through token-level rewards in an on-policy learning process. It allows the model to achieve high performance with less than 1/50th of the training resources needed in traditional RL pipelines. MOPD also supports plug-and-play teachers, enabling continuous self-improvement cycles.</p>



<h3><strong>How to Access It?</strong></h3>



<p>Xiaomi has launched a web AI chat interface called MiMo Studio at aistudio.xiaomimimo.com, allowing users to interact directly with the model. The service supports web search, agent workflows, and code generation. It also features a toggle for switching between instant replies and slower “thinking” responses for deeper reasoning.</p>



<p>The model can generate functional HTML web pages and integrates well with development tools like Claude Code and Cursor. <a href="https://mimo.xiaomi.com/blog/mimo-v2-flash" target="_blank" rel="noreferrer noopener">Xiaomi has also showcased</a> creative and functional web demos.</p>



<h3><strong>Fully Open-Source</strong></h3>



<p>MiMo-V2-Flash is fully open-source under the MIT license. Model weights are available on Hugging Face, and all inference code is published on GitHub.</p>



<p>The company contributed inference code to SGLang on launch day and aims to grow developer adoption by offering transparent, low-cost access to high-performance AI tools.</p>



<p>MiMo-V2-Flash reflects Xiaomi’s shift toward becoming a serious player in the <a href="https://www.gizmochina.com/tag/ai/" target="_blank" rel="noreferrer noopener">AI</a> space. It brings competitive reasoning, fast code generation, and efficient agent deployment to the open-source ecosystem.</p>



<p>In related AI news, China has <a href="https://www.gizmochina.com/2025/12/15/china-changsha-traffic-police-ai-smart-glasses/" target="_blank" rel="noreferrer noopener">equipped traffic police with AI-powered smart glasses</a> for real-time vehicle inspections, while a separate report highlights how even so-called <a href="https://www.gizmochina.com/2025/12/14/lack-of-control-inside-an-all-ai-company-even-ai-employees-need-humans/" target="_blank" rel="noreferrer noopener">“all-AI companies” still require human oversight</a> due to limits in autonomous decision-making.</p>



<div style="height:100px" aria-hidden="true" class="wp-block-spacer"></div>



<p>For more daily updates, please visit our<a href="https://www.gizmochina.com/news/">&nbsp;<strong>News Section</strong></a>.</p>



<p><strong>Stay ahead in tech!</strong> Join our <a href="https://t.me/gizmochinaofficial" target="_blank" rel="noreferrer noopener">Telegram community</a> and <a href="https://gizmochina.beehiiv.com/subscribe" target="_blank" rel="noreferrer noopener">sign up for our daily newsletter</a> of <em>top stories!</em> <img src="https://s.w.org/images/core/emoji/13.1.0/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2025/12/18/xiaomi-mimo-v2-flash-most-interesting-things-about-it/">Xiaomi MiMo-V2-Flash LLM Just Dropped: These Are the Most Interesting Things About It</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>India working on affordable AI models to rival ChatGPT &#038; DeepSeek</title>
		<link>https://www.gizmochina.com/2025/02/02/india-working-on-affordable-ai-models-to-rival-chatgpt-deepseek/</link>
		
		<dc:creator><![CDATA[Sean]]></dc:creator>
		<pubDate>Sun, 02 Feb 2025 10:45:33 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[ChatGPT]]></category>
		<category><![CDATA[DeepSeek]]></category>
		<category><![CDATA[India]]></category>
		<category><![CDATA[LLM]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=671552</guid>

					<description><![CDATA[<img width="293" height="300" src="https://www.gizmochina.com/wp-content/uploads/2025/01/image-87-293x300.png?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="AI chip" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2025/01/image-87-293x300.png 293w, https://www.gizmochina.com/wp-content/uploads/2025/01/image-87-411x420.png 411w, https://www.gizmochina.com/wp-content/uploads/2025/01/image-87-356x364.png 356w, https://www.gizmochina.com/wp-content/uploads/2025/01/image-87.png 577w" sizes="(max-width: 293px) 100vw, 293px" /><p>Artificial Intelligence has taken the world by storm with LLM (Large Language Models) being immensely popular for their diverse functionality. ChatGPT is a great example of an AI model, along with new disruptive models like DeepSeek from China. But now, it appears that India seeks to rival these with its own AI model, which could [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2025/02/02/india-working-on-affordable-ai-models-to-rival-chatgpt-deepseek/">India working on affordable AI models to rival ChatGPT &amp; DeepSeek</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="293" height="300" src="https://www.gizmochina.com/wp-content/uploads/2025/01/image-87-293x300.png?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="AI chip" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2025/01/image-87-293x300.png 293w, https://www.gizmochina.com/wp-content/uploads/2025/01/image-87-411x420.png 411w, https://www.gizmochina.com/wp-content/uploads/2025/01/image-87-356x364.png 356w, https://www.gizmochina.com/wp-content/uploads/2025/01/image-87.png 577w" sizes="(max-width: 293px) 100vw, 293px" />
<p>Artificial Intelligence has taken the world by storm with LLM (Large Language Models) being immensely popular for their diverse functionality. ChatGPT is a great example of an AI model, along with new disruptive models like DeepSeek from China. But now, it appears that India seeks to rival these with its own AI model, which could be arriving as early as this year.</p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="683" src="https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1024x683.jpg?x10805" alt="" class="wp-image-615524" srcset="https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1024x683.jpg 1024w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-300x200.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-768x512.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1536x1024.jpg 1536w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-2048x1365.jpg 2048w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-696x464.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1068x712.jpg 1068w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1920x1280.jpg 1920w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-630x420.jpg 630w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption>OpenAI and ChatGPT (REUTERS/Dado Ruvic/Illustration/File Photo)</figcaption></figure></div>



<h2>Indian Govt to soon launch an affordable AI model</h2>



<p>During a recent AI event, Ashwini Vaishnaw, the Union Minister of Electronics and Information Technology stated that India is working on its own foundational AI model. The minister further added that this will function similarly to DeepSeek and ChatGPT but for an affordable development cost. The government official stated that this new AI model could be ready in just 8 to 10 months.</p>



<div class="wp-block-image"><figure class="aligncenter size-full"><img loading="lazy" width="781" height="439" src="https://www.gizmochina.com/wp-content/uploads/2025/02/Ashwini-Vaishnaw-on-AI.jpg?x10805" alt="India AI Model rival ChatGPT" class="wp-image-671558" srcset="https://www.gizmochina.com/wp-content/uploads/2025/02/Ashwini-Vaishnaw-on-AI.jpg 781w, https://www.gizmochina.com/wp-content/uploads/2025/02/Ashwini-Vaishnaw-on-AI-300x169.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2025/02/Ashwini-Vaishnaw-on-AI-768x432.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2025/02/Ashwini-Vaishnaw-on-AI-696x391.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2025/02/Ashwini-Vaishnaw-on-AI-747x420.jpg 747w" sizes="(max-width: 781px) 100vw, 781px" /></figure></div>



<p>In the event by the Indian AI Mission, Ashwini Vaishnaw revealed that researchers in India have been developing an AI ecosystem framework to support its own foundational AI model. This is being developed to offer an experience tailored to Indian users. It will also understand the linguistic and contextual needs of the Indian users, bringing inclusivity while eliminating biases.</p>



<p>The Union Minister of Electronics and Information Technology also talked about India&#8217;s computation prowess since the domestic AI model is being developed with a computational facility that employs 18,693 GPUs. To recall, ChatGPT was trained using around 25,000 GPUs, while DeepSeek was trained with 2,000 GPUs. </p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="640" src="https://www.gizmochina.com/wp-content/uploads/2025/01/DeepSeek-1024x640.png?x10805" alt="DeepSeek" class="wp-image-671096" srcset="https://www.gizmochina.com/wp-content/uploads/2025/01/DeepSeek-1024x640.png 1024w, https://www.gizmochina.com/wp-content/uploads/2025/01/DeepSeek-300x188.png 300w, https://www.gizmochina.com/wp-content/uploads/2025/01/DeepSeek-768x480.png 768w, https://www.gizmochina.com/wp-content/uploads/2025/01/DeepSeek-696x435.png 696w, https://www.gizmochina.com/wp-content/uploads/2025/01/DeepSeek-1068x668.png 1068w, https://www.gizmochina.com/wp-content/uploads/2025/01/DeepSeek-672x420.png 672w, https://www.gizmochina.com/wp-content/uploads/2025/01/DeepSeek.png 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption>DeepSeek</figcaption></figure></div>



<p>A typical popular AI model like ChatGPT costs about $3 to use for an hour, India&#8217; AI model could cost just Rs 100 (roughly $1.15) thanks to government subsidy. This news also arrives after UC Berkeley researchers managed to <a href="https://www.gizmochina.com/2025/01/31/uc-berkeley-researchers-managed-to-replicate-deepseek-ai-for-only-30/" target="_blank" rel="noreferrer noopener">replicate DeepSeek AI for only $30</a>.</p>



<p>For more daily updates, please visit our <a href="https://www.gizmochina.com/news/" target="_blank" rel="noreferrer noopener">News Section</a>.</p>



<p>Tech enthusiast? Get the latest news first! Follow<a href="https://t.me/gizmochinaofficial"> our Telegram channel</a> and<a href="https://gizmochina.beehiiv.com/subscribe"> subscribe to our free newsletter</a> for your daily tech fix!</p>



<p></p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2025/02/02/india-working-on-affordable-ai-models-to-rival-chatgpt-deepseek/">India working on affordable AI models to rival ChatGPT &amp; DeepSeek</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>GPT-4 Outperformed Junior &#038; Trainee Eye Doctors on a Mock Exam</title>
		<link>https://www.gizmochina.com/2024/04/18/gpt-4-outperformed-junior-eye-doctors/</link>
		
		<dc:creator><![CDATA[Anubhav]]></dc:creator>
		<pubDate>Thu, 18 Apr 2024 14:48:30 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[GPT-4]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[OpenAI]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=619669</guid>

					<description><![CDATA[<img width="300" height="200" src="https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-300x200.jpg?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-300x200.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1024x683.jpg 1024w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-768x512.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1536x1024.jpg 1536w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-2048x1365.jpg 2048w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-696x464.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1068x712.jpg 1068w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1920x1280.jpg 1920w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-630x420.jpg 630w" sizes="(max-width: 300px) 100vw, 300px" /><p>A new study suggests large language models (LLMs) like GPT-4 may have a future in ophthalmology, but limitations and risks remain. Researchers from Cambridge University tested GPT-4, along with other LLMs, against human ophthalmologists on a mock exam. GPT-4 answered 60 out of 87 questions correctly in the exam The results were intriguing. GPT-4 answered [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/04/18/gpt-4-outperformed-junior-eye-doctors/">GPT-4 Outperformed Junior &amp; Trainee Eye Doctors on a Mock Exam</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="200" src="https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-300x200.jpg?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-300x200.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1024x683.jpg 1024w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-768x512.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1536x1024.jpg 1536w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-2048x1365.jpg 2048w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-696x464.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1068x712.jpg 1068w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-1920x1280.jpg 1920w, https://www.gizmochina.com/wp-content/uploads/2024/04/Chatgpt-630x420.jpg 630w" sizes="(max-width: 300px) 100vw, 300px" />
<p>A new study suggests large language models (<a href="http://gizmochina.com/tag/llm">LLMs</a>) like <a href="http://gizmochina.com/tag/gpt-4">GPT-4</a> may have a future in ophthalmology, but limitations and risks remain. Researchers from Cambridge University tested GPT-4, along with other LLMs, against human ophthalmologists on a mock exam.</p>



<h3>GPT-4 answered 60 out of 87 questions correctly in the exam</h3>



<p>The results were intriguing. GPT-4 answered 60 out of 87 questions correctly, exceeding the performance of trainee doctors (average: 59.7) and junior doctors (average: 37). However, it fell short of the average score achieved by expert ophthalmologists (66.4). Other LLMs, like PaLM 2 and <a href="http://gizmochina.com/tag/gpt-3.5">GPT-3.5</a>, performed less impressively.</p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="683" src="https://www.gizmochina.com/wp-content/uploads/2023/11/ChatGPT-2-1024x683.webp?x10805" alt="ChatGPT" class="wp-image-581877" srcset="https://www.gizmochina.com/wp-content/uploads/2023/11/ChatGPT-2-1024x683.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2023/11/ChatGPT-2-300x200.webp 300w, https://www.gizmochina.com/wp-content/uploads/2023/11/ChatGPT-2-768x512.webp 768w, https://www.gizmochina.com/wp-content/uploads/2023/11/ChatGPT-2-696x464.webp 696w, https://www.gizmochina.com/wp-content/uploads/2023/11/ChatGPT-2-1068x712.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2023/11/ChatGPT-2-630x420.webp 630w, https://www.gizmochina.com/wp-content/uploads/2023/11/ChatGPT-2.webp 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>



<p>While these findings hint at potential benefits, researchers highlight significant risks. The study&#8217;s limited question pool raises concerns about generalizability. More importantly, LLMs are prone to &#8220;hallucinating,&#8221; fabricating information that could lead to misdiagnosis of serious conditions like cataracts or cancer. Additionally, the lack of nuance inherent in LLMs could exacerbate inaccuracies.</p>



<p>The study clearly emphasizes the need for further research and development before LLMs can be considered reliable tools for medical diagnosis. Since there is a lot of risk involved in anything concerning medical diagnoses, we might have to wait for a long time before LLMs are incorporated in mainstream medical situations. </p>



<p><strong><span style="text-decoration: underline">RELATED:</span></strong></p>



<ul><li><a href="https://www.gizmochina.com/2024/04/12/chatgpt-gpt-4-turbo-upgrade/">ChatGPT Gets Smarter for Premium Users with GPT-4 Turbo Upgrade</a></li><li><a href="https://www.gizmochina.com/2024/04/05/gpt-5-june-likely-invites-out/">GPT-5 will Likely Come out in June, Red Team Testing Invites are being Sent Out</a></li><li><a href="https://www.gizmochina.com/2024/04/11/lenovo-legion-y700-2024-latest-gaming-tablet-with-enhanced-display-now-available-at-giztop/">Lenovo Legion Y700 2024: Latest Gaming Tablet with Enhanced Display now available at Giztop</a></li><li><a href="https://www.gizmochina.com/2024/04/11/redmagic-magnetic-vc-cooler-5-pro-now-available-at-a-discounted-price-at-geekwills/">REDMAGIC Magnetic VC Cooler 5 Pro now available at a discounted price at GeekWills</a></li><li><a href="https://www.gizmochina.com/2024/04/06/top-hyperos-features-you-absolutely-cant-miss/">Top 6 HyperOS Features You Absolutely Can’t Miss</a></li></ul>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="SAMSUNG Galaxy S24 Ultra vs OnePlus 12 Global: Gaming! Gaming! All About Gaming!" width="696" height="392" src="https://www.youtube.com/embed/TkHvL7v-_f0?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://www.engadget.com/gpt-4-performed-close-to-the-level-of-expert-doctors-in-eye-assessments-131517436.html">Via</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/04/18/gpt-4-outperformed-junior-eye-doctors/">GPT-4 Outperformed Junior &amp; Trainee Eye Doctors on a Mock Exam</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>A New Open Source LLM, DBRX Claims to be the Most Powerful &#8211; Here are the Scores</title>
		<link>https://www.gizmochina.com/2024/03/28/open-source-llm-dbrx-powerful/</link>
		
		<dc:creator><![CDATA[Anubhav]]></dc:creator>
		<pubDate>Thu, 28 Mar 2024 01:50:47 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Large Language Models]]></category>
		<category><![CDATA[LLM]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=614238</guid>

					<description><![CDATA[<img width="300" height="107" src="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="DBRX" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066.webp 660w" sizes="(max-width: 300px) 100vw, 300px" /><p>A whole new contender has entered the ring of large language models (LLMs). Databricks, a company specializing in data processing, has unveiled DBRX, claiming it to be the most powerful open-source LLM yet. But is it backing those claims up? Let&#8217;s find out. 132 billion parameters is a big number &#8211; GPT-3.5 has 175 billion [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/28/open-source-llm-dbrx-powerful/">A New Open Source LLM, DBRX Claims to be the Most Powerful &#8211; Here are the Scores</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="107" src="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="DBRX" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066.webp 660w" sizes="(max-width: 300px) 100vw, 300px" />
<p>A whole new contender has entered the ring of<a href="http://gizmochina.com/tag/large-language-models"> large language models</a> (LLMs). Databricks, a company specializing in data processing, has unveiled DBRX, claiming it to be the most powerful open-source <a href="http://gizmochina.com/tag/llm">LLM </a>yet. But is it backing those claims up? Let&#8217;s find out.</p>



<h2>132 billion parameters is a big number &#8211; GPT-3.5 has 175 billion parameters</h2>



<p>DBRX utilizes a transformer architecture and boasts a massive 132 billion parameters. It leverages a unique approach called a Mixture-of-Experts (MoE) model, consisting of 16 individual expert networks. During any given task, only 4 of these experts are active, utilizing 36 billion parameters for efficiency. <a href="http://gizmochina.com/tag/gpt-4">GPT 4</a> also uses an MoE model. </p>



<div class="wp-block-image"><figure class="aligncenter size-full"><img loading="lazy" width="660" height="235" src="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066.webp?x10805" alt="DBRX" class="wp-image-614239" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066.webp 660w, https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp 300w" sizes="(max-width: 660px) 100vw, 660px" /></figure></div>



<p>Databricks compares DBRX to other prominent open-source LLMs like <a href="http://gizmochina.com/tag/meta">Meta</a>&#8216;s <a href="http://gizmochina.com/tag/llama">Llama </a>2-70B, Mixtral (from France&#8217;s MixtralAI), and Grok-1 (developed by <a href="http://gizmochina.com/tag/elon-musk">Elon Musk</a>&#8216;s xAI). DBRX reportedly outperforms its rivals in several key areas:</p>



<ul><li><strong>Language Understanding:</strong>&nbsp;DBRX achieves a score of 73.7%, surpassing GPT-3.5 (70.0%), Llama 2-70B (69.8%), Mixtral (71.4%), and Grok-1 (73.0%).</li><li><strong>Programming Ability:</strong>&nbsp;Here, DBRX demonstrates a significant lead with a score of 70.1%, compared to GPT-3.5&#8217;s 48.1%, Llama 2-70B&#8217;s 32.3%, Mixtral&#8217;s 54.8%, and Grok-1&#8217;s 63.2%.</li><li><strong>Mathematics:</strong>&nbsp;DBRX takes another win with a score of 66.9%, edging out<a href="http://gizmochina.com/tag/gpt-3.5"> GPT-3.5 </a>(57.1%), Llama 2-70B (54.1%), Mixtral (61.1%), and <a href="http://gizmochina.com/tag/grok">Grok</a>-1 (62.9%).</li></ul>



<p>Databricks attributes DBRX&#8217;s speed to its MoE architecture, built upon their MegaBlocks research and open-source projects. This allows the model to output tokens at a very high rate. Additionally, Databricks positions DBRX as the most advanced open-source MoE model currently available, potentially paving the way for future advancements in the field.</p>



<p>The open-source nature of DBRX allows for wider adoption and contribution from the <a href="http://gizmochina.com/tag/developers">developer </a>community. This could accelerate further development and potentially solidify DBRX&#8217;s position as a leading LLM.</p>



<iframe loading="lazy" src="https://embeds.beehiiv.com/7eb90650-7442-41c3-92e8-0c32f95fdb2c" data-test-id="beehiiv-embed" width="100%" height="320" frameborder="0" scrolling="no" style="border-radius: 4px; border: 2px solid #e5e7eb; margin: 0; background-color: transparent;"></iframe>



<p><strong><span style="text-decoration: underline">RELATED:</span></strong></p>



<ul><li><a href="https://www.gizmochina.com/2023/12/11/alibaba-launches-seallm-ai-southeast-asian-languages/">Alibaba Launches SeaLLM, an AI for Southeast Asian Languages</a></li><li><a href="https://www.gizmochina.com/2023/11/16/baidu-ceo-china-focus-ai/">Baidu’s CEO Warns Against Solely Prioritizing the Launch of New LLMs in China</a></li><li><a href="https://www.gizmochina.com/2023/11/07/get-100-off-on-lenovo-legion-y700-2023-gamin-tablet-at-giztop/">Lenovo Legion Y700 2023: Save $100 on this 8-inch gaming Android tablet</a></li><li><a href="https://www.gizmochina.com/2023/12/29/unlock-savings-take-3-off-on-every-giztop-product-under-the-new-year-sale-extravaganza/">Unlock Savings: Discount on Every Giztop Product under the New Year Sale&nbsp;</a></li><li><a href="https://www.gizmochina.com/how-to/turn-off-samsung-phone-without-using-screen/">How to turn off any Samsung phone without using screen (5 methods)</a></li></ul>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Redmi K70 SD &amp; Pro Full Review: Best 2K straight screen phones to buy" width="696" height="392" src="https://www.youtube.com/embed/UKlraVz6eH4?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://www.ithome.com/0/758/552.htm">Via</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/28/open-source-llm-dbrx-powerful/">A New Open Source LLM, DBRX Claims to be the Most Powerful &#8211; Here are the Scores</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Google has introduced VideoPOET breaking new ground in coherent video generation</title>
		<link>https://www.gizmochina.com/2023/12/21/google-videopoet-10-second-coherent-video-generation/</link>
		
		<dc:creator><![CDATA[Debasish]]></dc:creator>
		<pubDate>Thu, 21 Dec 2023 07:17:37 +0000</pubDate>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[LLM]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=592509</guid>

					<description><![CDATA[<img width="300" height="164" src="https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-300x164.png?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-300x164.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-1024x559.png 1024w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-768x420.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-1536x839.png 1536w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-696x380.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-1068x580.png 1068w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-769x420.png 769w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2.png 1618w" sizes="(max-width: 300px) 100vw, 300px" /><p>After Microsoft&#8216;s Copilot AI gets the ability to generate audio clips from text prompts, Google has introduced VideoPoet, a large language model (LLM) that pushes the boundaries in video generation with 10-second clips that produce fewer artifacts. The model supports an array of video generation tasks, including text-to-video conversion, image-to-video transformation, video stylization, inpainting, and [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2023/12/21/google-videopoet-10-second-coherent-video-generation/">Google has introduced VideoPOET breaking new ground in coherent video generation</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="164" src="https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-300x164.png?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-300x164.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-1024x559.png 1024w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-768x420.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-1536x839.png 1536w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-696x380.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-1068x580.png 1068w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2-769x420.png 769w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-2.png 1618w" sizes="(max-width: 300px) 100vw, 300px" />
<p>After <a href="https://www.gizmochina.com/tag/microsoft/" target="_blank" rel="noreferrer noopener">Microsoft</a>&#8216;s Copilot AI<a href="https://www.gizmochina.com/2023/12/20/microsofts-copilot-teams-up-with-suno-for-ai-music-creation-in-edge/" target="_blank" rel="noreferrer noopener"> gets the ability to generate audio clips</a> from text prompts, <a href="https://www.gizmochina.com/category/google/" target="_blank" rel="noreferrer noopener">Google</a> has introduced VideoPoet, a large language model (LLM) that pushes the boundaries in video generation with 10-second clips that produce fewer artifacts. The model supports an array of video generation tasks, including text-to-video conversion, image-to-video transformation, video stylization, inpainting, and video-to-audio functionalities.</p>



<h2>It generates 10-sec video clips from text prompts and is also able to animate still images</h2>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="349" src="https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-1024x349.png?x10805" alt="" class="wp-image-592515" srcset="https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-1024x349.png 1024w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-300x102.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-768x262.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-1536x524.png 1536w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-696x237.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-1068x364.png 1068w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-1920x655.png 1920w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET-1231x420.png 1231w, https://www.gizmochina.com/wp-content/uploads/2023/12/VideoPOET.png 1999w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>



<p>Unlike its predecessors, VideoPoet sets itself apart by excelling in the generation of coherent large-motion videos. The model showcases its prowess by producing ten-second long videos, leaving its competition, including Gen-2 behind. Notably, VideoPoet doesn&#8217;t rely on specific data for video generation, distinguishing it from other models that require detailed input for optimal results.</p>



<p>This multifaceted capability is made possible by leveraging a multi-modal large model, setting it on a trajectory to potentially become the mainstream in video generation.</p>



<p>Google&#8217;s VideoPOET takes a departure from the prevailing trend in video generation models, which predominantly rely on diffusion-based approaches. Instead, VideoPoet harnesses the power of large language models (LLMs). The model seamlessly integrates various video generation tasks within a single LLM, eliminating the need for separately trained components for each function.</p>



<p>The resulting videos exhibit variable length and diverse actions and styles based on the input text content. Additionally, VideoPoet can perform the conversion of input images into animations based on provided prompts, showcasing its adaptability across different inputs.</p>



<p>The release of VideoPOET adds a new dimension to <a href="https://www.gizmochina.com/tag/ai/" target="_blank" rel="noreferrer noopener">AI</a>-driven video generation, hinting at the possibilities that lie ahead in 2024.</p>



<p><strong>Related:</strong></p>



<ul><li><a href="https://www.gizmochina.com/2023/11/27/get-xiaomi-13-ultra-premium-5g-phone-for-as-low-as-799-at-giztop/">Xiaomi 13 Ultra Premium Camera Phone is now only $799</a></li><li><a href="https://www.gizmochina.com/2023/11/19/get-100-discount-on-alldocube-iwork-gt-12-at-geekwills-coupon/">Alldocube iWork GT 12: AMD 2-in-1 laptop, $100 off and free keyboard</a></li><li><a href="https://www.gizmochina.com/2023/12/05/get-100-off-on-xiaomi-14-pro-at-giztop-1tb-variant/">Get $100 OFF on Xiaomi 14 Pro at Giztop (1TB Variant)</a></li><li><a href="https://www.gizmochina.com/guides/best-apple-watch-cases-in-2023-spigen-otterbox-casetify-more%ef%bf%bc/">Best Apple Watch Cases in 2023: Spigen, Otterbox, Casetify &amp; More</a></li><li><a href="https://www.gizmochina.com/guides/best-feature-phones-with-upi-support-2023-nokia-dominates/">Best Feature Phones with UPI support 2023: Nokia Dominates</a></li></ul>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="OnePlus 12 Review: The OnePlus Phone With The Fewest Cons" width="696" height="392" src="https://www.youtube.com/embed/XqxF96Kq-l8?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://sites.research.google/videopoet/">Source</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2023/12/21/google-videopoet-10-second-coherent-video-generation/">Google has introduced VideoPOET breaking new ground in coherent video generation</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Alibaba Launches SeaLLM, an AI for Southeast Asian Languages</title>
		<link>https://www.gizmochina.com/2023/12/11/alibaba-launches-seallm-ai-southeast-asian-languages/</link>
		
		<dc:creator><![CDATA[Anubhav]]></dc:creator>
		<pubDate>Mon, 11 Dec 2023 15:01:51 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Alibaba]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[China]]></category>
		<category><![CDATA[LLM]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=589689</guid>

					<description><![CDATA[<img width="300" height="170" src="https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-300x170.png?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Alibaba" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-300x170.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-1024x579.png 1024w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-768x434.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-696x393.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-1068x604.png 1068w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-743x420.png 743w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825.png 1228w" sizes="(max-width: 300px) 100vw, 300px" /><p>Alibaba&#8216;s Damo Academy, in a calculated, precise move to strengthen its footprint in Southeast Asia, has unveiled a new AI-driven language model specifically designed for this diverse region. This innovative tool, called SeaLLM, is a testament to Alibaba&#8217;s recognition of Southeast Asia&#8217;s potential as a key market. It&#8217;s tailored to understand and interact in languages [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2023/12/11/alibaba-launches-seallm-ai-southeast-asian-languages/">Alibaba Launches SeaLLM, an AI for Southeast Asian Languages</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="170" src="https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-300x170.png?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Alibaba" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-300x170.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-1024x579.png 1024w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-768x434.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-696x393.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-1068x604.png 1068w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-743x420.png 743w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825.png 1228w" sizes="(max-width: 300px) 100vw, 300px" />
<p><a href="http://gizmochina.com/tag/alibaba">Alibaba</a>&#8216;s Damo Academy, in a calculated, precise move to strengthen its footprint in Southeast Asia, has unveiled a new <a href="http://gizmochina.com/tag/ai">AI</a>-driven <a href="http://gizmochina.com/tag/llm">language model</a> specifically designed for this diverse region. This innovative tool, called SeaLLM, is a testament to Alibaba&#8217;s recognition of Southeast Asia&#8217;s potential as a key market. It&#8217;s tailored to understand and interact in languages like Vietnamese, Indonesian, Thai, Malay, and several others, demonstrating a significant leap in bridging linguistic and cultural gaps in AI technology.</p>



<h3>Southeast Asia&#8217;s linguistic diversity contributes to multiple AI applications</h3>



<p>The development of SeaLLM is particularly noteworthy given the linguistic diversity of Southeast Asia. This region, with its myriad of languages, presents unique challenges and opportunities for AI applications. By focusing on languages that are often underrepresented in global technology advancements, Alibaba is not just expanding its market reach but also contributing to the inclusiveness and accessibility of AI technology.</p>



<figure class="wp-block-image size-large"><img loading="lazy" width="1024" height="579" src="https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-1024x579.png?x10805" alt="Alibaba" class="wp-image-529192" srcset="https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-1024x579.png 1024w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-300x170.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-768x434.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-696x393.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-1068x604.png 1068w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825-743x420.png 743w, https://www.gizmochina.com/wp-content/uploads/2023/04/Screenshot-2023-04-12-011825.png 1228w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>



<p>Furthermore, SeaLLM&#8217;s enhanced capabilities in handling non-Latin scripts and its superior performance in understanding and translating low-resource languages is a game-changer. It means that businesses and communities in Southeast Asia can leverage AI more effectively, fostering better communication and understanding across different cultures.</p>



<p>This move by Alibaba also signifies a broader trend in the AI landscape, where regional customization is becoming increasingly important. As AI technology becomes more pervasive, its ability to cater to specific regional needs and languages will be crucial in determining its success and impact.</p>



<p>However, despite these advancements, the AI industry, particularly in <a href="http://gizmochina.com/tag/china">China</a>, faces ongoing challenges. Issues like US chip restrictions and the search for more universally appealing applications are hurdles that need to be addressed. Nonetheless, innovations like SeaLLM are steps in the right direction, showcasing how AI can be more inclusive and beneficial to a wider range of communities.</p>



<p><strong><span style="text-decoration: underline">RELATED:</span></strong></p>



<ul><li><a href="https://www.gizmochina.com/2023/11/30/alibaba-ai-integration-customer-relationship-management/">Alibaba Introduces an AI-powered Customer Relationship Management Tool</a></li><li><a href="https://www.gizmochina.com/2023/11/29/jack-ma-new-prepackaged-food-company-china/">Alibaba’s Co-Founder Jack Ma is Now Treading into the Prepackaged Food Industry in China</a></li><li><a href="https://www.gizmochina.com/2023/11/07/get-100-off-on-lenovo-legion-y700-2023-gamin-tablet-at-giztop/">Lenovo Legion Y700 2023: Save $100 on this 8-inch gaming Android tablet</a></li><li><a href="https://www.gizmochina.com/guides/best-messaging-apps-for-android-in-2023/">Best Messaging Apps for Android in 2023</a></li></ul>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Xiaomi 14 Pro Full Review: Maybe it should be called Xiaomi 14 Plus" width="696" height="392" src="https://www.youtube.com/embed/q1iokKhAZ5k?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://www.scmp.com/tech/article/3244693/alibaba-research-unit-unveils-llm-tailored-southeast-asia-e-commerce-giant-pushes-ai-fast-growth">Via</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2023/12/11/alibaba-launches-seallm-ai-southeast-asian-languages/">Alibaba Launches SeaLLM, an AI for Southeast Asian Languages</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>China’s Baidu launched an industry-grade medical AI model</title>
		<link>https://www.gizmochina.com/2023/09/19/chinas-baidu-launched-an-industry-grade-medical-ai-model/</link>
		
		<dc:creator><![CDATA[Anurag]]></dc:creator>
		<pubDate>Tue, 19 Sep 2023 15:47:53 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Baidu]]></category>
		<category><![CDATA[Lingyi]]></category>
		<category><![CDATA[LLM]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=568000</guid>

					<description><![CDATA[<img width="300" height="169" src="https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-300x169.jpg?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="baidu" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-300x169.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-1024x576.jpg 1024w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-768x432.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-696x392.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-1068x601.jpg 1068w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-747x420.jpg 747w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu.jpg 1280w" sizes="(max-width: 300px) 100vw, 300px" /><p>Hot on the heels of launching its AI chatbot Ernie, China’s Baidu has now unveiled another large language model (LLM) with the aim of improving the digitization and intelligence of the healthcare industry. The new industry-grade AI model is called Lingyi (machine translation gives “Spiritual Doctor”) and it is currently available for trial use in [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2023/09/19/chinas-baidu-launched-an-industry-grade-medical-ai-model/">China’s Baidu launched an industry-grade medical AI model</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="169" src="https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-300x169.jpg?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="baidu" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-300x169.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-1024x576.jpg 1024w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-768x432.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-696x392.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-1068x601.jpg 1068w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-747x420.jpg 747w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu.jpg 1280w" sizes="(max-width: 300px) 100vw, 300px" />
<p>Hot on the heels of launching its AI chatbot <a href="https://www.gizmochina.com/2023/09/04/baidus-ernie-bot-takes-china-by-storm-but-faces-challenges-in-public-debut/">Ernie</a>, China’s Baidu has now unveiled another large language model (LLM) with the aim of improving the digitization and intelligence of the healthcare industry. The new industry-grade AI model is called Lingyi (machine translation gives “Spiritual Doctor”) and it is currently available for trial use in both upstream and downstream healthcare sectors.</p>



<p>The LLM can generate structured medical records from free-text input and accurately analyze and generate patient complaints, medical histories, and more based on doctor-patient conversations. It can do simultaneous parsing of multiple Chinese and English medical literature articles, enabling intelligent question-answering based on the content of the literature.</p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="576" src="https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-1024x576.jpg?x10805" alt="baidu" class="wp-image-568001" srcset="https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-1024x576.jpg 1024w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-300x169.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-768x432.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-696x392.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-1068x601.jpg 1068w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu-747x420.jpg 747w, https://www.gizmochina.com/wp-content/uploads/2023/09/baidu.jpg 1280w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>



<p>When it comes to assisting in diagnosis and treatment, the Lingyi Large Model offers real-time understanding of a patient&#8217;s condition through multi-turn dialogues. It assists doctors in diagnosing diseases, and recommending treatment plans. It also serves as a 24-hour &#8220;healthcare manager&#8221; for patients and provides various capabilities to pharmaceutical companies, including professional training and medical information support, among others.</p>



<p>Currently, the Lingyi Large Model is available is offered in Lite, flagship, and custom versions, each tailored to different needs and application scenarios. Baidu will let partners access the large model through API integration or embed it as plugins into existing product systems.</p>



<p><em><a href="https://www.ithome.com/0/720/221.htm">ITHome</a></em> reports Baidu has already partnered with companies like Gushengtang and Ling Jiashe and is selectively open to over 200 medical institutions, including public hospitals, pharmaceutical companies, internet hospital platforms, and chain pharmacies.</p>



<p><strong>RELATED</strong>:</p>



<ul><li><a href="https://www.gizmochina.com/guides/oppo-find-n3-flip-vs-oppo-find-n2-flip-specs-comparison/">Oppo Find N3 Flip vs Oppo Find N2 Flip: Specs Comparison</a></li><li><a href="https://www.gizmochina.com/2023/08/30/china-high-bandwidth-memory-chips/">From Dependency to Independence: China’s Pursuit of Homegrown HBM Technology</a></li><li><a href="https://www.gizmochina.com/2023/01/30/baidu-ai-chatbot-rivaling-openai-chatgpt/">Baidu to launch AI Chatbot, Rivaling OpenAI’s ChatGPT</a></li><li><a href="https://www.gizmochina.com/2023/08/30/microsoft-xbox-gaming-ai-team/">Microsoft aims to revolutionize gaming using AI, with Xbox Gaming AI Team</a></li><li><a href="https://www.gizmochina.com/2023/05/22/microsoft-bing-overtakes-baidu-china-desktop-search-engine/">Microsoft Bing surpasses Baidu as China’s leading desktop search engine</a></li></ul>



<p><iframe loading="lazy" title="IROK FE 98 Pro Wireless Mechanical Keyboard Review: Great keyboard for just getting started!" width="696" height="392" src="https://www.youtube.com/embed/qnnoIo0it_A?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe></p>



<p><a href="https://finance.sina.com.cn/tech/roll/2023-09-19/doc-imznhaca8017865.shtml">(Via)</a></p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2023/09/19/chinas-baidu-launched-an-industry-grade-medical-ai-model/">China’s Baidu launched an industry-grade medical AI model</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/

Object Caching 94/134 objects using Redis
Page Caching using Disk: Enhanced 
Content Delivery Network Full Site Delivery via cloudflare
Database Caching 10/34 queries in 0.013 seconds using Redis
Fragment Caching 2/3 fragments using Redis

Served from: www.gizmochina.com @ 2026-05-12 18:05:10 by W3 Total Cache
-->