<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Large Language Models Archives - Gizmochina</title>
	<atom:link href="https://www.gizmochina.com/tag/large-language-models/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.gizmochina.com/tag/large-language-models/</link>
	<description>Latest Tech News, Product Reviews and Deals</description>
	<lastBuildDate>Tue, 23 Jul 2024 18:02:08 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=5.9.9</generator>
	<item>
		<title>Meta Unveils New Open-source Language Model, Llama 3.1 with an increased Context Length of 128K tokens</title>
		<link>https://www.gizmochina.com/2024/07/23/meta-new-open-source-llm-llama-3-1/</link>
		
		<dc:creator><![CDATA[Anubhav]]></dc:creator>
		<pubDate>Tue, 23 Jul 2024 18:02:01 +0000</pubDate>
				<category><![CDATA[Meta]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Large Language Models]]></category>
		<category><![CDATA[LLama]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=640744</guid>

					<description><![CDATA[<img width="300" height="169" src="https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-300x169.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Meta Llama 3.1" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-300x169.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-1024x576.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-768x432.webp 768w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-696x392.webp 696w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-1068x601.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-747x420.webp 747w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png.webp 1440w" sizes="(max-width: 300px) 100vw, 300px" /><p>Meta unveiled its latest open-source language model, Llama 3.1, on July 23rd. This new iteration boasts several improvements, including enhanced inference capabilities, broader multilingual support, and a significant increase in context length to 128K tokens. The new LLM is comparable to GPT-4, GPT-4o, and Claude 3.5 Sonnet The star of the show is the flagship 405B parameter Llama 3.1-405B. This powerhouse model, according [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/07/23/meta-new-open-source-llm-llama-3-1/">Meta Unveils New Open-source Language Model, Llama 3.1 with an increased Context Length of 128K tokens</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="169" src="https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-300x169.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Meta Llama 3.1" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-300x169.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-1024x576.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-768x432.webp 768w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-696x392.webp 696w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-1068x601.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-747x420.webp 747w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png.webp 1440w" sizes="(max-width: 300px) 100vw, 300px" />
<p><a href="http://gizmochina.com/tag/meta">Meta</a> unveiled its latest open-source language model, <a href="http://gizmochina.com/tag/llama">Llama </a>3.1, on July 23rd. This new iteration boasts several improvements, including enhanced inference capabilities, broader multilingual support, and a significant increase in context length to 128K tokens.</p>



<h3>The new LLM is comparable to GPT-4, GPT-4o, and Claude 3.5 Sonnet</h3>



<p>The star of the show is the flagship 405B parameter Llama 3.1-405B. This powerhouse model, according to Meta, rivals the performance of leading closed-source models in tasks like common-sense reasoning, guidance, mathematics, tool use, and multilingual translation. Meta compares its capabilities to <a href="http://gizmochina.com/tag/gpt-4">GPT-4</a>, <a href="http://gizmochina.com/tag/gpt-4o">GPT-4o</a>, and Claude 3.5 Sonnet.</p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="576" src="https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-1024x576.webp?x10805" alt="" class="wp-image-640745" srcset="https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-1024x576.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-300x169.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-768x432.webp 768w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-696x392.webp 696w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-1068x601.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png-747x420.webp 747w, https://www.gizmochina.com/wp-content/uploads/2024/07/e585a03b-93a8-437f-8156-1858bf7d13cb.png.webp 1440w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>



<p>But the improvements extend beyond the top tier.&nbsp;The 8B and 70B parameter versions of Llama 3.1 are also said to be highly competitive with other open-source and closed-source models of similar sizes.</p>



<p>For those eager to experiment, Llama 3.1 is now downloadable from Meta&#8217;s official website and Hugging Face. Additionally, over 25 major partners including cloud giants like <a href="http://gizmochina.com/tag/aws">AWS</a>, Azure, and <a href="http://gizmochina.com/tag/google-cloud">Google Cloud</a>, alongside hardware manufacturers like <a href="http://gizmochina.com/tag/nvidia">Nvidia</a> and <a href="http://gizmochina.com/tag/dell">Dell</a>, have been confirmed as ready to support the new model.</p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Red Magic 9 Pro Plus Bumblebee Edition Unboxing: Impressive Co-branding!" width="696" height="392" src="https://www.youtube.com/embed/_yZ9DP_5EQc?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://www.ithome.com/0/783/796.htm">Via</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/07/23/meta-new-open-source-llm-llama-3-1/">Meta Unveils New Open-source Language Model, Llama 3.1 with an increased Context Length of 128K tokens</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>A New Open Source LLM, DBRX Claims to be the Most Powerful &#8211; Here are the Scores</title>
		<link>https://www.gizmochina.com/2024/03/28/open-source-llm-dbrx-powerful/</link>
		
		<dc:creator><![CDATA[Anubhav]]></dc:creator>
		<pubDate>Thu, 28 Mar 2024 01:50:47 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Large Language Models]]></category>
		<category><![CDATA[LLM]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=614238</guid>

					<description><![CDATA[<img width="300" height="107" src="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="DBRX" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066.webp 660w" sizes="(max-width: 300px) 100vw, 300px" /><p>A whole new contender has entered the ring of large language models (LLMs). Databricks, a company specializing in data processing, has unveiled DBRX, claiming it to be the most powerful open-source LLM yet. But is it backing those claims up? Let&#8217;s find out. 132 billion parameters is a big number &#8211; GPT-3.5 has 175 billion [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/28/open-source-llm-dbrx-powerful/">A New Open Source LLM, DBRX Claims to be the Most Powerful &#8211; Here are the Scores</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="107" src="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="DBRX" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066.webp 660w" sizes="(max-width: 300px) 100vw, 300px" />
<p>A whole new contender has entered the ring of<a href="http://gizmochina.com/tag/large-language-models"> large language models</a> (LLMs). Databricks, a company specializing in data processing, has unveiled DBRX, claiming it to be the most powerful open-source <a href="http://gizmochina.com/tag/llm">LLM </a>yet. But is it backing those claims up? Let&#8217;s find out.</p>



<h2>132 billion parameters is a big number &#8211; GPT-3.5 has 175 billion parameters</h2>



<p>DBRX utilizes a transformer architecture and boasts a massive 132 billion parameters. It leverages a unique approach called a Mixture-of-Experts (MoE) model, consisting of 16 individual expert networks. During any given task, only 4 of these experts are active, utilizing 36 billion parameters for efficiency. <a href="http://gizmochina.com/tag/gpt-4">GPT 4</a> also uses an MoE model. </p>



<div class="wp-block-image"><figure class="aligncenter size-full"><img loading="lazy" width="660" height="235" src="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066.webp?x10805" alt="DBRX" class="wp-image-614239" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066.webp 660w, https://www.gizmochina.com/wp-content/uploads/2024/03/00913eec-49cd-4d7b-a5a6-ad51449ab066-300x107.webp 300w" sizes="(max-width: 660px) 100vw, 660px" /></figure></div>



<p>Databricks compares DBRX to other prominent open-source LLMs like <a href="http://gizmochina.com/tag/meta">Meta</a>&#8216;s <a href="http://gizmochina.com/tag/llama">Llama </a>2-70B, Mixtral (from France&#8217;s MixtralAI), and Grok-1 (developed by <a href="http://gizmochina.com/tag/elon-musk">Elon Musk</a>&#8216;s xAI). DBRX reportedly outperforms its rivals in several key areas:</p>



<ul><li><strong>Language Understanding:</strong>&nbsp;DBRX achieves a score of 73.7%, surpassing GPT-3.5 (70.0%), Llama 2-70B (69.8%), Mixtral (71.4%), and Grok-1 (73.0%).</li><li><strong>Programming Ability:</strong>&nbsp;Here, DBRX demonstrates a significant lead with a score of 70.1%, compared to GPT-3.5&#8217;s 48.1%, Llama 2-70B&#8217;s 32.3%, Mixtral&#8217;s 54.8%, and Grok-1&#8217;s 63.2%.</li><li><strong>Mathematics:</strong>&nbsp;DBRX takes another win with a score of 66.9%, edging out<a href="http://gizmochina.com/tag/gpt-3.5"> GPT-3.5 </a>(57.1%), Llama 2-70B (54.1%), Mixtral (61.1%), and <a href="http://gizmochina.com/tag/grok">Grok</a>-1 (62.9%).</li></ul>



<p>Databricks attributes DBRX&#8217;s speed to its MoE architecture, built upon their MegaBlocks research and open-source projects. This allows the model to output tokens at a very high rate. Additionally, Databricks positions DBRX as the most advanced open-source MoE model currently available, potentially paving the way for future advancements in the field.</p>



<p>The open-source nature of DBRX allows for wider adoption and contribution from the <a href="http://gizmochina.com/tag/developers">developer </a>community. This could accelerate further development and potentially solidify DBRX&#8217;s position as a leading LLM.</p>



<iframe loading="lazy" src="https://embeds.beehiiv.com/7eb90650-7442-41c3-92e8-0c32f95fdb2c" data-test-id="beehiiv-embed" width="100%" height="320" frameborder="0" scrolling="no" style="border-radius: 4px; border: 2px solid #e5e7eb; margin: 0; background-color: transparent;"></iframe>



<p><strong><span style="text-decoration: underline">RELATED:</span></strong></p>



<ul><li><a href="https://www.gizmochina.com/2023/12/11/alibaba-launches-seallm-ai-southeast-asian-languages/">Alibaba Launches SeaLLM, an AI for Southeast Asian Languages</a></li><li><a href="https://www.gizmochina.com/2023/11/16/baidu-ceo-china-focus-ai/">Baidu’s CEO Warns Against Solely Prioritizing the Launch of New LLMs in China</a></li><li><a href="https://www.gizmochina.com/2023/11/07/get-100-off-on-lenovo-legion-y700-2023-gamin-tablet-at-giztop/">Lenovo Legion Y700 2023: Save $100 on this 8-inch gaming Android tablet</a></li><li><a href="https://www.gizmochina.com/2023/12/29/unlock-savings-take-3-off-on-every-giztop-product-under-the-new-year-sale-extravaganza/">Unlock Savings: Discount on Every Giztop Product under the New Year Sale&nbsp;</a></li><li><a href="https://www.gizmochina.com/how-to/turn-off-samsung-phone-without-using-screen/">How to turn off any Samsung phone without using screen (5 methods)</a></li></ul>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Redmi K70 SD &amp; Pro Full Review: Best 2K straight screen phones to buy" width="696" height="392" src="https://www.youtube.com/embed/UKlraVz6eH4?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://www.ithome.com/0/758/552.htm">Via</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/28/open-source-llm-dbrx-powerful/">A New Open Source LLM, DBRX Claims to be the Most Powerful &#8211; Here are the Scores</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Apple&#8217;s New MM1 Large Language Model Blurs the Lines Between Image and Text</title>
		<link>https://www.gizmochina.com/2024/03/16/apple-multimodal-large-language-models/</link>
		
		<dc:creator><![CDATA[Anubhav]]></dc:creator>
		<pubDate>Sat, 16 Mar 2024 19:10:14 +0000</pubDate>
				<category><![CDATA[Apple]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Large Language Models]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=611647</guid>

					<description><![CDATA[<img width="300" height="222" src="https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-300x222.png?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Apple" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-300x222.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-768x568.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-485x360.png 485w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-696x515.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-568x420.png 568w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-80x60.png 80w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637.png 819w" sizes="(max-width: 300px) 100vw, 300px" /><p>Apple&#8216;s research team has taken a huge step forward with their new &#8220;MM1&#8221; multi-modal large language model. This exciting development was detailed in a recent paper titled &#8220;MM1: Methods, Analysis &#38; Insights from Multimodal LLM Pre-training&#8221;, and it showcases a model with impressive capabilities in both image recognition and natural language reasoning. The model is [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/16/apple-multimodal-large-language-models/">Apple&#8217;s New MM1 Large Language Model Blurs the Lines Between Image and Text</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="222" src="https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-300x222.png?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Apple" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-300x222.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-768x568.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-485x360.png 485w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-696x515.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-568x420.png 568w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637-80x60.png 80w, https://www.gizmochina.com/wp-content/uploads/2023/09/Screenshot-2023-09-08-184637.png 819w" sizes="(max-width: 300px) 100vw, 300px" />
<p><a href="http://gizmochina.com/tag/apple">Apple</a>&#8216;s research team has taken a huge step forward with their new &#8220;MM1&#8221; multi-modal <a href="http://gizmochina.com/tag/large-language-model">large language model</a>. This exciting development was detailed in a recent paper titled &#8220;MM1: Methods, Analysis &amp; Insights from Multimodal LLM Pre-training&#8221;, and it showcases a model with impressive capabilities in both image recognition and natural language reasoning.</p>



<h2>The model is available in 3 billion, 7 billion and 30 billion parameter sizes</h2>



<p>MM1 comes in three sizes: 3 billion, 7 billion, and 30 billion parameters. Researchers used these models to conduct experiments, pinpointing the key factors that influence performance. Interestingly, image resolution and the number of image tags have a greater impact than visual language connectors, and different pre-training data sets can significantly affect the model&#8217;s effectiveness.</p>



<div class="wp-block-image"><figure class="aligncenter size-full"><img loading="lazy" width="995" height="604" src="https://www.gizmochina.com/wp-content/uploads/2023/11/Screenshot-2023-11-04-214259.png?x10805" alt="Apple" class="wp-image-580161" srcset="https://www.gizmochina.com/wp-content/uploads/2023/11/Screenshot-2023-11-04-214259.png 995w, https://www.gizmochina.com/wp-content/uploads/2023/11/Screenshot-2023-11-04-214259-300x182.png 300w, https://www.gizmochina.com/wp-content/uploads/2023/11/Screenshot-2023-11-04-214259-768x466.png 768w, https://www.gizmochina.com/wp-content/uploads/2023/11/Screenshot-2023-11-04-214259-696x422.png 696w, https://www.gizmochina.com/wp-content/uploads/2023/11/Screenshot-2023-11-04-214259-692x420.png 692w" sizes="(max-width: 995px) 100vw, 995px" /></figure></div>



<p>The research team meticulously built MM1 using a &#8220;Mixture of Experts&#8221; architecture and a &#8220;Top-2 Gating&#8221; method. This approach not only yielded excellent results in pre-training benchmarks, but also translated to strong performance on existing multi-modal benchmarks. Even after fine-tuning for specific tasks, MM1 models maintained competitive performance.</p>



<p>Testing revealed that the MM1-3B-Chat and MM1-7B-Chat models outperform most similarly sized competitors in the market. These models particularly shine in tasks like VQAv2 (question answering based on an image and text), TextVQA (text-based question answering about an image), and ScienceQA (scientific question answering). However, the overall performance of MM1 doesn&#8217;t quite surpass <a href="http://gizmochina.com/tag/google-gemini">Google&#8217;s Gemini</a> or <a href="http://gizmochina.com/tag/openai">OpenAI</a>&#8216;s<a href="http://gizmochina.com/tag/gpt-4"> GPT-4</a>V models (yet). While MM1 may not be the absolute leader yet, it still is a significant leap forward for Apple in artificial intelligence. The company also recently acquired DarwinAI, read more about that <a href="https://www.gizmochina.com/2024/03/15/apple-darwinai-artificial-intelligence/">here</a>.</p>



<p><strong><span style="text-decoration: underline">RELATED:</span></strong></p>



<ul><li><a href="https://www.gizmochina.com/2024/03/15/apple-darwinai-artificial-intelligence/">Apple Acquires DarwinAI, Expect Lots of Artificial Intelligence-powered Features in the Future</a></li><li><a href="https://www.gizmochina.com/2024/03/16/apple-oled-ipad-foldable-iphone/">Apple may Launch an OLED iPad Air in 2028 and a Foldable iPhone in 2026 as Per Reports</a></li><li><a href="https://www.gizmochina.com/2023/12/05/get-100-off-on-xiaomi-14-pro-at-giztop-1tb-variant/">Get $100 OFF on Xiaomi 14 Pro at Giztop (1TB Variant)</a></li><li><a href="https://www.gizmochina.com/2023/11/07/get-100-off-on-lenovo-legion-y700-2023-gamin-tablet-at-giztop/">Lenovo Legion Y700 2023: Save $100 on this 8-inch gaming Android tablet</a></li><li><a href="https://www.gizmochina.com/how-to/turn-off-samsung-phone-without-using-screen/">How to turn off any Samsung phone without using screen (5 methods)</a></li></ul>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Xiaomi 14 Ultra Full Review: I prefer to call it &quot;13S Ultra&quot;" width="696" height="392" src="https://www.youtube.com/embed/S2waR16nk1o?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://www.gizmochina.com/2024/03/16/apple-oled-ipad-foldable-iphone/">VIA</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/16/apple-multimodal-large-language-models/">Apple&#8217;s New MM1 Large Language Model Blurs the Lines Between Image and Text</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Google DeepMind&#8217;s SIMA is Training to Become Your New In-Game Teammate, Here&#8217;s How</title>
		<link>https://www.gizmochina.com/2024/03/14/google-deepmind-sima-ai-game-buddy/</link>
		
		<dc:creator><![CDATA[Anubhav]]></dc:creator>
		<pubDate>Thu, 14 Mar 2024 01:42:14 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Gaming]]></category>
		<category><![CDATA[Large Language Models]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=611117</guid>

					<description><![CDATA[<img width="300" height="200" src="https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-300x200.jpg?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-300x200.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-1024x682.jpg 1024w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-768x512.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-696x464.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-1068x712.jpg 1068w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-630x420.jpg 630w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai.jpg 1136w" sizes="(max-width: 300px) 100vw, 300px" /><p>Get ready for a new kind of gaming buddy! Google DeepMind has introduced SIMA, a large language model being trained to become your in-game teammate. Is this what AI was meant for? Sounds about right. The AI companion will perceive elements of both the map and the gameplay SIMA, which stands for &#8220;Scalable, Instructable, Multiworld [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/14/google-deepmind-sima-ai-game-buddy/">Google DeepMind&#8217;s SIMA is Training to Become Your New In-Game Teammate, Here&#8217;s How</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="200" src="https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-300x200.jpg?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-300x200.jpg 300w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-1024x682.jpg 1024w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-768x512.jpg 768w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-696x464.jpg 696w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-1068x712.jpg 1068w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai-630x420.jpg 630w, https://www.gizmochina.com/wp-content/uploads/2023/11/google-sundar-pichai.jpg 1136w" sizes="(max-width: 300px) 100vw, 300px" />
<p>Get ready for a new kind of gaming buddy! <a href="http://gizmochina.com/tag/google-deepmind">Google DeepMind</a> has introduced SIMA, a<a href="http://gizmochina.com/tag/large-language-models"> large language model </a>being trained to become your in-game teammate. Is this what AI was meant for? Sounds about right.</p>



<h3>The AI companion will perceive elements of both the map and the gameplay</h3>



<p>SIMA, which stands for &#8220;Scalable, Instructable, Multiworld Agent,&#8221; is currently under development, but it has the potential to revolutionize the way we play games. Unlike traditional <a href="http://gizmochina.com/tag/artificial-intelligence">AI </a>companions, SIMA won&#8217;t simply be another NPC character. This model is designed to be a cooperative teammate, understanding your actions and adapting its own accordingly. Imagine getting a co-op buddy in Borderlands who lets you loot first before doing it themselves. How cool would that be?</p>



<div class="wp-block-image"><figure class="aligncenter"><img src="https://img.ithome.com/newsuploadfiles/2024/3/b6cc18eb-399e-4def-a83e-944a86606010.jpg?x-bce-process=image/quality,q_75/format,f_webp" alt="" /></figure></div>



<p>To achieve this, SIMA works on a combination of <a href="http://gizmochina.com/tag/natural-language-processing">natural language processing</a> and image recognition. This allows it to perceive the 3D game world and respond to your instructions and actions. To train this AI teammate, Google has partnered with eight game developers, including big studios behind titles like No Man&#8217;s Sky and Valheim.</p>



<p>Through these collaborations, SIMA is learning the fundamentals of gameplay – from basic actions like turning left and climbing ladders to utilizing menus and maps. While complex tasks like resource gathering and camp building are beyond its current capabilities, Google expects SIMA&#8217;s skillset to expand significantly in the future. It won&#8217;t be long before gamers can use a Google AI game buddy to fill up the third slot in their <a href="http://gizmochina.com/tag/apex-legends">Apex Legends</a> Lobby. </p>



<p><strong><span style="text-decoration: underline">RELATED:</span></strong></p>



<ul><li><a href="https://www.gizmochina.com/2024/03/12/google-pixel-8-series-finally-gets-support-display-output-via-usb-c/">Google Pixel 8 series finally gets support for display output via USB-C</a></li><li><a href="https://www.gizmochina.com/2024/03/11/google-accidentally-confirms-pixel-8a-teases-a-new-battery-feature/">Google accidentally confirms Pixel 8a, teases a new battery feature</a></li><li><a href="https://www.gizmochina.com/2024/01/09/get-redmi-k70-pro-for-discounted-price-of-499-at-giztop/">Get Redmi K70 Pro for discounted price of $499</a></li><li><a href="https://www.gizmochina.com/2023/12/18/get-100-off-on-vivo-x100-pro-at-giztop/">Get $100 Off on Vivo X100 Pro at Giztop</a></li><li><a href="https://www.gizmochina.com/how-to/add-custom-gifs-and-stickers-to-whatsapp/">How to add custom GIFs and stickers to WhatsApp</a></li></ul>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Redmi K70 SD &amp; Pro Full Review: Best 2K straight screen phones to buy" width="696" height="392" src="https://www.youtube.com/embed/UKlraVz6eH4?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://www.ithome.com/0/755/525.htm">Via</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/14/google-deepmind-sima-ai-game-buddy/">Google DeepMind&#8217;s SIMA is Training to Become Your New In-Game Teammate, Here&#8217;s How</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Claude 3 is the Newest AI Chatbot Competitor, Claims to Surpass ChatGPT &#038; Google&#8217;s Gemini</title>
		<link>https://www.gizmochina.com/2024/03/05/claude-3-ai-chatbot-chatgpt/</link>
		
		<dc:creator><![CDATA[Anubhav]]></dc:creator>
		<pubDate>Tue, 05 Mar 2024 03:13:22 +0000</pubDate>
				<category><![CDATA[News]]></category>
		<category><![CDATA[AI Chatbots]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[ChatGPT]]></category>
		<category><![CDATA[Large Language Models]]></category>
		<guid isPermaLink="false">https://www.gizmochina.com/?p=609153</guid>

					<description><![CDATA[<img width="300" height="169" src="https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-300x169.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Claude 3" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-300x169.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-1024x576.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-768x432.webp 768w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-696x392.webp 696w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-1068x601.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-747x420.webp 747w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg.webp 1200w" sizes="(max-width: 300px) 100vw, 300px" /><p>A new challenger has emerged to shake up the landscape of AI and chatbots. Anthropic, an AI startup, has unveiled its &#8220;Claude 3&#8221; family, a trio of large language models (LLMs) claiming to surpass Google&#8217;s Gemini and OpenAI&#8216;s ChatGPT in various benchmarks. Claude 3 has three different variations: Haiku, Sonnet and Opus Claude 3 comes [&#8230;]</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/05/claude-3-ai-chatbot-chatgpt/">Claude 3 is the Newest AI Chatbot Competitor, Claims to Surpass ChatGPT &amp; Google&#8217;s Gemini</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></description>
										<content:encoded><![CDATA[<img width="300" height="169" src="https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-300x169.webp?x10805" class="webfeedsFeaturedVisual wp-post-image" alt="Claude 3" loading="lazy" style="display: block; margin: auto; margin-bottom: 5px;max-width: 100%;" link_thumbnail="" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-300x169.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-1024x576.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-768x432.webp 768w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-696x392.webp 696w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-1068x601.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-747x420.webp 747w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg.webp 1200w" sizes="(max-width: 300px) 100vw, 300px" />
<p>A new challenger has emerged to shake up the landscape of AI and chatbots. Anthropic, an AI startup, has unveiled its &#8220;Claude 3&#8221; family, a trio of large language models (<a href="http://gizmochina.com/tag/large-language-models">LLMs</a>) claiming to surpass <a href="http://gizmochina.com/tag/google-gemini">Google&#8217;s Gemini</a> and <a href="http://gizmochina.com/tag/openai">OpenAI</a>&#8216;s <a href="http://gizmochina.com/tag/chatgpt">ChatGPT</a> in various benchmarks.</p>



<h3>Claude 3 has three different variations: Haiku, Sonnet and Opus</h3>



<p>Claude 3 comes in three distinct flavors: Haiku, Sonnet, and Opus, each offering varying levels of capability. Anthropic boasts that the entire family delivers exceptional performance across multiple dimensions – multimodality (handling different data types), improved accuracy, enhanced context understanding, and faster response times. Additionally, the new models exhibit a greater willingness to tackle challenging questions, addressing a limitation found in earlier Claude versions that sometimes shied away from prompts deemed risky.</p>



<div class="wp-block-image"><figure class="aligncenter size-large"><img loading="lazy" width="1024" height="576" src="https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-1024x576.webp?x10805" alt="Claude 3" class="wp-image-609160" srcset="https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-1024x576.webp 1024w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-300x169.webp 300w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-768x432.webp 768w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-696x392.webp 696w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-1068x601.webp 1068w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg-747x420.webp 747w, https://www.gizmochina.com/wp-content/uploads/2024/03/9PkgmsrBcbvHErmh9NADLW-1200-80.jpg.webp 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>



<p>While all three models offer a significant performance boost, Opus takes center stage as the most potent member of the family. Anthropic claims it demonstrates &#8220;near-human levels of comprehension&#8221; for complex tasks, further showcasing its capabilities through a &#8220;Needle in a Haystack&#8221; evaluation, where it excelled at recalling information with near-perfect accuracy. Opus is also touted as a problem-solving whiz, adept at handling math challenges, generating computer code, and exhibiting superior reasoning abilities compared to <a href="http://gizmochina.com/tag/gpt-4">GPT-4</a>.</p>



<p>However, no technology is perfect, and Claude 3 is no exception. While Anthropic emphasizes improved accuracy, the issue of &#8220;hallucinations&#8221; – factually incorrect information generated by the models – persists, albeit at a significantly reduced rate compared to previous iterations. Additionally, Opus encounters some lag in responding to queries, exhibiting speeds comparable to the earlier Claude 2 model.</p>



<p>Despite these limitations, Haiku and Sonnet each have their own strengths. Haiku shines in delivering quick responses and extracting information from unstructured data, although it might stumble when faced with complex math problems. Sonnet, a larger-scale model, aims to assist users with mundane tasks, even parsing text from images. Opus, on the other hand, is ideally suited for handling large-scale operations.</p>



<p>Currently, Sonnet and Opus are available for purchase, while a free version of Claude remains accessible on Anthropic&#8217;s website. Haiku&#8217;s launch date is still under wraps, but the company assures a soon-to-come release. The primary target audience for Claude 3 appears to be businesses seeking to automate specific workflows. Users will likely encounter these models integrated into online chatbots. </p>



<p><strong><span style="text-decoration: underline">RELATED:</span></strong></p>



<ul><li><a href="https://www.gizmochina.com/2024/02/14/chatgpt-openai-memory-remember/">ChatGPT will Now Remember Things You Talked About</a></li><li><a href="https://www.gizmochina.com/2024/01/30/chatgpt-assistant-nothing-phone/">Here’s how to run ChatGPT voice assistant on your Nothing Phone</a></li><li><a href="https://www.gizmochina.com/2023/11/27/get-xiaomi-13-ultra-premium-5g-phone-for-as-low-as-799-at-giztop/">Xiaomi 13 Ultra Premium Camera Phone is now only $799</a></li><li><a href="https://www.gizmochina.com/2024/01/05/get-50-discount-on-xiaomi-band-8-pro-genshin-impact-edition-at-giztop-coupon/">Xiaomi Band 8 Genshin Impact custom edion get a huge discount.</a></li><li><a href="https://www.gizmochina.com/2023/11/07/get-100-off-on-lenovo-legion-y700-2023-gamin-tablet-at-giztop/">Lenovo Legion Y700 2023: Save $100 on this 8-inch gaming Android tablet</a></li></ul>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="HONOR Magic V2 RSR Unboxing &amp; Review: This is a Porsche you can afford." width="696" height="392" src="https://www.youtube.com/embed/vWpjj7WAO9Q?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
</div></figure>



<p>(<a href="https://www.techradar.com/computing/artificial-intelligence/chatgpt-gets-a-big-new-rival-as-anthropic-claims-its-claude-3-ais-beat-it">Via</a>)</p>
<p>The post <a rel="nofollow" href="https://www.gizmochina.com/2024/03/05/claude-3-ai-chatbot-chatgpt/">Claude 3 is the Newest AI Chatbot Competitor, Claims to Surpass ChatGPT &amp; Google&#8217;s Gemini</a> appeared first on <a rel="nofollow" href="https://www.gizmochina.com">Gizmochina</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/

Object Caching 37/100 objects using Redis
Page Caching using Disk: Enhanced 
Content Delivery Network Full Site Delivery via cloudflare
Database Caching 16/33 queries in 0.013 seconds using Redis
Fragment Caching 2/3 fragments using Redis

Served from: www.gizmochina.com @ 2026-04-20 07:52:40 by W3 Total Cache
-->