Microsoft is expanding its Phi-3 family of small language models with the introduction of Phi-3-vision. Unlike its siblings, Phi-3-vision isn’t just focused on text – it’s a multimodal model that can analyze and understand images as well.
This 4.2 billion parameter model is designed to run on mobile devices and excels at general visual reasoning tasks. Users can ask Phi-3-vision questions about images or charts, and it answers based on their visual content. It is not an image generation tool like DALL-E or Stable Diffusion; its strength is image analysis and comprehension.

The arrival of Phi-3-vision comes on the heels of Phi-3-mini, the smallest member of the Phi-3 family at 3.8 billion parameters. The complete family now includes Phi-3-mini, Phi-3-vision, Phi-3-small (7 billion parameters), and Phi-3-medium (14 billion parameters).
This focus on smaller models reflects a growing trend in AI development. Smaller models require less processing power and memory, making them ideal for mobile devices and other resource-constrained environments. Microsoft has already seen success with this approach, with its Orca-Math model reportedly surpassing larger competitors in solving math problems. Phi-3-vision is currently available in preview, while the rest of the Phi-3 family (mini, small, and medium) can be accessed through Azure’s model library.