Monday, 16 June 2025
27.4 C
Singapore
30.6 C
Thailand
23.9 C
Indonesia
28.7 C
Philippines

Microsoft introduces the Phi-3 Mini, its most compact AI model to date

Microsoft launches Phi-3 Mini, a compact AI model designed for efficiency and tailored applications, now available on Azure and other platforms.

Microsoft has officially unveiled the Phi-3 Mini, the latest and smallest addition to its lineup of AI models. As part of a trio of new releases, the Phi-3 Mini kicks off the series with a compact framework featuring 3.8 billion parameters. This model is specifically designed for a smaller data set, in contrast to the expansive datasets used by larger models such as GPT-4. Available now on platforms like Azure, Hugging Face, and Ollama, the Phi-3 Mini is just the beginning, with the upcoming releases of Phi-3 Small and Phi-3 Medium boasting 7 billion and 14 billion parameters, respectively.

Parameters in this context are indicative of the model’s ability to process and understand complex instructions. Following the release of the Phi-2 in December, which matched the performance of larger models, Microsoft asserts that the new Phi-3 Mini surpasses its predecessor in efficiency and capability. This advancement means that the Phi-3 Mini can deliver responses with the level of sophistication expected from models ten times its size.

Tailored learning through innovative methods

Eric Boyd, the Corporate Vice President of Microsoft Azure AI Platform, explained to The Verge how the Phi-3 Mini manages to achieve such high performance. “It’s comparable to larger LLMs like GPT-3.5, just in a smaller form factor,” said Boyd. The development team employed a novel training approach they call a “curriculum,” inspired by the learning progression seen in children. This method involved using simplified text structures and vocabulary, akin to children’s literature, to effectively train the Phi-3 Mini on complex topics.

To supplement the limited availability of children’s books, the team created over 3,000 simplified “children’s books” using a larger language model. This innovative approach not only facilitated the training of the Phi-3 but also enhanced its capabilities in coding and reasoning, building upon the foundations laid by its predecessors.

The Phi-3 series, while knowledgeable in general topics, does not rival the comprehensive data processing capacity of a full-scale model like the GPT-4. However, Boyd highlights that for many companies, the smaller, more focused models like the Phi-3 Mini are more suitable for their specific applications. These models require less computational power, making them significantly more cost-effective, particularly for businesses working with smaller internal datasets.

In conclusion, Microsoft’s Phi-3 Mini represents a significant step forward in the development of AI models tailored for specific tasks and industries. By combining advanced capabilities with cost efficiency, Microsoft is paving the way for more accessible and versatile AI solutions.

Hot this week

Singapore Airlines and PALO IT test generative AI for faster software development

Singapore Airlines and PALO IT successfully trial Gen-e2, an AI-first software development approach powered by GitHub Copilot.

Xiaomi SU7 Ultra joins Gran Turismo 7 in new global partnership

Xiaomi’s SU7 Ultra electric vehicle joins Gran Turismo 7 in a new partnership, with future plans including a concept car co-developed with the game.

Qualcomm to buy UK chipmaker Alphawave Semi for US$2.4 billion

Qualcomm will buy UK-based Alphawave Semi for US$2.4B to boost its data centre tech and expand beyond smartphone chips.

Resident Evil Requiem returns to Raccoon City with new story and hero, coming February 2026

Resident Evil Requiem, which launches on February 27, 2026, takes you back to Raccoon City with a new lead and chilling story.

New Relic adds Model Context Protocol support to improve AI observability

New Relic adds MCP support to its AI Monitoring tool, enabling deeper visibility across AI agents, protocols, and backend systems.

Informatica deepens partnership with Databricks to support new Iceberg and OLTP services

Informatica joins Databricks as launch partner for new Iceberg and OLTP solutions, introducing AI tools to speed up GenAI development.

Hong Kong opens skies to larger drones in bid to grow low-altitude economy

Hong Kong will allow the testing of larger drones to boost its low-altitude economy and improve logistics, following mainland China's lead.

Hong Kong to build new AI supercomputing centre in bid to lead global tech race

Hong Kong plans a new AI supercomputing centre to boost its tech hub status and support growing start-ups across the Greater Bay Area.

Steam adds full native support for Apple Silicon Macs

Steam runs natively on Apple Silicon Macs, ditching Rosetta 2 for smoother performance and better gaming on M1 and M2 devices.

Related Articles

Popular Categories