Microsoft works with Qualcomm for ‘on-device’ AI models

2023-05-26
关注

  •  

Microsoft and chipmaker Qualcomm are creating new ‘on-device’ AI models on Snapdragon processors running Windows 11, potentially paving the way for local and offline access to generative AI capabilities such as image and text generation. It comes as Microsoft confirmed it was bringing its GPT-4-powered Copilot AI into Windows 11.

Qualcomm says its 'on-device' version of Stable Diffusion has 1 billion parameters but will increase beyond 10bn in the future (Photo: Qualcomm)
Qualcomm says its ‘on-device’ version of Stable Diffusion has 1 billion parameters but will increase beyond 10bn in the future. (Photo courtesy of Qualcomm)

Announced during Microsoft’s Build developer conference, Qualcomm says its offline AI models are designed to reduce the load on cloud providers and reduce the cost of generative AI by bringing some of the processing to the edge. The company says it would allow for “more affordable, reliable, and private” generation.

Also announced during Build was tighter integration of Microsoft’s Copilot AI system with its full range of products including Windows 11 and Microsoft 365. It will sit in the Windows sidebar and work in a similar way to Bing AI chat. Users will be able to type questions or ask it to complete tasks within the operating system such as changing desktop backgrounds. Bing search, and the ability to browse the web, is also coming to OpenAI’s ChatGPT, which currently runs off a fixed dataset rather than being able to search up-to-date sources.

While all of those models currently rely on access to the internet and expensive cloud services, provided through Microsoft’s Azure platform, future versions may be more local.

Qualcomm has been developing an “AI Stack” that integrates with its Snapdragon processors to allow for more scalable foundation AI models. The company says it will allow OEMs and developers to distribute AI applications without relying on cloud providers. 

Qualcomm Snapdragon’s AI advantage

“For generative AI to become truly mainstream, much of the inferencing will need to be executed on edge devices,” said Ziad Asghar, senior vice president of product management at Qualcomm.

It is expected there will always be a hybrid approach of cloud and on-device processing for AI systems, but being able to add the on-device element is more cost-effective. Qualcomm says its AI Engine processes workloads more efficiently than running on a GPU or CPU, which in turn allows them to be run on small, thin and lightweight devices including phones, laptops and tablets.

Qualcomm displayed a range of tools for developing generative AI inside Windows 11 on device, including the Stable Diffusion text-to-image AI model. A version with more than a billion parameters is successfully operating ‘on-device’ and it has a path for 10 billion-plus versions of the model. It is also working on bringing large language models to the edge.

Content from our partners

<strong>How to get the best of both worlds in the hybrid cloud</strong>

How to get the best of both worlds in the hybrid cloud

The key to good corporate cybersecurity is defence in depth

The key to good corporate cybersecurity is defence in depth

Cybersecurity in 2023 is a two-speed system

Cybersecurity in 2023 is a two-speed system

“Both cloud and device processing are needed to extend AI across the vast universe of devices and applications,” said Pavan Davuluri, corporate VP, Windows silicon and system integration at Microsoft. “By bringing together Microsoft’s cloud AI leadership and the capabilities of the Windows platform with Qualcomm Technologies’ on-device AI expertise, we will accelerate the opportunities for generative AI experiences.” 

View all newsletters Sign up to our newsletters Data, insights and analysis delivered to you By The Tech Monitor team

Locally running AI models aren’t a new thing. There is a large open-source developer community working with versions of Meta’s leaked LlaMA large language model and having it operate on old laptops. Other groups are building text-to-speech and text-to-image models but these tend to have significant file sizes or require a good graphics card. Qualcomm is aiming to create solutions that work more seamlessly and efficiently.

It’s “no secret” that Qualcomm and Microsoft have been working closely on silicon for some time says Geoff Blaber CEO at analyst company CCS Insight. “A big driver behind that is not just what Qualcomm can deliver in terms of Arm-based CPU power and efficiency but its roadmap for AI acceleration,” Blaber says. “With the announcement of Windows Copilot, it’s clear that Microsoft is embracing AI at the heart of all its tools and platforms and for that it has very specific requirements of the underlying silicon.”

Blaber adds: “Microsoft’s vision for generative AI at the heart of its tools and platforms can’t scale if it depends solely on the cloud. It would be orders of magnitude too expensive, highly inefficient and the user experience would fall short. We’re going to see a blend of AI running on-device, in the cloud and a hybrid combination of the two. On-device generative AI isn’t feasible on mass-market Windows hardware today, but it’s a clear priority and direction of travel for the Windows ecosystem. This is the real basis for Microsoft’s close cooperation with Qualcomm.”

Microsoft beefs up AI offering with Build announcements

Build is Microsoft’s developer conference, and AI was at the core of many of its new products. A new Azure AI Studio will make it easier to integrate external data sources into the OpenAI APIs available through the Azure cloud. OpenAI’s APIs are also being made more widely available to users of the platform. Developers will be able to create plugins to integrate their tools, data and services into any Microsoft product running Copilot, which includes Windows and Office applications.

The plugins are a result of the plugin standard developed by OpenAI for ChatGPT. This means any plugins built for ChatGPT will also work across all Microsoft products including Teams. “Developers can now use one platform to build plugins that work across both consumer and business surfaces, including ChatGPT, Bing, Dynamics 365 Copilot (in preview) and Microsoft 365 Copilot,” a spokesperson for Microsoft said.

The same plugins currently available for paying users of ChatGPT will be added to Bing including OpenTable, Wolfram Alpha and Klarma. In return, Microsoft is adding its own Bing search functionality to ChatGPT. “Now, answers are grounded by search and web data and include citations so users can learn more, all directly from within chat,” said Microsoft.

An AI-powered analytics platform called Fabric for enterprise-grade data was also launched at the conference. This brings together existing tools including Power BI, Data Factory and Synapse into a single product.

Read more: Dell taps Azure, RedHat and VMware for new Apex Cloud Platform

Topics in this article : AI , Microsoft , Qualcomm

  •  

您觉得本篇内容如何
评分

相关产品

Jewell Instruments 杰威尔 Models 59560 & 59562 倾角传感器

双轴900型是一种廉价的重力基准测斜仪,具有模拟电压输出和紧凑的尺寸。它的体积小,性能好,是许多OEM、测试和测量应用的理想选择。它有高增益、标准和广角版本,每个版本都有不同的角度范围。900型接受广泛的输入电压范围,并提供高水平的单端输出,易于用任何电压表或数字记录系统测量。粘性阻尼和温度测量可供选择。

Bronkhorst 布琅轲锶特 Models P-602CM 压力仪表

Bronkhorst高科技的金属密封压力表和控制器以其独特的、专利的金属-金属密封结构和优秀的再密封能力为特点。此外,它们的特点是表面质量高,因此特别适合于满足半导体工业的要求以及其他高纯气体的应用。压力表和控制器的底座有1\/4\"端面密封阳(VCR)或downport过程连接"、"最低范围7…"350 mbar(0,1…绝对的或相对的。\最高射程1,28…64 bar(18…现在的仪器都配备了一个数字式个人电脑板,具有高精度、极佳的温度稳定性和快速的响应能力。

Visual Sound LGS-300 扬声器

.","• 8 Ohms and 70 Volts models. • 6.5\" and 4\" speaker models, • 15, 40, 60 & 100 Watt models. • 360

Cole-Parmer GO-39800-40 红外线温度计

Use the plastics models in thin film (under 0.4 mm) plastics applications such as lamination and filmThe metals models are ideal for forging, forming, and extruding operations.Note: The metals models are not recommended for use with aluminum.The plastics models 39800-42 and -43 and metals models 39800-47 and -48 feature single lasers that indicate|Choose from two laser types-class IIIa laser models for maximum brightness, or class II models where

Pearson Electronics 皮尔逊 4160 电流传感器

Accuracy ±1% or better, initial pulse response for all models, with a high impedance load such as 1 megOhmAll models listed below come with a BNC connector except as noted and are to be used with a 50 Ohm coaxial

Rosemount / Emerson 罗斯蒙特 Model 3900 氧化还原电位(ORP)电极

Models 3900 and 3900VP are provided with a double junction reference, which protects the reference element

OMEGA Engineering, Inc. 欧米茄 PSW21 & PSW22 Series 真空开关

• Adjustable Setpoint • 1\/8 NPT or Center Spout for 1\/8 ID Tubing • Pressure and Vacuum Models Available

Myron L 麦隆 752II 水质检测仪器

All models are corrected to 25°C. The TC may be disabled to conform with USP requirements.","Standard on all controller models is a heavy-duty, 10-amp output relay, operating on either increasing,"Digital and Analog 750 Series II models have an IP65\/NEMA 4X water-resistant & corrosion-proof ratedAt 152 x 122 mm\/6 x 4.8 in., all models are suitable for panel, bench or surface mounting."

Parker Hannifin / Instrumentation Group 派克汉尼汾 F150-AHR-0 转子流量计

Models F65 and F150 Forged Body Flowmeters are variable area flowmeters featuring a compact, one-pieceBoth models have a wraparound window for full 180° visibility of the flow tube and are available withaluminum, brass or 316 stainless steel wetted parts. - Models F65 and F150 Forged Body Flowmeters areBoth models have a wraparound window for full 180° visibility of the flow tube and are available with

Sitron CF420RM 液体流量计

Both of these models offer reliable liquid flow monitoring, with the flexibility of a separate panelAll models can be ordered with a great variety of threaded, flange, or sanitary process connections."

评论

您需要登录才可以回复|注册

提交评论

提取码
复制提取码
点击跳转至百度网盘