Microsoft CEO Satya Nadella
Satya Nadella’s Microsoft threw down a challenge to Apple with a claim that its new PCs will leapfrog Apple’s MacBooks © AP

A year and a half after ChatGPT brought the subject of artificial intelligence to mass public attention, most people would be forgiven for wondering: when is AI going to make a big difference to my life?

That question resonates particularly loudly during Big Tech’s annual developer conference season, which began in the middle of May. This is the moment in the year when the tech companies lay out their stalls and try to wow customers with their vision of the immediate tech future.

The arrival of ChatGPT may have grabbed the popular imagination, but for most people typing questions into a text-based chatbot is of limited interest. Since then, most of the focus in tech circles has been on the race to build the capabilities needed to deliver generative AI on a mass-market scale, rather than its uses. The headlines have been dominated by news of evermore powerful large language models, the splurge of spending on powerful new chips and the proliferation of huge, power-hungry data centres needed to process AI.

Now, these powerful technical capabilities are moving closer to the actual users of technology. The biggest news from Microsoft this month was a new generation of AI-enabled PCs, to be launched this year under the brand Copilot+, which will be powerful enough to handle AI without needing to call on a remote data centre.

In the process, Microsoft threw down a challenge to Apple with a claim that the new PCs will leapfrog Apple’s MacBooks. An AI arm’s race is now in full swing in the personal computing and smartphone worlds.  

None of this, though, has done much to answer the overriding questions for most consumers: how — and when — will all this expensive new technology make things better for me? So far, generative AI has brought a proliferation of text boxes online offering to answer questions (including in services such as Meta’s WhatsApp and Instagram); offers to help write emails or documents; and various services that summarise blocks of text, including the web digests that Google has started to provide at the top of its search results. It is unclear yet how much people are actually using these features.

As this month’s events have underlined, the tech companies harbour a much bigger ambition than this. Their goal: personal digital assistants capable of anticipating a user’s needs and intermediating much of their online activity, as well as digital agents that can go a step further and take actions on behalf of a user. These ideas were a centrepiece of Google’s event two weeks ago and Microsoft last week, as well as the announcement of a new model from OpenAI, called GPT-4o.

Yet if this is AI’s biggest promise, it is just that — a promise.
Two fundamental problems remain unsolved. One involves making AI models that are trained on historic data respond understand whatever new situation they are put in and respond appropriately. In the words of Demis Hassabis, head of Google’s AI research division, AI needs to be able to “understand and respond to our complex and dynamic world, just as we do”.

That is a tall order. The challenge isn’t just to avoid the “hallucinations”, or occasional glaring mistakes, that AI systems are prone to. It also means having a full understanding of context, in order to consistently deliver truly helpful results. Google claims to have made big strides in this department, building an extended “context window” into its latest Gemini models to enable the system to maintain an awareness of complex situations. But if the technology needs to match humans in its understanding of the world, there is a lot still to prove.

Another, related problem is to make communicating with AI as natural as talking to a person. Only at that point, according to the people building the systems, will the technology come into its own.

Microsoft chief executive Satya Nadella said this would involve learning “how to build computers that understand us, instead of us having to understand computers”. Despite his claim that this goal is tantalisingly close to being realised, others, including Hassabis, warn that trying to produce “natural” interactions with a computer remains “a very high bar”.

OpenAI gave one glimpse of what might lie ahead with a demonstration of GPT-4o, an AI model designed to work in an informal, conversational style. Yet the gap between a staged demonstration and an effective, real-world product is still large. It remains hard to predict when AI will make its big breakthrough into the consumer world.

richard.waters@ft.com

Copyright The Financial Times Limited 2024. All rights reserved.
Reuse this content (opens in new window) CommentsJump to comments section

Comments