Get all your news in one place.
100’s of premium titles.
One app.
Start reading
Tom’s Guide
Tom’s Guide
Technology
Ryan Morrison

MetaAI Voice could win the race to create the ultimate voice assistant — here's why

MetaAI.

Meta is the latest in a growing number of technology companies to embrace the idea of voice as the way we interact with technology. It joins OpenAI with ChatGPT Voice, Google with Gemini Voice and of course the old guard, Alexa from Amazon and Siri from Apple.

Unlike Siri and Alexa, MetaAI Voice sits firmly in the conversational category and there is a good reason for that — the company needed a better way for people to interact with its Ray-Ban smart glasses, Quest VR headsets, and general devices without access to a keyboard or touch screen.

Conversational AI voice allows you to talk to the AI in natural language as if you were talking to a human. It allows it to handle complex and vague queries. For example, in the Meta Connect Demo Mark Zuckerberg suggested holding an avocado up to the Meta Ray-Ban smart glasses and saying “What can I make with this?” without specifying the nature of "this".

Meta has done something Google and OpenAI haven't, though. It offers up the voices of the famous instead of an unnamed actor or generated voice. Initially, you'll be able to converse with an AI that sounds like Dame Judi Dench, John Cena, Kristen Bell, and more.

Unfortunately, the quality of synthetic voice isn’t up there with Gemini or ChatGPT Voice but you can interrupt it mid-flow and ask it the same level of natural queries. It is accessible on WhatsApp, Facebook Messenger and Instagram.

Why is MetaAI Voice a big deal

(Image credit: Future)

While MetaAI Voice might be less realistic and natural than ChatGPT Advanced Voice, the one thing it has in its favor is the Meta ecosystem. More than three billion people around the world use at least one of Meta's core products every day. MetaAI has over 400 million active monthly users and it is only really available in the States.

The text-based version is there within all the core products and looks the same whether you open it in WhatsApp, Instagram, Facebook or Messenger. Right now you can use it to generate images, have a text-based conversation and even play games. With voice, you’ll be able to leave it on the desk and chat away as you go about other tasks.

MetaAI also now uses Llama 3.2 90b as its "brain". This is a new multimodal model from Meta that can analyze images as well as text. It is likely future versions will also be able to work with more sounds, documents and even video — if it matches the progress of OpenAI’s models.

This means that, at the touch of a button in any of the apps you use everyday, you’ll be able to start talking to an AI. You’ll be able to give it a photo you’ve just taken, ask it for details of the image or to change an aspect of the image such as removing an unsightly trash can.

The real power of MetaAI Voice will be felt by those wearing the Ray-Ban Smart Glasses or a Quest headset. These devices will be able to see the world as you do and allow you to talk to the AI about anything you see in real-time.

More from Tom's Guide

Sign up to read this article
Read news from 100’s of titles, curated specifically for you.
Already a member? Sign in here
Related Stories
Top stories on inkl right now
Our Picks
Fourteen days free
Download the app
One app. One membership.
100+ trusted global sources.