Get all your news in one place.
100’s of premium titles.
One app.
Start reading
Tom’s Guide
Tom’s Guide
Technology
Ryan Morrison

Is OpenAI about to take on Alexa and Siri? ChatGPT maker files trademark for Voice Engine

OpenAI logo on a phone screen.

OpenAI may have Apple, Amazon, and Google in its sights for its next big artificial intelligence push, taking on the voice assistant market with a new Voice Engine tool.

While ChatGPT does have a voice-friendly interface on mobile — and recently introduced a way to have it speak its responses on desktop — a new trademark application from OpenAI for the words Voice Engine relates specifically to building digital voice assistants. 

It is now possible to swap out the default voice assistant on Android. Apple seems to be in talks with a range of AI companies over the future of artificial intelligence on the iPhone, so this could be a preemptive move from OpenAI building on a potential new market.

Apple is also rumored to be opening a dedicated AI App Store with the next major upgrade to iOS, which would create a new market for AI-powered assistants.

Sam Altman, OpenAI CEO said there are "many different things" being released this year. While it is expected this will include Sora, the AI video tool it could include a new AI voice system.

What do we know about Voice Engine

(Image credit: Getty)

We don’t know much about Voice Engine or whether it will even be a product. OpenAI hasn’t commented publicly on it, so all we have is rumor and the trademark filing.

While Voice Engine could be a new model built specifically for speech applications, it is also likely this is part of an enterprise play for OpenAI. It could be building a high-quality speech system that would let companies build out more efficient call center bots.

It sounds a lot like all the pieces you'd need for a fully functional, fully interactive AI voice assistant that can not only handle complex tasks but chat naturally and even take phone calls on your behalf.

The new trademark application was filed with the U.S. Patent and Trademark Office last week. While an application doesn't necessarily mean it will result in a product, this does line up with the wider market shifting more to voice and OpenAI's direction to targeted models.

The filing covers the creation of software used for building digital voice assistants, audio generation from text prompts, voice command processing, and voice service delivery.

The full application covers the development of voice service delivery, using AI for text or-voice and text-to-audio, natural language, and speech processing, generating audio and voice from a prompt (text, speech, visual, image), processing voice commands, speech recognition, and building digital voice assistants.

That sounds a lot like all the pieces you'd need for a fully functional, fully interactive AI voice assistant that can handle complex tasks, chat naturally, and even take phone calls on your behalf.

Where does GPT-5 fit in this?

(Image credit: OpenAI)

OpenAI released GPT-4 a year ago. At the time this was a groundbreaking generative AI model that powers ChatGPT and Microsoft Copilot.

The company also started training GPT-5 late last year, resulting in speculation over its release date. Altman told podcaster Lex Fridman,  "We will release an amazing new model this year," but wouldn't confirm whether this was GPT-5 or some precursor.

He also said there would be "many different things" released over the coming months. According to OpenAI CTO Mira Murati, this will include the AI video platform Sora.

There is some speculation on social media that Sora and this new Voice Engine are different modal interfaces for GPT-5. 

It is very likely that GPT-5 will be a true multimodal model, able to understand video, images, speech, text, and code — as well as generate all those content types.

Voice Engine could be a new Assistant

Given the trademark's description, it is also possible that Voice Engine could be a new voice assistant, merging Siri, Alexa, or Google Assistant's wider capabilities with ChatGPT's reasoning and natural language capabilities.

Google has already started upgrading Gemini to work in that way, Apple is rumored to be building a new version of Siri with large language model functionality, and Amazon is already testing Alexa Plus with similar underlying skills.

OpenAI may offer Voice Engine to power such systems in the future or as an alternative interface to ChatGPT that can run on smart speakers, phones, or even headphones.

Or it could just be OpenAI playing it cautious with trademarks. It had a bid to protect GPT rejected, so it now has filed trademark applications for GPT-5, 6, and even GPT-7. The latter includes music generation, converting text and data to code, and writing code from scratch.

More from Tom's Guide

Sign up to read this article
Read news from 100’s of titles, curated specifically for you.
Already a member? Sign in here
Related Stories
Top stories on inkl right now
Our Picks
Fourteen days free
Download the app
One app. One membership.
100+ trusted global sources.