Google is bringing its next-generation artificial intelligence model Gemini to its Bard chatbot from today. The company says it will significantly improve both the performance and reasoning ability of Bard — giving it the ability to understand images, text, code, audio and video natively.
There was some speculation it would be delayed due to issues with understanding certain languages, and the new version will only be available in English. However, during a press call, Google told reporters it worked well across multiple languages.
Google launched its Bard chatbot eight months ago as an experiment and in response to the success of ChatGPT. Gemini is the third model to underpin the tool: it started with LaMDA, was upgraded to PaLM 2 when that model launched in the summer, and now moves to Gemini.
Sissie Hsiao, VP for Google Bard, said that in blind evaluations with third-party testers, Bard came out as the most preferred free chatbot.
What is Gemini?
Gemini is split into three versions: Nano, which will run on mobile devices; Pro, which is being used to power Bard; and Ultra, which will launch next year and power a new Bard Advanced.
Google says Gemini Pro performs as well as or better than GPT-3.5, the model from OpenAI that powers the free version of ChatGPT. It wouldn’t be drawn on how Gemini Ultra compares to GPT-4, only stating it outperforms “all models out there” on key benchmarks.
“Gemini can understand the world around us in the way that we do and can absorb any type of input and output. Not just text like most models but also code, audio, image, and video,” said Demis Hassabis, CEO of Google DeepMind.
How will it change Bard?
Bard has been gradually improved since its launch, with the addition of some multimodal features like analyzing the contents of an image. It's also gained extensions allowing it to check flights, review the contents of a YouTube video or check your emails.
With Gemini it becomes something completely new. While it might not look any different on the surface, under the hood will be a more powerful engine that, according to Google, will have better capabilities across the board — at least on par with the free version of ChatGPT.
It has been built on top of a fine-tuned version of Gemini Pro in English, with other languages coming in the new year. This improves its reasoning, planning, and understanding capabilities over the previous version built on PaLM 2.
The initial release will only work with text-based prompts, but because Gemini was built from the ground up to be multimodal, support for other media will come next year.
What's next? Bard Advanced
At some point next year, Google will roll out a new version of its chatbot called Bard Advanced. It isn’t yet clear whether this will be a paid service along the same lines as ChatGPT Plus, but it will be powered by Gemini Ultra, the most powerful version of the AI model.
Gemini Ultra was built for “highly complex tasks and to quickly understand and act on different types of information,” running on the most powerful chips in Google’s network of data centers.
Google says Bard Advanced will provide access to the full capabilities of Gemini Ultra, but first there need to be more safety checks and testing. “This aligns with the bold and responsible approach we’ve taken since Bard launched,” said Hsiao.
“With Gemini, we’re one step closer to our vision of making Bard the best AI collaborator in the world,” she added. “We’re excited to keep bringing the latest advancements into Bard, and to see how you use it to create, learn and explore.”
Gemini is also coming to Google Workspace through Duet AI early next year. This will allow for more detailed generated text and images in Docs, Sheets and Slides. The company is also experimenting with it in Search to speed up results and allow more complex queries.