What is Google Gemini 1.0? Next-gen AI to battle ChatGPT explained

By Chris Smith December 6, 2023 6:10 pm GMT

What is Google Gemini 1.0: Google has fully revealed its next-generation Gemini 1.0 generative AI model, which is now available through the Bard conversational chatbot.

The Gemini 1.0 model, which was initially previewed at Google I/O in May, is more powerful than Google’s existing technology and potentially better equipped to go up against the OpenAI models behind ChatGPT, which Microsoft also uses. Google’s own tests suggest Gemini comes out ahead.

Google says Gemini is its “largest and most capable AI model” and it’ll be responsible for powering everything from Bard to the Google Pixel range of smartphones. It’s available within Bard from today and is coming to the Pixel 8 Pro in the December Feature Drop.

The company says the key to Gemini is its “multimodal” design. That means it can “generalise and seamlessly understand, operate across and combine different types of information including text, code, audio, image and video.” Previous iterations achieved more limited capabilities by stitching separate models together, whereas Gemini is natively multimodal.

Google showed off the new capabilities in an incredibly impressive-looking demonstration video. It’s fast, reactive, conversational, and terrifyingly smart.

Three different sizes for Gemini

Google says Gemini 1.0 is the first version of the model, as the numbering convention would suggest. There are three tiers, each built for a different purpose, and all of them benefit from the multimodal design. Their purposes are detailed below.

Nano, for example, is designed for on-device AI and will soon be available on the Pixel 8 Pro, while Ultra is geared towards the most demanding workloads, such as those run in data centres. Pro is the happy middle ground, and it powers Bard from today.

Gemini Ultra — our largest and most capable model for highly complex tasks.
Gemini Pro — our best model for scaling across a wide range of tasks.
Gemini Nano — our most efficient model for on-device tasks.

Super-human performance

Google says Gemini’s performance has been rigorously tested on tasks like natural image, audio and video understanding and mathematical reasoning. Gemini Ultra beats incumbent models on 30 out of 32 academic benchmarks for large language models.

Google also says Gemini Ultra is the first model to outperform human experts on MMLU (massive multitask language understanding), a benchmark that spans 57 subjects. Gemini Ultra scored 90.0% in those tests, while OpenAI’s GPT-4 scored 86.4%. That’s a key takeaway.

How and why the next generation was built

Google says it has approached the training of Gemini differently from previous multimodal models. Previously, separate components were trained for different types of input and put together afterwards. That meant such models were good at describing images, for instance, but struggled with more complex reasoning.

Google says Gemini was pre-trained from the start to be natively multimodal and that results in a massive upgrade.

“Then we fine-tuned it with additional multimodal data to further refine its effectiveness,” the company says in the blog post. “This helps Gemini seamlessly understand and reason about all kinds of inputs from the ground up, far better than existing multimodal models — and its capabilities are state of the art in nearly every domain.”

Availability

As we mentioned, Gemini Pro is available to use within Google Bard from today, while Gemini Nano will launch on the Pixel 8 Pro as part of the December Feature Drop.

Google adds: “We’re also bringing Gemini to Pixel. Pixel 8 Pro is the first smartphone engineered to run Gemini Nano, which is powering new features like Summarize in the Recorder app and rolling out in Smart Reply in Gboard, starting with WhatsApp — with more messaging apps coming next year.”

Chris Smith is a freelance technology journalist for a host of UK tech publications, including Trusted Reviews. He's based in South Florida, USA. …

Why trust our journalism?

Founded in 2003, Trusted Reviews exists to give our readers thorough, unbiased and independent advice on what to buy.

Today, we have millions of users a month from around the world, and assess more than 1,000 products a year.

Editorial independence

Editorial independence means being able to give an unbiased verdict about a product or company, with the avoidance of conflicts of interest. To ensure this is possible, every member of the editorial staff follows a clear code of conduct.

Professional conduct

We also expect our journalists to follow clear ethical standards in their work. Our staff members must strive for honesty and accuracy in everything they do. We follow the IPSO Editors’ code of practice to underpin these standards.
