Google has just announced (but not released) Gemini 1.5, an update to its flagship language model — the model used in the chatbot once known as Bard, but synergistically renamed Gemini a week ago.
The big claim with this release is "a breakthrough in long-context understanding across modalities." It's also meant to be a step-up in terms of efficiency, having been built on an architecture type known as "Mixture-of-Experts (MoE)," meaning performance supposedly akin to Gemini 1.0, but relying on fewer electricity-hungry GPUs cranking away to achieve it.
That first big claim about multi-modal "long-context" understanding is as jargony as it sounds, but Google Deepmind co-founder posted a demo on X meant to show what this means in practice.
Tweet may have been deleted
Smartly utilizing a large piece of public-domain text that won't rankle any copyright sticklers — in this case a 402-page transcript of the NASA mission that landed on the moon — the LLM is able to narrow its focus to what the user needs ("context") in spite of the prompt being absolutely gigantic ("long"), so apparently that's what "long-context" means.
In the demo, Gemini 1.5 is able to pick out three amusing moments from the novel-length piece of text. It's also able to spot the event in the transcript that matches a picture of a lunar boot print — the part where, y'know, Neil Armstrong walks on the moon — which elucidates what "multimodal" is intended to mean in this context: an image recognition model working hand-in-hand with the LLM.
SEE ALSO: Samsung Galaxy S24 series comes with Google's Gemini AI modelThis upgrade is part of an ongoing effort to keep Google in the AI conversation after OpenAI ate everyone's AI lunch in 2022 by releasing ChatGPT. Late last year, Google began seriously hyping changes to come with Bard and the model powering it, which remains an also-ran large language model, more known for being shoehorned into popular Google and Android products than for being used like ChatGPT to solve day-to-day problems and blow minds at cocktail parties. In particular, a December 2023 research paper touted a version of Gemini that had exceeded the performance of OpenAI's GPT-4 model in certain cases, and become the first LLM to get a passing score on a specific AI test of "Multitask Language Understanding " or MLU.
Among other assertions about Gemini 1.5, Google says the new model can crunch large datasets with impressive accuracy, and — in a somewhat more eyebrow raising claim — perform well at reasoningacross all sorts of data types. Reasoning is the most famous weakness among most LLMs.
According to CEO Sundar Pichai, Google is releasing Gemini 1.5 to a limited group. "We’re excited to offer a limited preview of this experimental feature to developers and enterprise customers," Pichai wrote in Google's blog post.
The broader base of Gemini users will be the ultimate judges of Google's performance claims when they're actually permitted to try out Gemini 1.5 as part of an officially-released product. Google's most powerful model, Gemini Ultra was released a week ago, so it may be a while, and it's probably safe to assume that Gemini 1.5, will one day be part of Google's new premium — in other words "paid" — package of Workspace products called Google One AI Premium.
Copyright © 2023 Powered by
Google announces Gemini 1.5, a flashy upgrade to its flagship AI model-坐而论道网
sitemap
文章
45614
浏览
8151
获赞
5688
Astrology tech can provide a safe space for the LGBTQ community, but there are limitations
Mashable is celebrating Pride Monthby exploring the modern LGBTQ world, from the people who make upBest Garmin deal: Save $270 on the Garmin fenix 7X
SAVE $270: As of Jan. 10, the Garmin fenix 7X is on sale for $529.99 at Amazon. This deal saves youBest MacBook deal: Save $200 on Apple 2022 MacBook Air
Save $200: As of Jan. 13, Apple 2022 MacBook Air with M2 chip is on sale for $799 at Amazon. That&rsNvidia's DLSS Second Take: Metro Exodus Investigation
It's time for our final look into Metro Exodus, exploring the last remaining key technology the gameEstimating CPU Performance Using Amdahl's Law
If you are in the market for a new computer, or thinking of upgrading your current system, choosingHands on with Lenovo's 'rollable' display laptop at CES 2025
Lenovo has been on a roll. The Beijing-based tech company consistently debuts some of the most excitI love LUCI: This lifelogging camera pin seriously impressed me at CES 2025
The last thing I came across at this year'sCESwas LUCI, an AI-powered wearable video camera createdThe Best Strategy Games on PC
The PC is home to just about every type of video game under the sun, but few are as quintessentiallyGoogle Maps just got a colorful upgrade
Like Dorothy in Oz, Google Maps is about to get a lot more colorful. Starting Tuesday, Google Maps oBest MacBook deal: Save $200 on Apple 2022 MacBook Air
Save $200: As of Jan. 13, Apple 2022 MacBook Air with M2 chip is on sale for $799 at Amazon. That&rsBest Samsung TV deal: Samsung 55
Save $71.99: As of Jan. 13, Samsung 55-inch Class DU6900 Crystal UHD 4K Smart TV is available for $2Best Fire TV Cube deal: Save $30 at Amazon
SAVE $30: As of Jan. 13, Amazon's Fire TV Cube is on sale for $109.99 for a limited time, 21% off it21 Years of Hitman: How Stealth Action Got Perfected Over the Last 2 Decades
Coming off the highly anticipated release of Hitman 3, this latest installment has been very well reBest Lenovo Legion Go deal: Save $200 at Best Buy
SAVE $200:As of Dec. 18, the Lenovo Legion Go gaming handheld (1TB SSD, AMD Ryzen Z1 Extreme) is onBest MacBook deal: Get $200 off 2024 Apple MacBook Air
SAVE $200:As of Jan. 9, Apple's 2024 MacBook Air 13-inch laptop (M3 chip, 16GB RAM, 256GB SSD) is on