Google unveiled Gemini 1.5, an upgraded suite of AI models, capable of processing extensive content like Leo Tolstoy’s “War and Peace” in a fraction of the time it would take a human to read it. This advanced AI model, owned by Alphabet, enables users to request the analysis of significantly larger amounts of data with a single prompt. For instance, the Pro model, designed for family use, can handle up to 30,000 lines of code, 11 hours of audio, or an entire hour of video. In comparison, Google claims that its Pro model can process five times more data tokens than its closest competitor, Anthropic’s Claude 2.1 technology.
The claims made by Google have not been independently verified by Reuters.
This development underscores the ongoing competition in Silicon Valley to create the most capable and marketable AI technology. Alphabet CEO Sundar Pichai described this advancement as one of several “breakthroughs” driving his company forward. He emphasized the potential for innovative applications, such as evaluating movie rough cuts or analyzing multiple companies’ financial reports simultaneously. Google plans to leverage Gemini 1.5 to enhance its cloud services, aiming to compete more effectively with rivals like Microsoft.
Beginning Thursday, Google announced the availability of its million-token AI to select business customers, with broader accessibility expected in the future. Pichai expressed confidence in the profitability of these AI models over time, highlighting improvements in efficiency and performance, such as the implementation of a “mixture of experts” approach. This approach streamlines information gathering by employing specialized experts, akin to consulting a savant for answers rather than contacting numerous individuals.