Home Technology peripherals AI Google Gemini 1.5 technical report: Easily prove Mathematical Olympiad questions, the Flash version is 5 times faster than GPT-4 Turbo

Google Gemini 1.5 technical report: Easily prove Mathematical Olympiad questions, the Flash version is 5 times faster than GPT-4 Turbo

Jun 13, 2024 pm 01:52 PM
Google Model

In February of this year, Google launched the multi-modal large model Gemini1.5, which greatly improved performance and speed through engineering and infrastructure optimization, MoE architecture and other strategies. With longer context, stronger reasoning capabilities, and better handling of cross-modal content.

This Friday, Google DeepMind officially released the technical report of Gemini 1.5, which covers the Flash version and other recent upgrades. The document is 153 pages long.

谷歌Gemini 1.5技术报告:轻松证明奥数题,Flash版比GPT-4 Turbo快5倍

Technical report link: https://storage.googleapis.com/deepmind-media/gemini/gemini_v1_5_report.pdf

In this report, Google introduces the Gemini 1.5 series models. It represents the next generation of highly computationally efficient multi-modal large models, capable of recalling fine-grained information and reasoning from the context of millions of tokens, including multiple long documents and hours of video. Gemini 1.5 series models have multiple language and visual reasoning capabilities, making them widely used in the fields of natural language processing and computer vision. The model is capable of extracting key information from text and performing inferences, as well as comprehensively analyzing multiple long documents. Additionally, it supports the processing of large amounts of visual data and is capable of processing large amounts of visual data for hours.

The series includes two new models:

  1. Updated Gemini 1.5 Pro, with most features and benchmarks exceeding the February version
  2. Gemini 1.5 Flash, a more lightweight variant designed for improved efficiency design, and the performance penalty is minimal.

Regarding the Flash version mentioned at this week’s Google I/O conference, the report stated that Gemini 1.5 Flash is a Transformer decoder model with the same features as Gemini 1.5 Pro 2M+ contextual and multi-modal features. Efficiently utilizes tensor processing units (TPUs) and has low model serving latency. For example, Gemini 1.5 Flash can calculate attention and feed-forward components in parallel, and is also a Gemini 1.5 Pro model with larger network online extraction capabilities. It is trained using high-order preprocessing methods to improve quality.

The report evaluates the average time per output character for English, Chinese, Japanese, and French queries taken from Gemini 1.5 and the Vertex AI Streaming API.

谷歌Gemini 1.5技术报告:轻松证明奥数题,Flash版比GPT-4 Turbo快5倍

Time per output character in milliseconds for English, Chinese, Japanese, and French responses, with 10,000 characters entered, Gemini 1.5 Flash achieved the fastest build speeds of all languages ​​tested.

谷歌Gemini 1.5技术报告:轻松证明奥数题,Flash版比GPT-4 Turbo快5倍

Evaluation results of Gemini 1.5 Pro, 1.5 Flash, and Gemini 1.0 models on standard coding, multilingual, and math, science, and reasoning benchmarks. All numbers for the 1.5 Pro and 1.5 Flash are obtained after command adjustments.

谷歌Gemini 1.5技术报告:轻松证明奥数题,Flash版比GPT-4 Turbo快5倍

## Gemini 1.5 Pro compared to Gemini 1.0 Pro and Ultra on video understanding benchmarks.

谷歌Gemini 1.5技术报告:轻松证明奥数题,Flash版比GPT-4 Turbo快5倍

Comparison of Gemini 1.5 Pro with USM, Whisper, Gemini 1.0 Pro, and Gemini 1.0 Ultra on audio understanding tasks.

Gemini 1.5 model achieves near-perfect recall on cross-modal long context retrieval tasks, improving long document QA, long video QA and long context state-of-the-art performance across a wide range of benchmarks. In addition, Google also stated that as of May this year, the performance of Gemini 1.5 has been significantly improved compared to February.

谷歌Gemini 1.5技术报告:轻松证明奥数题,Flash版比GPT-4 Turbo快5倍

Gemini 1.5 Pro (May) versus initial release (February) on multiple benchmarks. The latest Gemini 1.5 Pro delivers improvements across all inference, encoding, vision and video benchmarks, while audio and translation performance remains unchanged. Note that for FLEURS, lower scores are better.

Oriol Vinyals, vice president of Google DeepMind and co-lead of the Gemini project, concluded that Gemini 1.5 Pro > 1.0 Ultra, 1.5 Flash (currently the fastest model) ~= 1.0 Ultra.

谷歌Gemini 1.5技术报告:轻松证明奥数题,Flash版比GPT-4 Turbo快5倍

By studying the limits of Gemini 1.5's long context capabilities, we can see that next token prediction and near-perfect retrieval (>99% ) and continue to improve. A generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k).

In the seventh chapter of the report, Google introduced the running scores of the Gemini 1.5 Pro math-enhanced version, which performed well on competition-level math problems, including without using tools. It achieved a breakthrough performance of 91.1% in Hendryck's MATH benchmark test.

The following are some examples of the model solving Asia Pacific Mathematical Olympiad (APMO) problems that previous models clearly could not solve. Oriol Vinyals says this answer is great because it's a proof (rather than a calculation), the solution is to the point, and it's "beautiful."

谷歌Gemini 1.5技术报告:轻松证明奥数题,Flash版比GPT-4 Turbo快5倍

Finally, Google highlighted real-world use cases for large models, such as Gemini 1.5, which works with professionals to complete tasks and achieve goals at 10 Time savings of 26-75% can be achieved across different job categories.

This cutting-edge large language model also demonstrates some surprising new features. When given a grammar manual for Kalamang, a language spoken by fewer than 200 people in western Papua New Guinea, the model could learn to translate English into Kalamang at a similar level to humans learning from the same content.

The above is the detailed content of Google Gemini 1.5 technical report: Easily prove Mathematical Olympiad questions, the Flash version is 5 times faster than GPT-4 Turbo. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Java Tutorial
1664
14
PHP Tutorial
1267
29
C# Tutorial
1239
24
Top 10 recommended for crypto digital asset trading APP (2025 global ranking) Top 10 recommended for crypto digital asset trading APP (2025 global ranking) Mar 18, 2025 pm 12:15 PM

This article recommends the top ten cryptocurrency trading platforms worth paying attention to, including Binance, OKX, Gate.io, BitFlyer, KuCoin, Bybit, Coinbase Pro, Kraken, BYDFi and XBIT decentralized exchanges. These platforms have their own advantages in terms of transaction currency quantity, transaction type, security, compliance, and special features. For example, Binance is known for its largest transaction volume and abundant functions in the world, while BitFlyer attracts Asian users with its Japanese Financial Hall license and high security. Choosing a suitable platform requires comprehensive consideration based on your own trading experience, risk tolerance and investment preferences. Hope this article helps you find the best suit for yourself

Tutorial on how to register, use and cancel Ouyi okex account Tutorial on how to register, use and cancel Ouyi okex account Mar 31, 2025 pm 04:21 PM

This article introduces in detail the registration, use and cancellation procedures of Ouyi OKEx account. To register, you need to download the APP, enter your mobile phone number or email address to register, and complete real-name authentication. The usage covers the operation steps such as login, recharge and withdrawal, transaction and security settings. To cancel an account, you need to contact Ouyi OKEx customer service, provide necessary information and wait for processing, and finally obtain the account cancellation confirmation. Through this article, users can easily master the complete life cycle management of Ouyi OKEx account and conduct digital asset transactions safely and conveniently.

Detailed tutorial on how to register for binance (2025 beginner's guide) Detailed tutorial on how to register for binance (2025 beginner's guide) Mar 18, 2025 pm 01:57 PM

This article provides a complete guide to Binance registration and security settings, covering pre-registration preparations (including equipment, email, mobile phone number and identity document preparation), and introduces two registration methods on the official website and APP, as well as different levels of identity verification (KYC) processes. In addition, the article also focuses on key security steps such as setting up a fund password, enabling two-factor verification (2FA, including Google Authenticator and SMS Verification), and setting up anti-phishing codes, helping users to register and use the Binance Binance platform for cryptocurrency transactions safely and conveniently. Please be sure to understand relevant laws and regulations and market risks before trading and invest with caution.

How to optimize jieba word segmentation to improve the keyword extraction effect of scenic spot comments? How to optimize jieba word segmentation to improve the keyword extraction effect of scenic spot comments? Apr 01, 2025 pm 06:24 PM

How to optimize jieba word segmentation to improve keyword extraction of scenic spot comments? When using jieba word segmentation to process scenic spot comment data, if the word segmentation results are ignored...

Tutorial on using gate.io mobile app Tutorial on using gate.io mobile app Mar 26, 2025 pm 05:15 PM

Tutorial on using gate.io mobile app: 1. For Android users, visit the official Gate.io website and download the Android installation package, you may need to allow the installation of applications from unknown sources in your mobile phone settings; 2. For iOS users, search "Gate.io" in the App Store to download.

The latest updates to the oldest virtual currency rankings The latest updates to the oldest virtual currency rankings Apr 22, 2025 am 07:18 AM

The ranking of virtual currencies’ “oldest” is as follows: 1. Bitcoin (BTC), issued on January 3, 2009, is the first decentralized digital currency. 2. Litecoin (LTC), released on October 7, 2011, is known as the "lightweight version of Bitcoin". 3. Ripple (XRP), issued in 2011, is designed for cross-border payments. 4. Dogecoin (DOGE), issued on December 6, 2013, is a "meme coin" based on the Litecoin code. 5. Ethereum (ETH), released on July 30, 2015, is the first platform to support smart contracts. 6. Tether (USDT), issued in 2014, is the first stablecoin to be anchored to the US dollar 1:1. 7. ADA,

Top 10 recommended for safe and reliable virtual currency purchase apps Top 10 recommended for safe and reliable virtual currency purchase apps Mar 18, 2025 pm 12:12 PM

Top 10 recommended global virtual currency trading platforms in 2025, helping you to play the digital currency market! This article will deeply analyze the core advantages and special features of ten top platforms including Binance, OKX, Gate.io, BitFlyer, KuCoin, Bybit, Coinbase Pro, Kraken, BYDFi and XBIT decentralized exchanges. Whether you are pursuing high liquidity and rich trading types, or focusing on safety, compliance and innovative functions, you can find a platform that suits you here. We will provide a comprehensive comparison of transaction types, security, special functions, etc. to help you choose the most suitable virtual currency trading platform and seize the opportunities of digital currency investment in 2025

Okex trading platform official website login portal Okex trading platform official website login portal Mar 18, 2025 pm 12:42 PM

This article introduces in detail the complete steps of logging in to the OKEx web version of Ouyi in detail, including preparation work (to ensure stable network connection and browser update), accessing the official website (to pay attention to the accuracy of the URL and avoid phishing website), finding the login entrance (click the "Login" button in the upper right corner of the homepage of the official website), entering the login information (email/mobile phone number and password, supporting verification code login), completing security verification (sliding verification, Google verification or SMS verification), and finally you can conduct digital asset trading after successfully logging in. A safe and convenient login process to ensure the safety of user assets.

See all articles