OpenAI releases new consistency models: GAN-like speed of 18 FPS and real-time, high-quality image generation

Apr 22, 2023, 09:58 AM
The popularity of ChatGPT and Midjourney has put generative AI in the spotlight, and diffusion models, the technology behind image generators such as Midjourney, have become a foundation of the "generative AI" revolution.

Diffusion models are also highly sought after by researchers, and their popularity now far exceeds that of GANs, which once took the field by storm.

Just as diffusion models reached the height of their popularity, some netizens loudly announced:

The era of Diffusion models is over! Consistency models are crowned king!

What on earth is going on?

It turns out that OpenAI published a major paper, "Consistency Models," in March and released the model weights on GitHub today.

Paper address: https://arxiv.org/abs/2303.01469

Project address: https://github.com/openai/consistency_models

In terms of sampling speed, consistency models upend diffusion models: they can "generate in one step," completing simple tasks an order of magnitude faster than diffusion models while using 10 to 2,000 times less computation.

So, how fast is this?

Some netizens estimated that this amounts to generating 64 images at 256x256 resolution in about 3.5 seconds, or roughly 18 images per second!
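The estimate is simple arithmetic. As a quick check, the snippet below divides the quoted batch size by the quoted time; the function name `images_per_second` is only illustrative.

```python
def images_per_second(num_images: int, seconds: float) -> float:
    """Average throughput for a batch generated in the given time."""
    return num_images / seconds

# Figures quoted by netizens: 64 images (256x256) in about 3.5 seconds.
fps = images_per_second(64, 3.5)
print(f"{fps:.1f} images per second")  # about 18.3
```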

Moreover, one of the main advantages of the new model is that it produces high-quality samples without the need for "adversarial training."

This research was conducted by Ilya Sutskever, a student of Geoffrey Hinton (one of the "Big Three" Turing Award winners) and a main driver of AlexNet, together with Mark Chen and Prafulla Dhariwal, who developed DALL-E 2, so you can imagine how hard-core the research is.

Some netizens even said that consistency models are the future direction of research, and that we will surely look back and laugh at diffusion models.

So, will diffusion models disappear too?

Faster, stronger, no adversarial training needed

The paper is currently still a preliminary version, and the research is ongoing.

In 2021, OpenAI CEO Sam Altman wrote a blog post discussing how Moore's Law should be applied to all fields.

Some time ago, Altman discussed artificial intelligence publicly on Twitter and said that it is advancing by leaps. He said, "A new version of Moore's Law may soon appear, with the amount of intelligence in the universe doubling every 18 months."

To others, Altman’s optimism may seem unfounded.

But the latest research conducted by the team led by OpenAI’s chief scientist Ilya Sutskever provides strong support for Altman’s claim.

2022 is often called the first year of AIGC, largely because so many models built on diffusion models appeared.

Diffusion models have gradually replaced GANs as the most effective image generation models in the industry. For example, DALL-E 2 and Google's Imagen are both diffusion models.

However, the newly proposed consistency models have been shown to produce content of comparable quality to diffusion models in less time.

This is because consistency models use a single-step generation process, similar to GANs.

In contrast, the diffusion model uses a repeated sampling process to gradually eliminate noise in the image.

This method, although impressive, relies on performing hundreds to thousands of steps to achieve good results, which is not only expensive to operate, but also slow.

The iterative generation process of diffusion models consumes 10 to 2,000 times more computation than consistency models, which makes inference slow.
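To make the step-count gap concrete, here is a minimal sketch on a toy problem, not the paper's code. It assumes one-dimensional Gaussian data with standard deviation `SIGMA_DATA`, for which the denoising ODE has a closed-form solution; the time range (`EPS`, `T_MAX`), the step count, and all function names are illustrative assumptions. The iterative sampler evaluates its drift function once per step, while the one-step map reaches nearly the same point with a single evaluation.

```python
import math

SIGMA_DATA = 0.5          # std of the toy Gaussian "data" (assumption)
EPS, T_MAX = 0.002, 80.0  # smallest and largest noise levels (assumption)

def drift(x: float, t: float) -> float:
    """dx/dt of the toy denoising ODE (analytic for Gaussian data)."""
    return t * x / (SIGMA_DATA**2 + t**2)

def iterative_sample(x_T: float, num_steps: int) -> tuple[float, int]:
    """Diffusion-style sampling: many small Euler steps from T_MAX down to EPS."""
    # Geometric time grid from T_MAX to EPS.
    ts = [T_MAX * (EPS / T_MAX) ** (i / num_steps) for i in range(num_steps + 1)]
    x, evals = x_T, 0
    for t_cur, t_next in zip(ts, ts[1:]):
        x += (t_next - t_cur) * drift(x, t_cur)
        evals += 1  # one "network" evaluation per step
    return x, evals

def one_step_sample(x_T: float) -> tuple[float, int]:
    """Consistency-style sampling: jump straight to the start of the trajectory."""
    scale = math.sqrt((SIGMA_DATA**2 + EPS**2) / (SIGMA_DATA**2 + T_MAX**2))
    return x_T * scale, 1

x_T = 40.0  # a fixed "noise" input so both samplers start from the same point
slow, slow_evals = iterative_sample(x_T, num_steps=1000)
fast, fast_evals = one_step_sample(x_T)
print(f"iterative: {slow:.4f} after {slow_evals} evaluations")
print(f"one-step:  {fast:.4f} after {fast_evals} evaluation")
```

In this toy setting the two results agree closely, but the iterative route costs 1,000 evaluations against one. A real diffusion model replaces `drift` with a neural network call, which is where the cost comes from.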

A key strength of consistency models is that they can trade additional compute for better sample quality when necessary.
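This trade-off can be sketched as a multistep sampling loop: after the first one-step sample, each extra step adds noise back at a smaller noise level and applies the model again, so the caller decides how much compute to spend. The code below is a hedged illustration under assumptions: `toy_consistency_fn` is a closed-form stand-in for a trained network (exact only for 1-D Gaussian data), and the constants and the noise levels passed as `taus` are arbitrary illustrative choices.

```python
import math
import random

SIGMA_DATA = 0.5          # std of the toy Gaussian "data" (assumption)
EPS, T_MAX = 0.002, 80.0  # smallest and largest noise levels (assumption)

def toy_consistency_fn(x: float, t: float) -> float:
    """Hypothetical stand-in for a trained model: maps (x, t) to a clean sample."""
    return x * math.sqrt((SIGMA_DATA**2 + EPS**2) / (SIGMA_DATA**2 + t**2))

def multistep_sample(model, taus, rng):
    """Return (sample, number of model evaluations)."""
    x = model(rng.gauss(0.0, T_MAX), T_MAX)  # one-step sample from pure noise
    evals = 1
    for tau in taus:  # decreasing noise levels, each between EPS and T_MAX
        noisy = x + math.sqrt(tau**2 - EPS**2) * rng.gauss(0.0, 1.0)
        x = model(noisy, tau)  # denoise again from the lower noise level
        evals += 1
    return x, evals

rng = random.Random(0)
one_step, n1 = multistep_sample(toy_consistency_fn, [], rng)
three_step, n3 = multistep_sample(toy_consistency_fn, [10.0, 1.0], rng)
print(n1, n3)  # 1 3
```

Because the toy function is already exact, the extra steps here only show the mechanics. With a learned, imperfect model, the additional evaluations are what buy the quality improvement.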

Additionally, the model can perform zero-shot data editing tasks such as image inpainting, colorization, and stroke-guided image editing.

Figure: Zero-shot image editing with a consistency model trained by distillation on LSUN Bedroom 256x256.

Consistency models also build on a mathematical equation that converts data into noise, and they ensure that the output is consistent for similar data points, that is, points lying on the same solution trajectory, which enables smooth transitions between them.

This type of equation is called the probability flow ordinary differential equation (Probability Flow ODE).

The study calls these models "consistency models" because they maintain self-consistency between input and output: inputs taken from the same trajectory produce the same output.
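The self-consistency property can be checked numerically on a toy case. The sketch below is not the paper's code: it assumes one-dimensional Gaussian data, for which the Probability Flow ODE trajectory and the ideal consistency function both have closed forms. The constants and the names `ode_trajectory` and `consistency_fn` are illustrative assumptions.

```python
import math

SIGMA_DATA = 0.5          # std of the toy Gaussian "data" (assumption)
EPS, T_MAX = 0.002, 80.0  # smallest and largest noise levels (assumption)

def ode_trajectory(x_eps: float, t: float) -> float:
    """Point at time t on the ODE trajectory that passes through x_eps at time EPS."""
    return x_eps * math.sqrt((SIGMA_DATA**2 + t**2) / (SIGMA_DATA**2 + EPS**2))

def consistency_fn(x_t: float, t: float) -> float:
    """Ideal consistency function: maps any trajectory point back to its start."""
    return x_t * math.sqrt((SIGMA_DATA**2 + EPS**2) / (SIGMA_DATA**2 + t**2))

x_eps = 0.3  # starting point of one trajectory
times = (EPS, 1.0, 10.0, T_MAX)
outputs = [consistency_fn(ode_trajectory(x_eps, t), t) for t in times]

# Every point on the trajectory maps to (numerically) the same output.
print(all(abs(out - x_eps) < 1e-9 for out in outputs))  # True
```

Note that `consistency_fn(x, EPS)` returns `x` unchanged: at the smallest noise level the function is the identity, which anchors every trajectory to its own starting point.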

These models can be trained in either distillation mode or isolation mode.

In distillation mode, the model distills knowledge from a pre-trained diffusion model, which allows it to generate in a single step.

In isolation mode, the model does not depend on a diffusion model at all, making it a completely independent model.
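To illustrate the idea behind distillation mode, the sketch below computes a consistency-style loss for one pair of adjacent points on a trajectory: a teacher takes one ODE step, and the student is penalized when its outputs at the two points differ. This is a simplified toy under assumptions, not OpenAI's training code. `teacher_score` is an analytic stand-in for a pre-trained diffusion model on 1-D Gaussian data, the squared-error distance is one possible choice, and the same function is used as both student and target, whereas in practice the target would typically be a slowly updated copy of the student.

```python
import math

SIGMA_DATA, EPS = 0.5, 0.002  # toy constants (assumptions)

def teacher_score(x: float, t: float) -> float:
    """Stand-in for a pre-trained diffusion model: exact score of N(0, SIGMA_DATA^2 + t^2)."""
    return -x / (SIGMA_DATA**2 + t**2)

def teacher_ode_step(x: float, t_from: float, t_to: float) -> float:
    """One Euler step of the probability flow ODE dx/dt = -t * score(x, t)."""
    return x + (t_to - t_from) * (-t_from * teacher_score(x, t_from))

def distillation_loss(student, target, x_next, t_next, t_cur):
    """Squared distance between outputs at two adjacent points of one trajectory."""
    x_cur = teacher_ode_step(x_next, t_next, t_cur)
    return (student(x_next, t_next) - target(x_cur, t_cur)) ** 2

def ideal_fn(x: float, t: float) -> float:
    """Closed-form consistency function for the toy data."""
    return x * math.sqrt((SIGMA_DATA**2 + EPS**2) / (SIGMA_DATA**2 + t**2))

def identity_fn(x: float, t: float) -> float:
    """A deliberately poor model that ignores the noise level."""
    return x

loss_ideal = distillation_loss(ideal_fn, ideal_fn, x_next=1.0, t_next=1.0, t_cur=0.9)
loss_bad = distillation_loss(identity_fn, identity_fn, x_next=1.0, t_next=1.0, t_cur=0.9)
print(loss_ideal < loss_bad)  # True
```

The ideal function's loss is not exactly zero because the teacher takes a discrete Euler step. Training would adjust the student's parameters to drive this loss down across many sampled points and noise levels.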

Notably, neither training method requires "adversarial training."

Admittedly, adversarial training can produce more robust neural networks, but the process is roundabout: it introduces a set of misclassified adversarial examples and then retrains the target network with the correct labels.

As a result, adversarial training can slightly reduce the prediction accuracy of deep learning models, and it may even cause unexpected side effects in robotics applications.

Experimental results show that consistency distillation outperforms existing distillation techniques for diffusion models.

For one-step generation, consistency models achieved new state-of-the-art FID scores of 3.55 on CIFAR-10 and 6.20 on ImageNet 64x64.

This effectively achieves the quality of diffusion models at the speed of GANs, the best of both worlds.

In February, Sutskever posted a tweet suggesting:

Many people believe that great AI progress must include a new "idea." But that is not the case: many of AI's greatest advances have taken the form of a familiar, humble idea that, when done well, turns out to be incredible.

The latest research proves just that: tweaking an old concept can change everything.

Author Introduction

Ilya Sutskever, co-founder and chief scientist of OpenAI, needs no introduction; just take a look at this group photo of "top performers."

(Sutskever is at the far right of the picture.)

Yang Song (Song Yang)

Yang Song, the first author of the paper, is a research scientist at OpenAI.

Previously, he received a bachelor's degree in mathematics and physics from Tsinghua University and a master's degree and doctorate in computer science from Stanford University. He has also interned at Google Brain, Uber ATG, and Microsoft Research.

As a machine learning researcher, he focuses on developing scalable methods to model, analyze, and generate complex high-dimensional data. His interests span multiple areas, including generative modeling, representation learning, probabilistic inference, AI safety, and AI for science.

Mark Chen

Mark Chen heads multimodal and frontier research at OpenAI and is also a coach of the USA Computing Olympiad team.

Previously, he earned a bachelor's degree in mathematics and computer science from MIT and worked as a quantitative trader at several proprietary trading firms, including Jane Street Capital.

After joining OpenAI, he led the team to develop DALL-E 2 and introduced vision into GPT-4. In addition, he led the development of Codex, participated in the GPT-3 project, and created Image GPT.

Prafulla Dhariwal

Prafulla Dhariwal is a research scientist at OpenAI, working on generative models and self-supervised learning. Before that, he was an undergraduate at MIT, studying computer science, mathematics, and physics.

Interestingly, it was his 2021 NeurIPS paper that showed diffusion models can beat GANs at image generation.

Netizens: Finally back to Open AI

OpenAI open-sourced the consistency model code today.

Finally back to Open AI.

Faced with so many crazy breakthroughs and announcements every day, netizens asked: should we take a break or speed up?

Compared with diffusion models, this will significantly reduce the cost of training models for researchers.

Some netizens also suggested future use cases for consistency models: real-time editing, NeRF rendering, and real-time game rendering.

There is no demo yet, but what is certain is that a big boost in image generation speed is always a win.

We upgraded directly from dial-up to broadband.

Brain-computer interface, plus ultra-realistic images generated in almost real time.
