Tian Yuandong's team releases DOC, the second version of its long story generator: coherence greatly improved, interestingness up 20.7%!

Apr 11, 2023 09:22 PM

Some time ago, at EMNLP 2022, Dr. Tian Yuandong's team released Re3 (Recursive Reprompting and Revision), a story generation framework built on large language models. Relying on prompt design alone, with no fine-tuning of the large model, it can generate consistent stories of up to 7,500 words.

Re3's authors recently released the second version of the long story generation framework, DOC (Detailed Outline Control). It uses a hierarchical outline to describe the story in more detail, and a fine-tuned OPT-350m model to keep the generated continuation more coherent. In human evaluations, DOC was rated a more capable writer than the previous-generation Re3.

Paper link: https://arxiv.org/abs/2212.10077

Code link: https://github.com/yangkevin2/doc-story-generation

DOC consists of two complementary components:

1. The detailed outliner creates a more detailed, hierarchically structured outline, moving creative work from the main drafting process to the planning stage;

2. The detailed controller keeps the more detailed outline effective during generation by controlling story paragraphs to stay consistent with the outline details.

In human evaluation of automatically generated stories, DOC achieved absolute gains of 22.5% in plot coherence, 28.2% in outline relevance, and 20.7% in interestingness, significantly outperforming the previous Re3 baseline. Human evaluators also found DOC easier to control in an interactive generation setting.

The first author of the article, Kevin Yang, is a fourth-year doctoral student at the University of California, Berkeley. His main research interest is controllable natural language generation in structured settings, such as using structured approaches to controllable generation to improve the consistency of long texts.

The second author, Dr. Tian Yuandong, is a researcher and senior manager at Meta AI Research. His research interests include deep reinforcement learning and its application in games, as well as theoretical analysis of deep learning models. He received his bachelor's and master's degrees from Shanghai Jiao Tong University in 2005 and 2008, and his doctorate from the Robotics Institute of Carnegie Mellon University in the United States in 2013.

DOC Framework

As natural language technology continues to develop, large language models' understanding of short texts is approaching a bottleneck, and interest is growing in generating longer texts, such as thousands of words at once.

Compared with short text generation, long text involves more content and more constraints. The model needs to maintain overall coherence and long-range factual consistency, and stay relevant to the premise or plan supplied by the user.

Compared with humans, story generation systems like Re3 still fall short in several respects: they cannot guarantee plot coherence over long distances, they show global inconsistencies, and the story content can deviate from the established plan.

To bridge this gap, the Detailed Outline Control (DOC) framework reuses Re3's high-level planning-drafting-revision structure and improves long-range coherence through two complementary approaches.

Detailed Outliner

First, the detailed outliner refines a brief initial outline into a more detailed, hierarchical outline. The design is motivated by how a human author might iteratively refine and expand a short initial outline before drafting a longer document.

Rather than improvising new plot points, a writer might plan a coherent overarching plot in the high-level outline stage, using an expanded outline to provide more detailed guidance during the drafting process.
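The refinement idea can be illustrated with a small sketch. It is not the authors' implementation: the `OutlineNode` class, the level-by-level expansion order, the `max_depth` parameter, and the `toy_propose` stand-in for a language-model call are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class OutlineNode:
    """One outline item; children hold its more detailed sub-items."""
    text: str
    children: List["OutlineNode"] = field(default_factory=list)


def expand_outline(
    roots: List[OutlineNode],
    propose_children: Callable[[OutlineNode], List[str]],
    max_depth: int,
) -> None:
    """Refine a short outline level by level until it has max_depth levels."""
    frontier = list(roots)
    for _ in range(max_depth - 1):
        next_frontier: List[OutlineNode] = []
        for node in frontier:
            # In a real system this call would prompt a language model.
            node.children = [OutlineNode(t) for t in propose_children(node)]
            next_frontier.extend(node.children)
        frontier = next_frontier


def leaf_items(nodes: List[OutlineNode]) -> List[str]:
    """Collect the most detailed items, in story order, for drafting."""
    items: List[str] = []
    for node in nodes:
        items.extend(leaf_items(node.children) if node.children else [node.text])
    return items


def toy_propose(node: OutlineNode) -> List[str]:
    # Hypothetical stand-in: split every item into two sub-events.
    return [f"{node.text} (first half)", f"{node.text} (second half)"]


outline = [OutlineNode("Act 1"), OutlineNode("Act 2")]
expand_outline(outline, toy_propose, max_depth=3)
detailed_items = leaf_items(outline)
```

The drafting stage would then walk through `detailed_items` in order, so the plot is fixed during planning rather than improvised while writing.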

During the drafting stage, the researchers reuse the outline relevance and text coherence rerankers from Re3's rewriting stage to detect when the paragraph for the current outline item is complete, and implement early stopping based on a score threshold.
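A minimal sketch of this drafting loop follows, under stated assumptions: the scorers are passed in as plain functions, candidates are ranked by the sum of the two scores, and drafting stops once the relevance score reaches `stop_threshold`. The function names, the way scores are combined, and the direction of the threshold test are hypothetical, not taken from the paper.

```python
from typing import Callable, List


def draft_outline_item(
    outline_item: str,
    generate_candidates: Callable[[str, str], List[str]],
    relevance_score: Callable[[str, str], float],
    coherence_score: Callable[[str, str], float],
    stop_threshold: float,
    max_steps: int = 8,
) -> str:
    """Extend a paragraph step by step, stopping early once it covers the item."""
    paragraph = ""
    for _ in range(max_steps):
        candidates = generate_candidates(outline_item, paragraph)
        # Rerank: keep the continuation with the best combined score.
        best = max(
            candidates,
            key=lambda c: relevance_score(outline_item, paragraph + c)
            + coherence_score(paragraph, c),
        )
        paragraph += best
        # Early stopping: the current outline item is judged complete.
        if relevance_score(outline_item, paragraph) >= stop_threshold:
            break
    return paragraph


# Toy stand-ins so the sketch runs without any model.
def toy_generate(item: str, paragraph: str) -> List[str]:
    words = item.split()
    n = len(paragraph.split())
    on_topic = words[n] if n < len(words) else "filler"
    return [on_topic + " ", "noise "]


def toy_relevance(item: str, text: str) -> float:
    wanted = item.split()
    present = set(text.split())
    return sum(w in present for w in wanted) / len(wanted)


def toy_coherence(paragraph: str, candidate: str) -> float:
    return 0.0


drafted = draft_outline_item(
    "knight finds sword", toy_generate, toy_relevance, toy_coherence, stop_threshold=1.0
)
```

With the toy scorers, the loop stops after three steps, well before `max_steps`, because the paragraph already covers every word of the outline item.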

The outline includes complete settings and the relevant characters, and each outline item is carefully filtered for relevance and coherence in context.

The structured prompt highlights the current setting and any change of setting, and character descriptions are retrieved based on the characters detected in the outline.
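The sketch below shows one way such a structured prompt could be assembled. The field names, the prompt wording, and the `build_prompt` helper are assumptions for illustration only; the actual prompt format used by DOC is not given in this article.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class OutlineItem:
    text: str              # the plot point to write next
    setting: str           # where this plot point takes place
    characters: List[str]  # characters detected in this outline item


def build_prompt(
    item: OutlineItem,
    previous_setting: Optional[str],
    character_descriptions: Dict[str, str],
    story_so_far: str,
) -> str:
    """Assemble a prompt that makes setting and characters explicit."""
    lines: List[str] = []
    # Retrieve descriptions only for characters detected in this item.
    for name in item.characters:
        if name in character_descriptions:
            lines.append(f"{name}: {character_descriptions[name]}")
    lines.append(f"Current setting: {item.setting}")
    # Highlight a change of setting so the model does not miss it.
    if previous_setting is not None and previous_setting != item.setting:
        lines.append(f"The setting changes from {previous_setting} to {item.setting}.")
    lines.append(f"Upcoming plot point: {item.text}")
    lines.append(f"Story so far: {story_so_far}")
    return "\n".join(lines)


prompt = build_prompt(
    OutlineItem("Mara confronts the thief.", "the harbor", ["Mara"]),
    previous_setting="the market",
    character_descriptions={"Mara": "a young locksmith", "Oren": "a retired sailor"},
    story_so_far="Mara noticed her tools were missing.",
)
```

Tracking the setting per outline item is what allows the change from one scene to the next to be stated explicitly instead of left for the model to infer.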

In contrast, Re3 dynamically selects relevant characters for each paragraph during drafting and does not track setting information, which can lead to unexpected changes in the story's setting.

Detailed Controller

The second component, the detailed controller, controls paragraph generation based on the corresponding outline item in order to stay faithful to the detailed outline.

Because the detailed outline imposes many overlapping soft constraints, the detailed controller must exert sufficient control strength. At the same time, it must accommodate flexible natural language inputs and remain computationally efficient when generating with state-of-the-art large language models.

So the researchers implemented the detailed controller as an OPT-350m-based controller, and designed a contrastive training procedure to align summaries with paragraph prefixes.

Most importantly, the researchers also constructed many fluent hard negatives, which push the generated paragraphs not only to start off relevant to the topic but to stay relevant throughout.

Experimental part

In the experiments, the input to the model is just a short English premise, usually 30-60 words, and the output is a complete story.

The researchers did not impose further rule-based constraints, because the definition of a "story" is not yet clear, let alone that of a "good story"; quality is judged mainly through human evaluation metrics.

Three main metrics are used in the evaluation, adapted to compare paragraphs rather than complete stories:

1. Coherence, the percentage of paragraphs that human annotators judge to have a coherent plot;

2. Relevance, the percentage of paragraphs that are judged to conform to the corresponding outline items;

3. Interestingness, the percentage of paragraphs that are judged to be interesting.
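Since each metric is a percentage of paragraphs, an absolute gain is simply the difference between two percentages. The sketch below works through that arithmetic on made-up toy judgments; the data and function names are illustrative assumptions and do not reproduce any result from the paper.

```python
from typing import Dict, List


def percentage_judged(judgments: List[bool]) -> float:
    """Percentage of paragraphs an annotator judged positively."""
    return 100.0 * sum(judgments) / len(judgments)


def summarize(annotations: Dict[str, List[bool]]) -> Dict[str, float]:
    """Compute one percentage per metric."""
    return {metric: percentage_judged(votes) for metric, votes in annotations.items()}


def absolute_gain(system: float, baseline: float) -> float:
    """Gain in percentage points, not a relative improvement."""
    return system - baseline


# Toy judgments for four paragraphs per system (not real data).
system_scores = summarize({
    "coherence": [True, True, False, True],
    "relevance": [True, True, True, False],
    "interestingness": [True, False, True, True],
})
baseline_scores = summarize({
    "coherence": [True, False, False, True],
    "relevance": [True, False, True, False],
    "interestingness": [False, False, True, True],
})
gains = {m: absolute_gain(system_scores[m], baseline_scores[m]) for m in system_scores}
```

Reading the gains as percentage points matters: going from 50% to 75% of paragraphs judged coherent is an absolute gain of 25 points, even though the relative improvement is 50%.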

The baseline models compared include Re3, ROLLING-OPT and ROLLING-GPT.

As the experimental results show, annotators judged the plots generated by DOC to be more coherent and more relevant to the outline than those of Re3, and the improvement over the ROLLING baselines is even larger.

The results also support the model design: plot coherence and outline relevance benefit from shifting creative work from drafting to planning, as well as from the improved control mechanism.

Surprisingly, annotators also judged DOC's paragraphs to be significantly more interesting. The researchers attribute this to the more detailed (more eventful) outlines, and further ablation experiments support this hypothesis.

However, qualitative analysis also revealed that the model still has huge room for further improvement.

Unlike Re3, which sometimes strays almost completely off topic, DOC usually does not deviate significantly from the top-level outline. However, DOC often fails to follow the lower-level parts of the detailed outline.

Internal consistency remains problematic in both DOC and Re3, and occasional errors in the detailed outline can be particularly damaging, leading to larger cascading errors during drafting.

Additionally, DOC's outlines are often inconsistent in their level of detail, with some items too vague and others apparently over-expanded.

Additionally, the settings and characters detected by the model can sometimes be incorrect or incomplete. The example below shows a heavily abridged story written by DOC according to the above outline.
