Five-minute technical talk | AIGC introduction and application selection evaluation-AI-php.cn

Home

Technology peripherals

Five-minute technical talk | AIGC introduction and application selection evaluation

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Jun 04, 2023 pm 01:31 PM

AI ai

五分钟技术趣谈 | AIGC介绍与应用选型评估

Part 01 Introduction to AIGC

AIGC (AI- Generated Content (artificial intelligence production content) refers to the production method that uses AI technology to automatically or assist in generating various forms of content such as text, code, images, voices, videos, and 3D objects. AIGC represents a new trend in the development of AI technology, from perceiving and understanding the world to generating and creating the world, and from analytical capabilities to creative capabilities. AIGC has also brought about changes in content creation, improving the quality, efficiency and diversity of content.

1.1 Text generation

Text generation refers to using AI technology automatically generates text content that conforms to grammar and logic based on given input (such as keywords, pictures, voices, etc.), which is an important aspect of AIGC.

The application scenarios of text generation are very rich, including news writing, novel creation, marketing copywriting, customer service Q&A, chat robots, educational coaching, knowledge graphs, and summary generation wait.

➤ Wen Xinyiyan: A large AI model launched by Baidu that supports multi-modal output and can perform literary creation, business copywriting creation, and mathematical logic calculations , Chinese understanding, multi-modal generation, etc.

➤ ChatGPT: A chat application based on the GPT series model launched by OpenAI. Currently, the GPT-4 model has been launched. ChatGPT based on the GPT-4 model can analyze images. And interact with text and pictures.

1.2 Code generation

Includes code completion, Code refactoring, code optimization, code annotation, etc. can cover a variety of programming languages and fields. Based on OpenAI's GPT-4 model, it is even possible to generate the corresponding website code based on a hand-drawn product prototype draft.

➤ Github Copilot: An AI-assisted programming tool developed based on the OpenAI Codex model. It supports dozens of programming languages and can perform real-time processing based on code or comments. It can provide code suggestions and entire functions in the editor, and can also achieve a pair programming experience through chat interaction.

➤ Cursor: an independent IDE software that integrates OpenAI’s GPT model. Similar to Github Copilot, Cursor can write code, edit code and chat through AI.

1.3 Image generation

Image generation refers to using Artificial intelligence technology, based on given input (such as natural language, images, videos, etc.), automatically generates images that conform to semantics and aesthetics, is an important aspect of AIGC. Image generation has a wide range of application scenarios, including artistic creation, entertainment media, education and training, e-commerce marketing, medical diagnosis, etc.

➤ Wenxin Yige: An AI art and creative assistance platform launched by Baidu. Paintings can be automatically generated based on text description and style selection.

➤ DALL-E2: A generative model based on an adaptive multi-modal encoder launched by OpenAI in (such as text, pictures, etc.) to automatically generate high-quality images.

➤ Midjourney: An AI painting tool released in March 2022. It can generate pictures based on natural language, select the artistic styles of different painters, and also Identify specific lenses or photography terms. Paintings generated by this tool have won first prize in art competitions.

1.4 Video generation

Video generation is mainly divided into two types: video editing and independent video generation. Video editing can be used for video super-scoring, repair and editing. Autonomous video generation can be used for image-to-video conversion, or for generating matching videos given descriptive text. The following are some related applications:

➤ Deepfake: This is an AI video generation platform based on GAN technology, which can realize face changing, voice conversion, and expression imitation and other functions. Users only need to upload a picture or a video as a reference, and the video will be automatically generated.

➤ Make-A-Video: An AI system launched by Meta Company that can convert text into video. It can create one-of-a-kind videos filled with vibrant colors, people, and scenery based on just a few words or lines of text.

1.5 3D modeling

AIGC-based 3D Modeling technology refers to the use of artificial intelligence technology to automatically generate 3D models that comply with semantics and aesthetics based on given input (such as natural language, images, etc.). This area is currently in an early stage of exploration. The following are some related applications or models:

➤ AICommand: an open source AI command plug-in based on Unity that can generate 3D scenes through text descriptions and Text is adjusted and optimized for 3D scenes. (https://github.com/keijiro/AICommand)

➤ ICON: An open source AI model that generates 3D character modeling based on character pictures (https: //github.com/YuliangXiu/ICON). You can experience and download the generated 3D model online: https://huggingface.co/spaces/Yuliang/ICON

五分钟技术趣谈 | AIGC介绍与应用选型评估

Part 02 AIGC Application and Model Evaluation

After ChatGPT was launched by OpenAI at the end of 2022, the cumulative number of users exceeded 100 million in just two months. It quickly became popular all over the world. As a result, the AI iPhone moment has arrived, and major IT manufacturers quickly followed up. The following is an introduction to some relevant applications or models as of April 2023.

Wen Xin’s words: See above.
ChatGPT: See above.
#Bard: A lightweight version of the NLP model launched by Google based on LaMDA.
New Bing: An intelligent search engine based on the GPT4 model launched by Microsoft. It can interact with users in natural language and combine with real-time search results to provide Information, entertainment, creation and other functions.
ChatGLM: A large conversational language model launched by Tsinghua University based on GLM architecture, open source and supporting Chinese and English bilinguals. Low-cost minimal model construction can be carried out based on CPU, and the model can also be developed and fine-tuned secondaryly.
Poe: A free AI chatbot application developed by Quora. The application integrates 6 mainstream AI chatbots including: ChatGPT and GPT-4. .

will be evaluated and compared from the following aspects (except Poe):

Natural language processing
Logical reasoning
Code generation
Multi-modal support

PS:

The ChatGPT participating in the evaluation is based on the GPT-3.5 model.
The ChatGLM participating in the evaluation is only the minimized model: chatglm-6b-int4-qe. For practical applications, the chatglm-6b model that requires GPU memory should be built, and the quality of answers will be greatly improved.

##2.1 Natural Language Processing

Evaluation Content:

➪Multiple rounds of dialogue: Let’s create a children’s story together. The rule is that I say something first and you say something next, alternating. It ends when I say "I'm done with the story." Do you understand?

➪Language understanding: My boss said 1 1=3. Everything my boss said is right, so 1 1=3, right?

➪Language Translation: Translate this passage into English: One flower blooming alone is not spring, but a hundred flowers blooming together fill the garden.

➪Emotional analysis: Analyze the emotional color of this passage: I like this new movie very much. It made me laugh many times and moved me. Cried.

##ChatGPT

五分钟技术趣谈 | AIGC介绍与应用选型评估

##文心一言

五分钟技术趣谈 | AIGC介绍与应用选型评估

Bard

五分钟技术趣谈 | AIGC介绍与应用选型评估

##NewBing

- ChatGLM
The scores are as follows:

##2.2 Logical reasoning
Assessment content:
➪ In a There are five books on the shelf: red book, green book, blue book, orange book and yellow book. The green book is to the left of the yellow book, the yellow book is the third from the left, the red book is the second from the left, and the blue book is on the far right. What is the order of these books?
➪ There are three points A, B, and C on a 100-meter-long straight line. The position of A is uncertain. The distance between A and B is 5 meters. The distance between A and C is 10 meters, what is the possible distance between B and C?
##➪ If 2

ChatGPT
#文心一言
#Bard
- #NewBing
  
  ChatGLM
  
  ##The scores are as follows:
  
  #2.3 Coding Ability
  Evaluation content:
  
  Code generation: writing a python function , accepts an integer as input and determines whether it is a palindrome.
  
  Code explanation: Explain this line of python code: my_list = [x for x in my_list if x % 2 == 0]
  
  Bug detection: Where is the BUG in this line of code: my_list = [x for x in my_list if x % 2 = 0]
  
  ChatGPT
  
  文心一言
  
  Bard
  
  #NewBing
  
  #ChatGLM
  
  #The scores are as follows:
  
  ##2.4 Multi-modal support
  Multimodal support refers to the ability to handle multiple data types, such as text, images, audio and video, etc. For example: through text input, pictures, audio and video are automatically generated based on text requirements; through picture or audio and video input, content summary text is output, etc.
  
  ##ChatGPT
  
  ChatGPT based on the GPT-3.5 model does not support multi-modal input and output capabilities, while ChatGPT based on the GPT-4 model can analyze pictures and analyze feedback text.
  
  ##文心一言
  
  Wen Xin Yi Yan can currently generate images and voices based on text descriptions. The video generation capability was demonstrated at the press conference, but during actual use, the video could not be generated.
  
  #Bard
  
  Google Bard does not support multi-modal capabilities.
  
  ##NewBing
  
  NewBing's creativity mode supports generating pictures through text descriptions.
  
  ChatGLM
  
  ##Tsinghua’s ChatGLM does not support multiple Modal capabilities.
  
  The scores are as follows:
  
  #Part 03
  Evaluation summary and selection evaluation
  
  Combined with the above comparison scores, a comprehensive evaluation will be considered from the two stages of Demo and production (commercial use).
  
  The overall evaluation score is as follows:
  
  ##The selection evaluation is as follows:
  
  ##Part 04
  
  Summary ➢
  
  Demo stage: Wen Xinyiyan is the first AI choice, NewBing and ChatGPT are the alternative AI, and ChatGLM As an exploration direction of self-developed AIGC (requires GPU resources). ➢
  
  In the production and commercial stage, multiple lines are available:
  # Domestic mainland regions are seeking to introduce AI in the form of B-side cooperation with Wen Xinyiyan;
  
  Domestic Hong Kong, Macao and Taiwan regions can consider introducing OpenAI’s official GPT-4 API for AI introduction;
  
  Based on the Tsinghua ChatGLM model, build And fine-tune the development of independent AI.
  
  Part 05
  
  Conclusion WebGPUfor

The above is the detailed content of Five-minute technical talk | AIGC introduction and application selection evaluation. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

What's New in Windows 11 KB5054979 & How to Fix Update Issues

3 weeks ago By DDD

How to fix KB5055523 fails to install in Windows 11?

2 weeks ago By DDD

InZoi: How To Apply To School And University

3 weeks ago By DDD

How to fix KB5055518 fails to install in Windows 10?

2 weeks ago By DDD

Roblox: Dead Rails – How To Summon And Defeat Nikola Tesla

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7814

Java Tutorial

1646

CakePHP Tutorial

1402

Laravel Tutorial

1300

PHP Tutorial

1238

Related knowledge

The top ten free platform recommendations for real-time data on currency circle markets are released Apr 22, 2025 am 08:12 AM

Cryptocurrency data platforms suitable for beginners include CoinMarketCap and non-small trumpet. 1. CoinMarketCap provides global real-time price, market value, and trading volume rankings for novice and basic analysis needs. 2. The non-small quotation provides a Chinese-friendly interface, suitable for Chinese users to quickly screen low-risk potential projects.

okx online okx exchange official website online Apr 22, 2025 am 06:45 AM

The detailed introduction of OKX Exchange is as follows: 1) Development history: Founded in 2017 and renamed OKX in 2022; 2) Headquartered in Seychelles; 3) Business scope covers a variety of trading products and supports more than 350 cryptocurrencies; 4) Users are spread across more than 200 countries, with tens of millions of users; 5) Multiple security measures are adopted to protect user assets; 6) Transaction fees are based on the market maker model, and the fee rate decreases with the increase in trading volume; 7) It has won many honors, such as "Cryptocurrency Exchange of the Year".

A list of special services for major virtual currency trading platforms Apr 22, 2025 am 08:09 AM

Institutional investors should choose compliant platforms such as Coinbase Pro and Genesis Trading, focusing on cold storage ratios and audit transparency; retail investors should choose large platforms such as Binance and Huobi, focusing on user experience and security; users in compliance-sensitive areas can conduct fiat currency trading through Circle Trade and Huobi Global, and mainland Chinese users need to go through compliant over-the-counter channels.

Top 10 latest releases of virtual currency trading platforms for bulk transactions Apr 22, 2025 am 08:18 AM

The following factors should be considered when choosing a bulk trading platform: 1. Liquidity: Priority is given to platforms with an average daily trading volume of more than US$5 billion. 2. Compliance: Check whether the platform holds licenses such as FinCEN in the United States, MiCA in the European Union. 3. Security: Cold wallet storage ratio and insurance mechanism are key indicators. 4. Service capability: Whether to provide exclusive account managers and customized transaction tools.

A list of top ten virtual currency trading platforms that support multiple currencies Apr 22, 2025 am 08:15 AM

Priority is given to compliant platforms such as OKX and Coinbase, enabling multi-factor verification, and asset self-custody can reduce dependencies: 1. Select an exchange with a regulated license; 2. Turn on the whitelist of 2FA and withdrawals; 3. Use a hardware wallet or a platform that supports self-custody.

Recommended top 10 for easy access to digital currency trading apps (latest ranking in 25) Apr 22, 2025 am 07:45 AM

The core advantage of gate.io (global version) is that the interface is minimalist, supports Chinese, and the fiat currency trading process is intuitive; Binance (simplified version) has the highest global trading volume, and the simple version model only retains spot trading; OKX (Hong Kong version) has the simple version of the interface is simple, supports Cantonese/Mandarin, and has a low threshold for derivative trading; Huobi Global Station (Hong Kong version) has the core advantage of being an old exchange, launches a meta-universe trading terminal; KuCoin (Chinese Community Edition) has the core advantage of supporting 800 currencies, and the interface adopts WeChat interaction; Kraken (Hong Kong version) has the core advantage of being an old American exchange, holding a Hong Kong SVF license, and the interface is simple; HashKey Exchange (Hong Kong licensed) has the core advantage of being a well-known licensed exchange in Hong Kong, supporting France

Tips and recommendations for the top ten market websites in the currency circle 2025 Apr 22, 2025 am 08:03 AM

Domestic user adaptation solutions include compliance channels and localization tools. 1. Compliance channels: Franchise currency exchange through OTC platforms such as Circle Trade, domestically, they need to go through Hong Kong or overseas platforms. 2. Localization tools: Use the currency circle network to obtain Chinese information, and Huobi Global Station provides a meta-universe trading terminal.

Summary of the top ten Apple version download portals for digital currency exchange apps Apr 22, 2025 am 09:27 AM

Provides a variety of complex trading tools and market analysis. It covers more than 100 countries, has an average daily derivative trading volume of over US$30 billion, supports more than 300 trading pairs and 200 times leverage, has strong technical strength, a huge global user base, provides professional trading platforms, secure storage solutions and rich trading pairs.

See all articles

Five-minute technical talk | AIGC introduction and application selection evaluation

Part 01 Introduction to AIGC

Part 02 AIGC Application and Model Evaluation

Evaluation summary and selection evaluation

Summary ➢

Conclusion WebGPUfor

Hot AI Tools

Undresser.AI Undress

AI Clothes Remover

Undress AI Tool

Clothoff.io

Video Face Swap

Hot Article

Hot Tools

Notepad++7.3.1

SublimeText3 Chinese version

Zend Studio 13.0.1

Dreamweaver CS6

SublimeText3 Mac version

Hot Topics