


Five-minute technical talk | AIGC introduction and application selection evaluation
Part 01 Introduction to AIGC
AIGC (AI- Generated Content (artificial intelligence production content) refers to the production method that uses AI technology to automatically or assist in generating various forms of content such as text, code, images, voices, videos, and 3D objects. AIGC represents a new trend in the development of AI technology, from perceiving and understanding the world to generating and creating the world, and from analytical capabilities to creative capabilities. AIGC has also brought about changes in content creation, improving the quality, efficiency and diversity of content.
1.1 Text generation
Text generation refers to using AI technology automatically generates text content that conforms to grammar and logic based on given input (such as keywords, pictures, voices, etc.), which is an important aspect of AIGC.
The application scenarios of text generation are very rich, including news writing, novel creation, marketing copywriting, customer service Q&A, chat robots, educational coaching, knowledge graphs, and summary generation wait.
➤ Wen Xinyiyan: A large AI model launched by Baidu that supports multi-modal output and can perform literary creation, business copywriting creation, and mathematical logic calculations , Chinese understanding, multi-modal generation, etc.
➤ ChatGPT: A chat application based on the GPT series model launched by OpenAI. Currently, the GPT-4 model has been launched. ChatGPT based on the GPT-4 model can analyze images. And interact with text and pictures.
1.2 Code generation
Includes code completion, Code refactoring, code optimization, code annotation, etc. can cover a variety of programming languages and fields. Based on OpenAI's GPT-4 model, it is even possible to generate the corresponding website code based on a hand-drawn product prototype draft.
➤ Github Copilot: An AI-assisted programming tool developed based on the OpenAI Codex model. It supports dozens of programming languages and can perform real-time processing based on code or comments. It can provide code suggestions and entire functions in the editor, and can also achieve a pair programming experience through chat interaction.
➤ Cursor: an independent IDE software that integrates OpenAI’s GPT model. Similar to Github Copilot, Cursor can write code, edit code and chat through AI.
1.3 Image generation
Image generation refers to using Artificial intelligence technology, based on given input (such as natural language, images, videos, etc.), automatically generates images that conform to semantics and aesthetics, is an important aspect of AIGC. Image generation has a wide range of application scenarios, including artistic creation, entertainment media, education and training, e-commerce marketing, medical diagnosis, etc.
➤ Wenxin Yige: An AI art and creative assistance platform launched by Baidu. Paintings can be automatically generated based on text description and style selection.
➤ DALL-E2: A generative model based on an adaptive multi-modal encoder launched by OpenAI in (such as text, pictures, etc.) to automatically generate high-quality images.
➤ Midjourney: An AI painting tool released in March 2022. It can generate pictures based on natural language, select the artistic styles of different painters, and also Identify specific lenses or photography terms. Paintings generated by this tool have won first prize in art competitions.
1.4 Video generation
Video generation is mainly divided into two types: video editing and independent video generation. Video editing can be used for video super-scoring, repair and editing. Autonomous video generation can be used for image-to-video conversion, or for generating matching videos given descriptive text. The following are some related applications:
➤ Deepfake: This is an AI video generation platform based on GAN technology, which can realize face changing, voice conversion, and expression imitation and other functions. Users only need to upload a picture or a video as a reference, and the video will be automatically generated.
➤ Make-A-Video: An AI system launched by Meta Company that can convert text into video. It can create one-of-a-kind videos filled with vibrant colors, people, and scenery based on just a few words or lines of text.
1.5 3D modeling
AIGC-based 3D Modeling technology refers to the use of artificial intelligence technology to automatically generate 3D models that comply with semantics and aesthetics based on given input (such as natural language, images, etc.). This area is currently in an early stage of exploration. The following are some related applications or models:
➤ AICommand: an open source AI command plug-in based on Unity that can generate 3D scenes through text descriptions and Text is adjusted and optimized for 3D scenes. (https://github.com/keijiro/AICommand)
➤ ICON: An open source AI model that generates 3D character modeling based on character pictures (https: //github.com/YuliangXiu/ICON). You can experience and download the generated 3D model online: https://huggingface.co/spaces/Yuliang/ICON
Part 02 AIGC Application and Model Evaluation
After ChatGPT was launched by OpenAI at the end of 2022, the cumulative number of users exceeded 100 million in just two months. It quickly became popular all over the world. As a result, the AI iPhone moment has arrived, and major IT manufacturers quickly followed up. The following is an introduction to some relevant applications or models as of April 2023.
- Wen Xin’s words: See above.
- ChatGPT: See above.
- #Bard: A lightweight version of the NLP model launched by Google based on LaMDA.
- New Bing: An intelligent search engine based on the GPT4 model launched by Microsoft. It can interact with users in natural language and combine with real-time search results to provide Information, entertainment, creation and other functions.
- ChatGLM: A large conversational language model launched by Tsinghua University based on GLM architecture, open source and supporting Chinese and English bilinguals. Low-cost minimal model construction can be carried out based on CPU, and the model can also be developed and fine-tuned secondaryly.
- Poe: A free AI chatbot application developed by Quora. The application integrates 6 mainstream AI chatbots including: ChatGPT and GPT-4. .
will be evaluated and compared from the following aspects (except Poe):
- Natural language processing
- Logical reasoning
- Code generation
- Multi-modal support
PS:
- The ChatGPT participating in the evaluation is based on the GPT-3.5 model.
- The ChatGLM participating in the evaluation is only the minimized model: chatglm-6b-int4-qe. For practical applications, the chatglm-6b model that requires GPU memory should be built, and the quality of answers will be greatly improved.
##2.1 Natural Language Processing
Evaluation Content:
➪Multiple rounds of dialogue: Let’s create a children’s story together. The rule is that I say something first and you say something next, alternating. It ends when I say "I'm done with the story." Do you understand?
➪Language understanding: My boss said 1 1=3. Everything my boss said is right, so 1 1=3, right?
➪Language Translation: Translate this passage into English: One flower blooming alone is not spring, but a hundred flowers blooming together fill the garden.
➪Emotional analysis: Analyze the emotional color of this passage: I like this new movie very much. It made me laugh many times and moved me. Cried.
- ##ChatGPT
Bard
- ChatGLM
The scores are as follows:
##2.2 Logical reasoning
Assessment content:➪ In a There are five books on the shelf: red book, green book, blue book, orange book and yellow book. The green book is to the left of the yellow book, the yellow book is the third from the left, the red book is the second from the left, and the blue book is on the far right. What is the order of these books?
➪ There are three points A, B, and C on a 100-meter-long straight line. The position of A is uncertain. The distance between A and B is 5 meters. The distance between A and C is 10 meters, what is the possible distance between B and C?
##➪ If 2
ChatGPT#文心一言
#Bard
- #NewBing
- ChatGLM
##The scores are as follows:
#2.3 Coding AbilityEvaluation content:
Code generation: writing a python function , accepts an integer as input and determines whether it is a palindrome.
- Code explanation: Explain this line of python code: my_list = [x for x in my_list if x % 2 == 0]
- Bug detection: Where is the BUG in this line of code: my_list = [x for x in my_list if x % 2 = 0]
ChatGPT
文心一言Bard
- #NewBing
- #ChatGLM
#The scores are as follows:
##2.4 Multi-modal supportMultimodal support refers to the ability to handle multiple data types, such as text, images, audio and video, etc. For example: through text input, pictures, audio and video are automatically generated based on text requirements; through picture or audio and video input, content summary text is output, etc.
##ChatGPT- ChatGPT based on the GPT-3.5 model does not support multi-modal input and output capabilities, while ChatGPT based on the GPT-4 model can analyze pictures and analyze feedback text.
##文心一言
- Wen Xin Yi Yan can currently generate images and voices based on text descriptions. The video generation capability was demonstrated at the press conference, but during actual use, the video could not be generated.
#Bard- Google Bard does not support multi-modal capabilities.
##NewBing
- NewBing's creativity mode supports generating pictures through text descriptions.
- ChatGLM
##Tsinghua’s ChatGLM does not support multiple Modal capabilities.
The scores are as follows:
#Part 03Evaluation summary and selection evaluation
Combined with the above comparison scores, a comprehensive evaluation will be considered from the two stages of Demo and production (commercial use).
The overall evaluation score is as follows:
##The selection evaluation is as follows:##Part 04
Summary ➢
Demo stage: Wen Xinyiyan is the first AI choice, NewBing and ChatGPT are the alternative AI, and ChatGLM As an exploration direction of self-developed AIGC (requires GPU resources). ➢
In the production and commercial stage, multiple lines are available:
# Domestic mainland regions are seeking to introduce AI in the form of B-side cooperation with Wen Xinyiyan;- Domestic Hong Kong, Macao and Taiwan regions can consider introducing OpenAI’s official GPT-4 API for AI introduction;
- Based on the Tsinghua ChatGLM model, build And fine-tune the development of independent AI.
- Part 05
Conclusion WebGPUfor
The above is the detailed content of Five-minute technical talk | AIGC introduction and application selection evaluation. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

Cryptocurrency data platforms suitable for beginners include CoinMarketCap and non-small trumpet. 1. CoinMarketCap provides global real-time price, market value, and trading volume rankings for novice and basic analysis needs. 2. The non-small quotation provides a Chinese-friendly interface, suitable for Chinese users to quickly screen low-risk potential projects.

The detailed introduction of OKX Exchange is as follows: 1) Development history: Founded in 2017 and renamed OKX in 2022; 2) Headquartered in Seychelles; 3) Business scope covers a variety of trading products and supports more than 350 cryptocurrencies; 4) Users are spread across more than 200 countries, with tens of millions of users; 5) Multiple security measures are adopted to protect user assets; 6) Transaction fees are based on the market maker model, and the fee rate decreases with the increase in trading volume; 7) It has won many honors, such as "Cryptocurrency Exchange of the Year".

Institutional investors should choose compliant platforms such as Coinbase Pro and Genesis Trading, focusing on cold storage ratios and audit transparency; retail investors should choose large platforms such as Binance and Huobi, focusing on user experience and security; users in compliance-sensitive areas can conduct fiat currency trading through Circle Trade and Huobi Global, and mainland Chinese users need to go through compliant over-the-counter channels.

The following factors should be considered when choosing a bulk trading platform: 1. Liquidity: Priority is given to platforms with an average daily trading volume of more than US$5 billion. 2. Compliance: Check whether the platform holds licenses such as FinCEN in the United States, MiCA in the European Union. 3. Security: Cold wallet storage ratio and insurance mechanism are key indicators. 4. Service capability: Whether to provide exclusive account managers and customized transaction tools.

Priority is given to compliant platforms such as OKX and Coinbase, enabling multi-factor verification, and asset self-custody can reduce dependencies: 1. Select an exchange with a regulated license; 2. Turn on the whitelist of 2FA and withdrawals; 3. Use a hardware wallet or a platform that supports self-custody.

The core advantage of gate.io (global version) is that the interface is minimalist, supports Chinese, and the fiat currency trading process is intuitive; Binance (simplified version) has the highest global trading volume, and the simple version model only retains spot trading; OKX (Hong Kong version) has the simple version of the interface is simple, supports Cantonese/Mandarin, and has a low threshold for derivative trading; Huobi Global Station (Hong Kong version) has the core advantage of being an old exchange, launches a meta-universe trading terminal; KuCoin (Chinese Community Edition) has the core advantage of supporting 800 currencies, and the interface adopts WeChat interaction; Kraken (Hong Kong version) has the core advantage of being an old American exchange, holding a Hong Kong SVF license, and the interface is simple; HashKey Exchange (Hong Kong licensed) has the core advantage of being a well-known licensed exchange in Hong Kong, supporting France

Domestic user adaptation solutions include compliance channels and localization tools. 1. Compliance channels: Franchise currency exchange through OTC platforms such as Circle Trade, domestically, they need to go through Hong Kong or overseas platforms. 2. Localization tools: Use the currency circle network to obtain Chinese information, and Huobi Global Station provides a meta-universe trading terminal.

Provides a variety of complex trading tools and market analysis. It covers more than 100 countries, has an average daily derivative trading volume of over US$30 billion, supports more than 300 trading pairs and 200 times leverage, has strong technical strength, a huge global user base, provides professional trading platforms, secure storage solutions and rich trading pairs.
