Home Technology peripherals AI Enterprises 'enter the pit' of large models, why are they recommended to be produced by large manufacturers?

Enterprises 'enter the pit' of large models, why are they recommended to be produced by large manufacturers?

Sep 19, 2023 pm 04:17 PM
AI large model

Enterprises enter the pit of large models, why are they recommended to be produced by large manufacturers?

Less than a year after the release of GPT-3.5 abroad, and less than half a year after the release of Wenxinyiyan in China, China has fast-forwarded to the "Battle of Hundreds of Models". Entrants include both large manufacturers and a large number of start-up companies. However, the competition for large models is fierce, and if enterprise users do not have a sharp eye, they are likely to fall into the trap, resulting in the project being unfinished.

The large models of some major manufacturers, such as Wenxin Large Models, Tongyi Qianwen, Pangu Large Models, etc., are gradually widening the gap. SuperCLUE's latest evaluation list shows that Wen Xinyiyan has surpassed GPT-3.5turbo, and domestic large models such as GLM-130B are also at the top of the list. Domestic large models have become an important part of the world in terms of quantity, and are rapidly catching up with the most advanced GPT-4 in terms of quality.

Enterprises enter the pit of large models, why are they recommended to be produced by large manufacturers?

Behind the "Battle of 100 Models", the industry is optimistic that large model technological innovation will promote industrial digitization and create trillions of market value. From the current point of view, large models "produced by major manufacturers" represented by Wen Xinyiyan and Tongyi Qianwen occupy a dominant position at the technical level. At the market level, they have also won more project cooperation by building an industrial ecology.

Why are the large models produced by big manufacturers more powerful and more popular with customers than start-up companies? In the author's opinion, there are three main reasons:

First of all, large models must eventually be applied in industry scenarios, and it is not a simple "one-shot deal." As a symbol of greater stability, security, and reliability, major manufacturers generally have sufficient accumulation in AI technology bases. Enterprise customers tend to focus on the application level and do not necessarily have strong accumulation of underlying AI technology. Major manufacturers such as Baidu, Alibaba, and Huawei have built AI bases that have been tested in actual combat. Looking at startups, there are many highlights and breakthroughs in technological innovation. However, from the perspective of full-stack AI technology base accumulation and long-term and stable customer service, The overall capability and battery life are still questionable.

Secondly, large manufacturers have stronger comprehensive strength to invest in iterative development of large model technology. For example, Baidu can mobilize the entire group to develop Wenxin Yiyan. The latest iteration of Wenxin Big Model 3.5 has an inference speed that is 17 times faster than the 3.0 version, and the model effect is improved by more than 50%. On the other hand, the most popular start-up company, light years away, chose to "sell out" to Meituan in less than half a year. Some large models developed based on open source technology lack sufficient underlying technology accumulation and autonomous development capabilities.

According to the latest "AI Large Model Technical Capability Assessment Report, 2023" released by market research company IDC, Baidu Wenxin Large Model 3.5 scored 7 full scores in 12 indicators, including "algorithm model", The two key indicators of "industry coverage" ranked first in the overall score; Alibaba Tongyi Large Model, which ranked second, received full scores in 6 out of 11 indicators and was the only manufacturer to receive full scores in "service capabilities".

Enterprises enter the pit of large models, why are they recommended to be produced by large manufacturers?

Enterprises enter the pit of large models, why are they recommended to be produced by large manufacturers?

Third, the implementation of large models in industry scenarios, subsequent service delivery, operation and maintenance, etc., are "easy for large manufacturers to master" ”, which is a difficult challenge for start-up companies. Large manufacturers can equip vertical industries and key customers with dedicated service teams to relieve customers from worries about embracing new technologies. However, start-up companies often lack experience in serving government and enterprise customers. Large-model product applications developed based on open source technology must do a good job in all aspects. Service of process was grudging.

To sum up, the big model not only looks at various technical parameters, but also looks at the industry’s implementation, the industry’s “know-how” and successful experience. Products produced by major manufacturers currently occupy a dominant position in terms of technology development, industry applications and services. Of course, the "Battle of One Hundred Models" may not be the "War of One Hundred Regiments" more than ten years ago, with only two winners in the end. The large model is still in the early stages of development, and there are more possibilities in the future, including the possibility of surpassing GPT-4 and more European and American competing products.

The above is the detailed content of Enterprises 'enter the pit' of large models, why are they recommended to be produced by large manufacturers?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Java Tutorial
1664
14
PHP Tutorial
1268
29
C# Tutorial
1242
24
Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Bytedance Cutting launches SVIP super membership: 499 yuan for continuous annual subscription, providing a variety of AI functions Jun 28, 2024 am 03:51 AM

This site reported on June 27 that Jianying is a video editing software developed by FaceMeng Technology, a subsidiary of ByteDance. It relies on the Douyin platform and basically produces short video content for users of the platform. It is compatible with iOS, Android, and Windows. , MacOS and other operating systems. Jianying officially announced the upgrade of its membership system and launched a new SVIP, which includes a variety of AI black technologies, such as intelligent translation, intelligent highlighting, intelligent packaging, digital human synthesis, etc. In terms of price, the monthly fee for clipping SVIP is 79 yuan, the annual fee is 599 yuan (note on this site: equivalent to 49.9 yuan per month), the continuous monthly subscription is 59 yuan per month, and the continuous annual subscription is 499 yuan per year (equivalent to 41.6 yuan per month) . In addition, the cut official also stated that in order to improve the user experience, those who have subscribed to the original VIP

Big model app Tencent Yuanbao is online! Hunyuan is upgraded to create an all-round AI assistant that can be carried anywhere Big model app Tencent Yuanbao is online! Hunyuan is upgraded to create an all-round AI assistant that can be carried anywhere Jun 09, 2024 pm 10:38 PM

On May 30, Tencent announced a comprehensive upgrade of its Hunyuan model. The App "Tencent Yuanbao" based on the Hunyuan model was officially launched and can be downloaded from Apple and Android app stores. Compared with the Hunyuan applet version in the previous testing stage, Tencent Yuanbao provides core capabilities such as AI search, AI summary, and AI writing for work efficiency scenarios; for daily life scenarios, Yuanbao's gameplay is also richer and provides multiple features. AI application, and new gameplay methods such as creating personal agents are added. "Tencent does not strive to be the first to make large models." Liu Yuhong, vice president of Tencent Cloud and head of Tencent Hunyuan large model, said: "In the past year, we continued to promote the capabilities of Tencent Hunyuan large model. In the rich and massive Polish technology in business scenarios while gaining insights into users’ real needs

Context-augmented AI coding assistant using Rag and Sem-Rag Context-augmented AI coding assistant using Rag and Sem-Rag Jun 10, 2024 am 11:08 AM

Improve developer productivity, efficiency, and accuracy by incorporating retrieval-enhanced generation and semantic memory into AI coding assistants. Translated from EnhancingAICodingAssistantswithContextUsingRAGandSEM-RAG, author JanakiramMSV. While basic AI programming assistants are naturally helpful, they often fail to provide the most relevant and correct code suggestions because they rely on a general understanding of the software language and the most common patterns of writing software. The code generated by these coding assistants is suitable for solving the problems they are responsible for solving, but often does not conform to the coding standards, conventions and styles of the individual teams. This often results in suggestions that need to be modified or refined in order for the code to be accepted into the application

Seven Cool GenAI & LLM Technical Interview Questions Seven Cool GenAI & LLM Technical Interview Questions Jun 07, 2024 am 10:06 AM

To learn more about AIGC, please visit: 51CTOAI.x Community https://www.51cto.com/aigc/Translator|Jingyan Reviewer|Chonglou is different from the traditional question bank that can be seen everywhere on the Internet. These questions It requires thinking outside the box. Large Language Models (LLMs) are increasingly important in the fields of data science, generative artificial intelligence (GenAI), and artificial intelligence. These complex algorithms enhance human skills and drive efficiency and innovation in many industries, becoming the key for companies to remain competitive. LLM has a wide range of applications. It can be used in fields such as natural language processing, text generation, speech recognition and recommendation systems. By learning from large amounts of data, LLM is able to generate text

Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Can fine-tuning really allow LLM to learn new things: introducing new knowledge may make the model produce more hallucinations Jun 11, 2024 pm 03:57 PM

Large Language Models (LLMs) are trained on huge text databases, where they acquire large amounts of real-world knowledge. This knowledge is embedded into their parameters and can then be used when needed. The knowledge of these models is "reified" at the end of training. At the end of pre-training, the model actually stops learning. Align or fine-tune the model to learn how to leverage this knowledge and respond more naturally to user questions. But sometimes model knowledge is not enough, and although the model can access external content through RAG, it is considered beneficial to adapt the model to new domains through fine-tuning. This fine-tuning is performed using input from human annotators or other LLM creations, where the model encounters additional real-world knowledge and integrates it

Advanced practice of industrial knowledge graph Advanced practice of industrial knowledge graph Jun 13, 2024 am 11:59 AM

1. Background Introduction First, let’s introduce the development history of Yunwen Technology. Yunwen Technology Company...2023 is the period when large models are prevalent. Many companies believe that the importance of graphs has been greatly reduced after large models, and the preset information systems studied previously are no longer important. However, with the promotion of RAG and the prevalence of data governance, we have found that more efficient data governance and high-quality data are important prerequisites for improving the effectiveness of privatized large models. Therefore, more and more companies are beginning to pay attention to knowledge construction related content. This also promotes the construction and processing of knowledge to a higher level, where there are many techniques and methods that can be explored. It can be seen that the emergence of a new technology does not necessarily defeat all old technologies. It is also possible that the new technology and the old technology will be integrated with each other.

To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework To provide a new scientific and complex question answering benchmark and evaluation system for large models, UNSW, Argonne, University of Chicago and other institutions jointly launched the SciQAG framework Jul 25, 2024 am 06:42 AM

Editor |ScienceAI Question Answering (QA) data set plays a vital role in promoting natural language processing (NLP) research. High-quality QA data sets can not only be used to fine-tune models, but also effectively evaluate the capabilities of large language models (LLM), especially the ability to understand and reason about scientific knowledge. Although there are currently many scientific QA data sets covering medicine, chemistry, biology and other fields, these data sets still have some shortcomings. First, the data form is relatively simple, most of which are multiple-choice questions. They are easy to evaluate, but limit the model's answer selection range and cannot fully test the model's ability to answer scientific questions. In contrast, open-ended Q&A

Xiaomi Byte joins forces! A large model of Xiao Ai's access to Doubao: already installed on mobile phones and SU7 Xiaomi Byte joins forces! A large model of Xiao Ai's access to Doubao: already installed on mobile phones and SU7 Jun 13, 2024 pm 05:11 PM

According to news on June 13, according to Byte's "Volcano Engine" public account, Xiaomi's artificial intelligence assistant "Xiao Ai" has reached a cooperation with Volcano Engine. The two parties will achieve a more intelligent AI interactive experience based on the beanbao large model. It is reported that the large-scale beanbao model created by ByteDance can efficiently process up to 120 billion text tokens and generate 30 million pieces of content every day. Xiaomi used the beanbao large model to improve the learning and reasoning capabilities of its own model and create a new "Xiao Ai Classmate", which not only more accurately grasps user needs, but also provides faster response speed and more comprehensive content services. For example, when a user asks about a complex scientific concept, &ldq

See all articles