


NVIDIA launches new AI chip H200, performance improved by 90%! China's computing power achieves independent breakthrough!
While the world is still obsessed with the NVIDIA H100 chip and is buying it like crazy to meet the growing demand for AI computing power, on Monday local time, NVIDIA quietly launched the latest AI chip H200 for the training of large AI models,Compared with its previous generation product H100, the performance of H200 has been improved by about 60% to 90%.
H200 is an upgraded version of NVIDIA H100. It is also based on the Hopper architecture like H100. The main upgrade includes 141GB of HBM3e video memory, and the video memory bandwidth has increased from 3.35TB/s of H100 to 4.8TB/s. According to Nvidia’s official website, H200 is also the company’s first chip to use HBM3e memory. This memory is faster and has larger capacity, so it is more suitable for large language models. NVIDIA said: "With HBM3e, NVIDIA H200 delivers 141GB of memory at 4.8TB per second, nearly twice the capacity and 2.4 times the bandwidth compared to the A100." According to the official pictures,
The output speed of H200 on large models Llama 2 and GPT-3.5 is 1.9 times and 1.6 times that of H100 respectively, and its speed in high-performance computing HPC is even higher. 110 times faster than a dual-core x86 CPU. H200 is expected to be delivered starting in the second quarter of next year, and Nvidia has not yet announced its price.
However, like H100 and other AI chips, NVIDIA will not supply H200 chips to Chinese manufacturers.
On October 17 this year, the U.S. Department of Commerce issued new export control regulations for chips, especially stricter controls on high-computing power AI chips. One of the very important new regulations is to adjust the restricted standards for advanced computing chips and set a new "performance density threshold" as a parameter. According to new regulations, NVIDIA China's "special edition" H800 and A800 have been restricted from exporting to China.
The lack of the most advanced AI chips will bring some challenges to the development of the domestic AI industry in the short term, but it also contains opportunities:First of all, there are
opportunities for replacing domestic computing power chips. For example, Baidu recently ordered 1,600 pieces of Huawei Ascend 910B chips; secondly, due to the mismatch between supply and demand, the price of computing power rental has increased, which will Good for computing power leasing companies. On the 14th, Huina Technology announced that the computing power service charges for high-performance computing servers embedded with NVIDIA A100 chips will be increased by 100%. Finally, advanced packaging technology represented by Chiplet can solve the problem to a certain extent. The problem of insufficient production capacity of advanced processes is expected to usher in accelerated development. So besides Huawei Shengteng, who else with domestic computing power can carry the banner? What are the companies related to the industry chain? What other directions are expected to benefit indirectly?
Our four major benefit directions and representative companies from the independent control of domestic AI computing powerhave produced a special topic "It is imperative to independently control AI computing power" for friends in need , welcome to scan the QR code below or add the assistant WeChat: hzkyxjyy, get it for free or read it in your circle of friends. After successful addition, all past reports and each future special report can be read for free.
The above is the detailed content of NVIDIA launches new AI chip H200, performance improved by 90%! China's computing power achieves independent breakthrough!. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics

According to news on November 14, Nvidia officially released the new H200 GPU at the "Supercomputing23" conference on the morning of the 13th local time, and updated the GH200 product line. Among them, the H200 is still built on the existing Hopper H100 architecture. However, more high-bandwidth memory (HBM3e) has been added to better handle the large data sets required to develop and implement artificial intelligence, making the overall performance of running large models improved by 60% to 90% compared to the previous generation H100. The updated GH200 will also power the next generation of AI supercomputers. In 2024, more than 200 exaflops of AI computing power will be online. H200

On June 19, according to media reports in Taiwan, China, Google (Google) has approached MediaTek to cooperate in order to develop the latest server-oriented AI chip, and plans to hand it over to TSMC's 5nm process for foundry, with plans for mass production early next year. According to the report, sources revealed that this cooperation between Google and MediaTek will provide MediaTek with serializer and deserializer (SerDes) solutions and help integrate Google’s self-developed tensor processor (TPU) to help Google create the latest Server AI chips will be more powerful than CPU or GPU architectures. The industry points out that many of Google's current services are related to AI. It has invested in deep learning technology many years ago and found that using GPUs to perform AI calculations is very expensive. Therefore, Google decided to

After the debut of the NVIDIA H200, known as the world's most powerful AI chip, the industry began to look forward to NVIDIA's more powerful B100 chip. At the same time, OpenAI, the most popular AI start-up company this year, has begun to develop a more powerful and complex GPT-5 model. Guotai Junan pointed out in the latest research report that the B100 and GPT5 with boundless performance are expected to be released in 2024, and the major upgrades may release unprecedented productivity. The agency stated that it is optimistic that AI will enter a period of rapid development and its visibility will continue until 2024. Compared with previous generations of products, how powerful are B100 and GPT-5? Nvidia and OpenAI have already given a preview: B100 may be more than 4 times faster than H100, and GPT-5 may achieve super

KL730's progress in energy efficiency has solved the biggest bottleneck in the implementation of artificial intelligence models - energy costs. Compared with the industry and previous Nerner chips, the KL730 chip has increased by 3 to 4 times. The KL730 chip supports the most advanced lightweight GPT large Language models, such as nanoGPT, and provide effective computing power of 0.35-4 tera per second. AI company Kneron today announced the release of the KL730 chip, which integrates automotive-grade NPU and image signal processing (ISP) to bring safe and low-energy AI The capabilities are empowered in various application scenarios such as edge servers, smart homes, and automotive assisted driving systems. San Diego-based Kneron is known for its groundbreaking neural processing units (NPUs), and its latest chip, the KL730, aims to achieve

While the world is still obsessed with NVIDIA H100 chips and buying them crazily to meet the growing demand for AI computing power, on Monday local time, NVIDIA quietly launched its latest AI chip H200, which is used for training large AI models. Compared with other The performance of the previous generation products H100 and H200 has been improved by about 60% to 90%. The H200 is an upgraded version of the Nvidia H100. It is also based on the Hopper architecture like the H100. The main upgrade includes 141GB of HBM3e video memory, and the video memory bandwidth has increased from the H100's 3.35TB/s to 4.8TB/s. According to Nvidia’s official website, H200 is also the company’s first chip to use HBM3e memory. This memory is faster and has larger capacity, so it is more suitable for large languages.

Google’s CEO likened the AI revolution to humanity’s use of fire, but now the digital fire that fuels the industry—AI chips—is hard to come by. The new generation of advanced chips that drive AI operations are almost all manufactured by NVIDIA. As ChatGPT explodes out of the circle, the market demand for NVIDIA graphics processing chips (GPUs) far exceeds the supply. "Because there is a shortage, the key is your circle of friends," said Sharon Zhou, co-founder and CEO of Lamini, a startup that helps companies build AI models such as chatbots. "It's like toilet paper during the epidemic." This kind of thing. The situation has limited the computing power that cloud service providers such as Amazon and Microsoft can provide to customers such as OpenAI, the creator of ChatGPT.

Microsoft is developing AI-optimized chips to reduce the cost of training generative AI models, such as the ones that power the OpenAIChatGPT chatbot. The Information recently quoted two people familiar with the matter as saying that Microsoft has been developing a new chipset code-named "Athena" since at least 2019. Employees at Microsoft and OpenAI already have access to the new chips and are using them to test their performance on large language models such as GPT-4. Training large language models requires ingesting and analyzing large amounts of data in order to create new output content for the AI to imitate human conversation. This is a hallmark of generative AI models. This process requires a large number (on the order of tens of thousands) of A

According to the original words, it can be rewritten as: (Global TMT August 16, 2023) AI company Kneron, headquartered in San Diego and known for its groundbreaking neural processing units (NPU), announced the release of the KL730 chip. The chip integrates automotive-grade NPU and image signal processing (ISP), and provides safe and low-energy AI capabilities to various application scenarios such as edge servers, smart homes, and automotive assisted driving systems. The KL730 chip has achieved great results in terms of energy efficiency. A breakthrough, compared with previous Nerner chips, its energy efficiency has increased by 3 to 4 times, and is 150% to 200% higher than similar products in major industries. The chip has an effective computing power of 0.35-4 tera per second and can support the most advanced lightweight GPT large
