Home Technology peripherals AI Amazon strives to defend its cloud status, upgrades its self-developed AI chips, releases chat robot Q, and is the first to use Nvidia's new generation super chip

Amazon strives to defend its cloud status, upgrades its self-developed AI chips, releases chat robot Q, and is the first to use Nvidia's new generation super chip

Nov 29, 2023 am 08:37 AM
Amazon ai chip cloud status

Amazon is making every effort to defend its leadership in cloud computing. On the one hand, they upgraded their own cloud chips and launched Amazon's version of GPT, an artificial intelligence chatbot; on the other hand, they also deepened their cooperation with NVIDIA, launched new services based on NVIDIA chips, and jointly developed them with NVIDIA supercomputer

Dave Brown, vice president of AWS, said that by focusing the design of self-developed chips on actual workloads that are important to customers, AWS can provide them with the most advanced cloud infrastructure. The Graviton 4 launched this time is the fourth generation chip product within five years. As people’s interest in generative AI rises, the second generation AI chip Trainium 2 will help customers train themselves faster at lower cost and higher energy efficiency. machine learning model.

Graviton4 computing performance is improved by up to 30% compared to the previous generation

On Tuesday, November 28th, Eastern Time, Amazon’s cloud computing business AWS announced the launch of a new generation of AWS self-developed chips. Among them, the computing performance of the general-purpose chip Graviton4 is up to 30% higher than the previous generation Graviton3, with a 50% increase in cores and a 75% increase in memory bandwidth, thus providing the highest cost performance and energy utilization on the Amazon cloud server hosting service Amazon Elastic Compute Cloud (EC2) Effect.

Graviton4 improves security with full encryption of all high-speed physical hardware interfaces. AWS said Graviton4 will be available on memory-optimized Amazon EC2 R8g instances to help customers improve the execution of high-performance database, in-memory cache, and big data analytics workloads. R8g instances offer larger instance sizes with up to three times more vCPUs and three times more memory than previous generation R7g instances

In the next few months, it is planned to launch computers equipped with Graitons4. AWS said that in the five years since the launch of the Garviton project, more than 2 million Garviton processors have been produced, and the first 100 users of AWS EC2 have chosen to use Graviton

Amazon strives to defend its cloud status, upgrades its self-developed AI chips, releases chat robot Q, and is the first to use Nvidias new generation super chip

Trainium2 is four times faster and can train models with trillions of parameters

AWS has launched a new generation of AI chips called Trainium2, which is four times faster than the previous generation Trainium1. Trainium2 can deploy up to 100,000 chips in EC2 UltraCluster, enabling users to train base models (PM) and large language models (LLM) with trillions of parameters in a short time. Compared with the previous generation, Trainium2’s energy utilization has increased by two times

Trainium2 will be used on Amazon EC2 Trn2 instances, each containing 16 Trainium chips. Trn2 instances are designed to help customers scale the number of chip applications in next-generation EC2 UltraCluster, up to 100,000 Trainium2 chips, and provide up to 65 Execute computing power through petabyte-scale network connections through AWS Elastic Fabrication Adapters (EFA)

According to AWS, Trainium2 will be used to support new services starting next year

Amazon strives to defend its cloud status, upgrades its self-developed AI chips, releases chat robot Q, and is the first to use Nvidias new generation super chip

The first major customer, DGX Cloud, uses the upgraded version of Grace Hopper GH200 NVL32, which is the fastest GPU-driven AI supercomputer

During the annual conference re:Invent, AWS and NVIDIA announced on Tuesday an expanded strategic cooperation to provide state-of-the-art infrastructure, software and services to promote customers' generative AI innovation. This cooperation not only involves self-developed chips, but also includes cooperation in other fields

AWS will become the first cloud service provider to use the new multi-node NVLink technology NVIDIA H200 Grace Hopper super chip in the cloud. In other words, AWS will become the first important customer of the upgraded version of Grace Hopper

NVIDIA’s H200 NVL32 multi-node platform uses 32 Grace Hopper chips with NVLink and NVSwitch technology in a single instance. The platform will be used on Amazon EC2 instances connected to Amazon Network EFA and is powered by advanced virtualization (AWS Nitro System) and ultra-scale clusters (Amazon EC2 UltraClusters), allowing joint Amazon and Nvidia customers to scale deployments into the thousands. Designed H200 chip

NVIDIA and AWS will collaborate to host NVIDIA’s AI training-as-a-service DGX Cloud on AWS. This will be the first DGX cloud to feature the GH200 NVL32, providing developers with a single instance with maximum shared memory. AWS’s DGX Cloud will advance cutting-edge generative AI and training of large language models with over 1 trillion parameters

Nvidia and AWS are collaborating on a project called Ceiba to design the world’s fastest GPU-powered AI supercomputer. Powered by GH200 NVL32 and Amazon EFA's interconnect technology, this computer is a massive system. It is equipped with 16,384 GH200 super chips and has 65 exaflops of AI processing power. NVIDIA plans to use it to drive the next wave of generative AI innovation

Amazon strives to defend its cloud status, upgrades its self-developed AI chips, releases chat robot Q, and is the first to use Nvidias new generation super chip

The preview version of Amazon Q, the enterprise customer robot, is now online and can help developers develop applications on AWS

In addition to providing chips and cloud services, AWS also released a preview version of an AI chatbot called Amazon Q. Amazon Q is a new type of digital assistant that uses generative AI technology to work based on the business needs of enterprise customers. It helps enterprise customers search for information, write code and review business metrics

Q has received some training on code and documentation within AWS, which can be used by developers in the AWS cloud.

Developers can use Q to create applications on AWS, research best practices, correct errors, and get help writing new features for applications. Users can interact with Q through conversational Q&A to learn new knowledge, research best practices, and understand how to build applications on AWS without leaving the AWS console

Amazon will add Q to programs for enterprise intelligence software, call center workers and logistics management. AWS says customers can customize Q based on company data or personal profiles

Conversational Q&A is currently available in preview in all enterprise regions provided by AWS

The above is the detailed content of Amazon strives to defend its cloud status, upgrades its self-developed AI chips, releases chat robot Q, and is the first to use Nvidia's new generation super chip. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Java Tutorial
1654
14
PHP Tutorial
1252
29
C# Tutorial
1225
24
Amazon Kindle Oasis is discontinued in the United States, marking the end of the era of high-end e-readers Amazon Kindle Oasis is discontinued in the United States, marking the end of the era of high-end e-readers Feb 25, 2024 pm 12:10 PM

According to the latest news, Amazon has announced the discontinuation of its high-end e-reader Kindle Oasis and has removed it from the US market. This move indicates that the once highly anticipated Kindle Oasis has officially withdrawn from the market. Although there is still a small amount of stock in some overseas markets such as Canada and the United Kingdom, once sold out, it will no longer be available. This marks the beginning of this acclaimed high-end reader becoming a thing of the past. Kindle Oasis is loved by users for its excellent performance and design. However, as market demand changes and new products are launched, Amazon may have decided to discontinue this product. Although Kindle Oasis has left a certain mark on the market, Amazon may have shifted its focus to other product lines

New title: NVIDIA H200 released: HBM capacity increased by 76%, the most powerful AI chip that significantly improves large model performance by 90% New title: NVIDIA H200 released: HBM capacity increased by 76%, the most powerful AI chip that significantly improves large model performance by 90% Nov 14, 2023 pm 03:21 PM

According to news on November 14, Nvidia officially released the new H200 GPU at the "Supercomputing23" conference on the morning of the 13th local time, and updated the GH200 product line. Among them, the H200 is still built on the existing Hopper H100 architecture. However, more high-bandwidth memory (HBM3e) has been added to better handle the large data sets required to develop and implement artificial intelligence, making the overall performance of running large models improved by 60% to 90% compared to the previous generation H100. The updated GH200 will also power the next generation of AI supercomputers. In 2024, more than 200 exaflops of AI computing power will be online. H200

Amazon Prime Video will start rolling out ads on January 29th, and you'll have to pay extra to watch ad-free Amazon Prime Video will start rolling out ads on January 29th, and you'll have to pay extra to watch ad-free Jan 13, 2024 am 08:27 AM

Amazon has finally announced the implementation date for its much-anticipated Prime Video advertising program. Starting from January 29, 2024, PrimeVideo will begin to embed advertisements in some movies and TV series. Amazon explained in an email sent to users that this is done to continue investing in attractive content and to increase related investments in the long term. Amazon promises that the number of ads on Prime Video will be much lower than traditional TV and other streaming platforms. Members don’t need to take any additional action, and Prime membership prices remain the same. However, users who want to enjoy an ad-free experience can pay an additional $2.99 ​​per month (note from this site: currently about 21 yuan). The email also mentioned that Prime members can enjoy

Cloud computing giant launches legal battle: Amazon sues Nokia for patent infringement Cloud computing giant launches legal battle: Amazon sues Nokia for patent infringement Jul 31, 2024 pm 12:47 PM

According to news from this site on July 31, technology giant Amazon sued Finnish telecommunications company Nokia in the federal court of Delaware on Tuesday, accusing it of infringing on more than a dozen Amazon patents related to cloud computing technology. 1. Amazon stated in the lawsuit that Nokia abused Amazon Cloud Computing Service (AWS) related technologies, including cloud computing infrastructure, security and performance technologies, to enhance its own cloud service products. Amazon launched AWS in 2006 and its groundbreaking cloud computing technology had been developed since the early 2000s, the complaint said. "Amazon is a pioneer in cloud computing, and now Nokia is using Amazon's patented cloud computing innovations without permission," the complaint reads. Amazon asks court for injunction to block

MediaTek is rumored to have won a large order from Google for server AI chips and will supply high-speed Serdes chips MediaTek is rumored to have won a large order from Google for server AI chips and will supply high-speed Serdes chips Jun 19, 2023 pm 08:23 PM

On June 19, according to media reports in Taiwan, China, Google (Google) has approached MediaTek to cooperate in order to develop the latest server-oriented AI chip, and plans to hand it over to TSMC's 5nm process for foundry, with plans for mass production early next year. According to the report, sources revealed that this cooperation between Google and MediaTek will provide MediaTek with serializer and deserializer (SerDes) solutions and help integrate Google’s self-developed tensor processor (TPU) to help Google create the latest Server AI chips will be more powerful than CPU or GPU architectures. The industry points out that many of Google's current services are related to AI. It has invested in deep learning technology many years ago and found that using GPUs to perform AI calculations is very expensive. Therefore, Google decided to

Chinese e-book manufacturers are filling the void after Amazon Kindle exits the market, with sales increasing by 12.2% to 762,000 units in 2023 Chinese e-book manufacturers are filling the void after Amazon Kindle exits the market, with sales increasing by 12.2% to 762,000 units in 2023 Jan 26, 2024 pm 05:24 PM

According to news from this website on January 26, Luotu Technology today released a new "Global E-Paper Tablet Market Analysis Quarterly Report", which mentioned that global e-paper tablet shipments in 2023 will be 12.54 million units, a year-on-year increase of 17.2%. Among them, the sales volume of global e-book brands in the Chinese market reached 1.23 million units, a year-on-year increase of 20.6%, accounting for 9.8% of the global total, an increase of 0.5 percentage points from 2022. A total of 40 new products were released in the Chinese market throughout the year, continuing the popularity of 2022. In terms of brand performance, iFlytek, PalmReader, Aragonite, and Xiaoyuan lead the sales. This site learned from a report published by Luotu Technology that due to the withdrawal of Kindle e-books from the Chinese market on June 30, 2023, there will be a gap in the industry, resulting in domestic electronic

From Google to Amazon, tech giants' AI obsession From Google to Amazon, tech giants' AI obsession Jun 12, 2023 pm 05:51 PM

Recently, several prestigious foreign technology giants have demonstrated their AI ambitions. For example, Apple held WWDC23, Microsoft held Build23, and even Google held a search business conference in February. The actions of these giants undoubtedly highlight the rise of generative artificial intelligence (AIGC), and also bring a group of teams and institutions that were previously not interested in artificial intelligence. Now these big technology companies are betting heavily on artificial intelligence. A few notable signs are: Google AI, Microsoft Copilot, Apple Machine Learning, and OpenAI are pursuing general artificial intelligence. Apple's machine learning Apple seems to be "not interested" in the term artificial intelligence. At this year's WWDC, not a word was mentioned about "

Former Baidu Vice President Chu Ruisong takes over as head of Amazon Cloud Technology Greater China Former Baidu Vice President Chu Ruisong takes over as head of Amazon Cloud Technology Greater China Oct 09, 2023 pm 04:57 PM

According to news from this website on October 9, Matt Garman, senior vice president of global sales, marketing and services of Amazon Cloud Technology, announced internally a change in the leadership of Greater China. Chu Ruisong will succeed Zhang Wenyi as Amazon’s global vice president and executive of Amazon Cloud Technology Greater China. Director, Zhang Wenyi will have a new appointment. Chu Ruisong’s public information shows that before joining Amazon Cloud Technology, Chu Ruisong served as the vice president of Baidu Group for nearly four years. He was one of Baidu’s senior management teams and was responsible for leading Baidu’s Apollo smart car business. He resigned in July this year. Prior to that, he spent most of his career at SAP, holding various leadership positions in engineering, strategy, business development, etc., and was ultimately responsible for global R&D of S/4HANA cloud products, with leadership all over Germany.

See all articles