Nvidia announces a 2023 launch for an HPC CPU named Grace

At its GPU Technology Conference, Nvidia talks about its first data-center CPU that is meant to fill a hole in its server processor line, but details are scant.


Nvidia kicked off its GPU Technology Conference (GTC) 2021 with a bang: A new CPU for high performance computing (HPC) clients--its first-ever data-center CPU--called Grace.

Based on the Arm Neoverse architecture, Nvidia claims Grace will deliver up to 10 times the performance of today's fastest servers on complex artificial-intelligence and HPC workloads.

But that’s comparing a future product against today’s hardware. Grace won’t ship until 2023, and in those two years competitors will undoubtedly up their game, too. Then again, no one has ever accused CEO Jen-Hsun Huang of being subdued.

Nvidia made a point of saying that Grace is not intended to compete head-to-head against Intel's Xeon and AMD's EPYC processors. Instead, Grace is a niche product, designed specifically to be tightly coupled with Nvidia's GPUs to remove bottlenecks in complex AI and HPC applications.

Nvidia is in the process of acquiring Arm Holdings, a deal that should close later this year if all objections are overcome.

"Leading-edge AI and data science are pushing today’s computer architecture beyond its limits—processing unthinkable amounts of data," said Huang. "Using licensed Arm IP, Nvidia has designed Grace as a CPU specifically for giant-scale AI and HPC. Coupled with the GPU and DPU, Grace gives us the third foundational technology for computing, and the ability to re-architect the data center to advance AI. Nvidia is now a three-chip company."

Nvidia does have server offerings, the DGX series, which use AMD EPYC CPUs (you didn’t think it was going to use Intel, did you?) to boot the system and coordinate the Ampere GPUs. EPYC is great for running databases, but it’s a general-purpose compute processor, lacking the kind of high-speed I/O and deep-learning optimizations that Nvidia needs.

Nvidia didn’t give a lot of detail, except to say Grace would be built on a future version of the Arm Neoverse core using a 5-nanometer manufacturing process, which means it will be made by TSMC. Grace will also use Nvidia’s homegrown NVLink high-speed interconnect between the CPU and GPU. A new version planned for 2023 will offer over 900GB/s of bandwidth between the CPU and GPU. That’s much faster than the PCI Express links AMD uses for CPU-GPU communication.
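To put that bandwidth gap in rough perspective, here is a back-of-the-envelope comparison. The 900GB/s figure is from Nvidia's announcement; the PCIe numbers are approximate per-direction rates for a x16 link and are assumptions, not figures from the article:

```python
# Rough CPU-GPU interconnect bandwidth comparison (GB/s, one direction).
# NVLink figure: Nvidia's 2023 claim. PCIe figures: approximate x16 rates.
links_gbps = {
    "NVLink (2023, per Nvidia)": 900,
    "PCIe 4.0 x16 (approx.)": 32,
    "PCIe 5.0 x16 (approx.)": 64,
}

baseline = links_gbps["PCIe 4.0 x16 (approx.)"]
for name, bandwidth in links_gbps.items():
    ratio = bandwidth / baseline
    print(f"{name}: {bandwidth} GB/s ({ratio:.1f}x PCIe 4.0 x16)")
```

By this rough math, the promised NVLink rate is more than an order of magnitude beyond a PCIe 4.0 x16 link, which is the kind of gap Nvidia is pointing at when it talks about removing CPU-GPU bottlenecks.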

Two supercomputing customers

Even though Grace isn’t shipping until 2023, Nvidia already has two supercomputer customers for the processor. The Swiss National Supercomputing Centre (CSCS) and Los Alamos National Laboratory announced today that they’ll be ordering supercomputers based on Grace. Both systems will be built by HPE’s Cray subsidiary (who else?) and are set to come online in 2023.

CSCS’s system, called Alps, will replace its current Piz Daint system, a Xeon and Nvidia P100 cluster. CSCS claims Alps will offer 20 exaflops of AI performance, which would be incredible if it delivers; right now the best we have is Japan’s Fugaku, at roughly one exaflop.

Arm’s stumbles in the data center

Overall, this is a smart move on Nvidia’s part, because general-purpose Arm server processors have not done well. Nvidia has its own failure in the data-center CPU market: a decade ago it launched Project Denver, but it never got out of the labs. Denver was a general-purpose CPU, whereas Grace is highly vertical and specialized.


Copyright © 2021 IDG Communications, Inc.
