by Andy Patrizio

Nvidia shows off at Supercomputing 20

News Analysis

Nov 18, 20204 mins

Nvidia is under the covers for a slew of the world's fastest supercomputers; Intel and AMD talk future products at supercomputing conference.

Credit: MaxiPhoto / Getty Images

Nearly 70% of the 500 fastest supercomputers in the world, as announced at the Supercomputing 20 conference this week, are powered by Nvidia, including eight of the top 10.

Among them was one named Selene that Nvidia built itself and that debuted at Number 5 on the semi-annual TOP500 list of the fastest machines. With top-end systems requiring 10,000 or more CPUs and GPUs, they are enormously expensive, so government or research institutions own the majority of them.

That makes Selene all the more rare. It was built by and is based at Nvidia’s Santa Clara, California, headquarters. (It’s widely believed there are many supercomputers in private industry that are not reported for competitive reasons.)

Nvidia’s Big Showing

Also significant is that another Nvidia supercomputer, the DGX SuperPOD, took the top spot on the GREEN500 list, which measures the energy efficiency of the TOP500 systems. Four of the top five systems had Nvidia’s A100 Ampere GPU. Fujitsu’s Fugaku prototype, with just Arm processors and no DRAM, fell from first place to sixth.

This is big because GPUs have never been known for energy efficiency but now Nvidia has a new story to tell: performance and energy efficiency in one product.

Nvidia also introduced its Mellanox NDR 400Gbps InfiniBand family of interconnect products, which will be available in Q2 of 2021. The new lineup includes adapters, data processing units (DPUs), what Nvidia calls smart NICs, switches and cables.

This is not just a doubling the bandwidth per port. Mellanox is tripling the number of ports in a single device, which in theory will allow one switch platform to connect the entire data center. Mellanox said adopters of NDR 400 Gbps InfiniBand can see a network cost savings of 1.4x and power savings of up to 1.6x for datacenters.

AMD Claws Back

Good news and bad news for AMD. Its share of the top supercomputers that use its CPUs nearly doubled from 11 on the June TOP500 list to 21 on the current list. The growth came from new systems with second-generation EPYC processors, which come with an insane 64-cores.

On the down side, it can’t get any traction against Nvidia on the GPU side. Just one of the top 500 used AMD Radeon GPUs. Even Intel’s Xeon Phi, which is discontinued, had a better showing with three systems on the list.

But AMD is not giving up. On Monday it revealed its new Instinct MI100 server GPU, calling it the “world’s fastest HPC accelerator for scientific research,” with more than 10TFLOP for double-precision floating-point performance. AMD says it improves half-precision floating-point performance for AI training workloads by nearly seven times over the company’s previous generation of accelerators.

MI100 comes with a technology called Matrix Core, a part of AMD’s new CDNA architecture that is designed for HPC and machine learning workloads. Future iterations of the architecture will be used for its next-generation Instinct GPUs.

Intel’s Latest Try at GPUs

Intel is hoping the third time will be the charm for GPUs. It hired Raja Koudri, the designer of AMD’s Radeon GPU, to be its chief architect this time around so it certainly has no excuse for technical failure.

Its new GPU is called the Xe, proving once again Intel has the worst product branding department in the Silicon Valley. The biggest news regarding Xe was introduction of oneAPI Gold, the first productized version of Intel’s programming platform for the Xe GPU line.

OneAPI Gold plays into Intel’s XPU strategy of heterogeneous processing. Servers are much more than x86 chips. They have GPUs, FPGAs, AI accelerators, and network processors, and Intel has products in every category. OneAPI Gold can rule them all, allowing developers to write one set of highly optimized code and have it run optimally on any processor.

Intel is promoting oneAPI as an open standard but it’s made for Intel’s architecture. So I won’t hold my breath for AMD or Nvidia to adopt it any time soon. But for anyone all-in with Intel, it could do what CUDA did for Nvidia.

Xe processors are still in the works, with the high-end version, codenamed Ponte Vecchio, due next year. OneAPI Gold is said to ship next month.

Data Center

by Andy Patrizio

Andy Patrizio is a freelance journalist based in southern California who has covered the computer industry for 20 years and has built every x86 PC he’s ever owned, laptops not included.

Andy writes the Data Center Explorer blog for Network World. His work has appeared in a variety of publications, including Tom's Guide, Wired, Dr. Dobbs Journal, Tech Target, Business Insider, and Data Center Knowledge. Earlier in his career, he held editorial positions at IT publications like InternetNews, PC Week and InformationWeek.

Andy holds a BA in Journalism from the University of Rhode Island.