Nearly 70% of the 500 fastest supercomputers in the world, as announced at the Supercomputing 20 conference this week, are powered by Nvidia, including eight of the top 10.
Among them is Selene, a machine Nvidia built itself, which debuted at Number 5 on the semi-annual TOP500 list of the fastest machines. Top-end systems require 10,000 or more CPUs and GPUs and are enormously expensive, so government or research institutions own the majority of them.
That makes Selene all the rarer. It was built by Nvidia and is based at the company’s Santa Clara, California, headquarters. (It’s widely believed there are many supercomputers in private industry that are not reported for competitive reasons.)

Nvidia’s Big Showing
Also significant is that another Nvidia supercomputer, the DGX SuperPOD, took the top spot on the GREEN500 list, which measures the energy efficiency of the TOP500 systems. Four of the top five systems had Nvidia’s A100 Ampere GPU. Fujitsu’s Fugaku prototype, with just Arm processors and no DRAM, fell from first place to sixth.
This is big because GPUs have never been known for energy efficiency, but now Nvidia has a new story to tell: performance and energy efficiency in one product.
Nvidia also introduced its Mellanox NDR 400 Gbps InfiniBand family of interconnect products, which will be available in Q2 of 2021. The new lineup includes adapters, data processing units (DPUs, Nvidia’s term for smart NICs), switches and cables.
This is not just a doubling of the bandwidth per port. Mellanox is also tripling the number of ports in a single device, which in theory will allow one switch platform to connect the entire data center. Mellanox said adopters of NDR 400 Gbps InfiniBand can see network cost savings of 1.4x and power savings of up to 1.6x for data centers.

AMD Claws Back
There is good news and bad news for AMD.
The number of top supercomputers that use its CPUs nearly doubled, from 11 on the June TOP500 list to 21 on the current list. The growth came from new systems with second-generation EPYC processors, which come with an insane 64 cores.
On the downside, it can’t get any traction against Nvidia on the GPU side. Just one of the top 500 used AMD Radeon GPUs. Even Intel’s Xeon Phi, which is discontinued, had a better showing with three systems on the list.
But AMD is not giving up. On Monday it revealed its new Instinct MI100 server GPU, calling it the “world’s fastest HPC accelerator for scientific research,” with more than 10 TFLOPS of double-precision floating-point performance. AMD says it improves half-precision floating-point performance for AI training workloads by nearly seven times over the company’s previous generation of accelerators.
MI100 comes with a technology called Matrix Core, a part of AMD’s new CDNA architecture that is designed for HPC and machine learning workloads. Future iterations of the architecture will be used for its next-generation Instinct GPUs.

Intel’s Latest Try at GPUs
Intel is hoping the third time will be the charm for GPUs. It hired Raja Koduri, the designer of AMD’s Radeon GPU, to be its chief architect this time around, so it certainly has no excuse for technical failure.
Its new GPU is called the Xe, proving once again that Intel has the worst product branding department in Silicon Valley. The biggest news regarding Xe was the introduction of oneAPI Gold, the first productized version of Intel’s programming platform for the Xe GPU line.
OneAPI Gold plays into Intel’s XPU strategy of heterogeneous processing. Servers are much more than x86 chips. They have GPUs, FPGAs, AI accelerators, and network processors, and Intel has products in every category.
OneAPI Gold is meant to rule them all, allowing developers to write one set of highly optimized code and have it run well on any of those processors.
Intel is promoting oneAPI as an open standard, but it’s made for Intel’s architecture, so I won’t hold my breath for AMD or Nvidia to adopt it any time soon. But for anyone all-in with Intel, it could do what CUDA did for Nvidia.
Xe processors are still in the works, with the high-end version, codenamed Ponte Vecchio, due next year. OneAPI Gold is said to ship next month.
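
For readers who want to put the headline numbers in perspective, here is a minimal Python sketch of the arithmetic behind two of the claims above: AMD’s jump from 11 to 21 CPU-based systems, and Mellanox’s quoted savings factors. The function names are made up for illustration, and the reading of “1.4x savings” as “the old cost is 1.4 times the new cost” is an assumption, not something Mellanox spelled out.

```python
# Figures quoted in the article; everything else is derived arithmetic.
TOP500_SIZE = 500
AMD_CPU_SYSTEMS_JUNE = 11
AMD_CPU_SYSTEMS_NOV = 21


def share(systems, total=TOP500_SIZE):
    """Return the fraction of the list that `systems` machines represent."""
    return systems / total


def reduction_from_factor(factor):
    """Convert an 'N-x savings' factor into a fractional reduction.

    Assumption (not stated by the vendor): '1.4x savings' means the old
    cost is 1.4 times the new cost, so the new cost is 1/1.4 of the old.
    """
    return 1 - 1 / factor


# Ratio of November systems to June systems.
amd_growth = AMD_CPU_SYSTEMS_NOV / AMD_CPU_SYSTEMS_JUNE

print(f"AMD CPU systems grew {amd_growth:.2f}x")
print(
    f"AMD share of the list: {share(AMD_CPU_SYSTEMS_JUNE):.1%} "
    f"-> {share(AMD_CPU_SYSTEMS_NOV):.1%}"
)
print(f"Network cost reduction at 1.4x: {reduction_from_factor(1.4):.1%}")
print(f"Power reduction at 1.6x: {reduction_from_factor(1.6):.1%}")
```

Under those assumptions, AMD’s “nearly doubled” works out to a factor of about 1.91 while its share of the list still sits in the low single digits, and the 1.6x power figure corresponds to a reduction of 37.5%.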