For the last two years, in January, I have published my predictions about the new AI silicon I expect in the coming year. Like the weatherman, I’ve gotten some details wrong but I’ve been reasonably accurate, at least directionally. So, here we go again with a look at likely 2021 advancements and possible failures. I base none of these predictions on internal or NDA information; these are just my opinions.
Predictions for 2021 AI hardware
- NVIDIA will retain its dominant position in data center AI. In addition to commanding nearly 100% of the training market, NVIDIA will gain more traction in cloud inference processing due to its new Multi-Instance GPU. The MIG capability on the NVIDIA A100 provides CSPs with more flexibility and can reduce hardware provisioning costs. NVIDIA’s Automotive business will begin to see growth late in the year.
- NVIDIA will successfully acquire Arm, leveraging the licensing business model to monetize technologies Jensen Huang does not care to productize.
- Even if I am wrong concerning the Arm acquisition, NVIDIA will announce an Arm-based server to enable tight coupling of CPUs and GPUs. (The server may not ship until 2022.)
- The lure of an in-house chip design will compel most “Super Seven” hyperscale data centers to launch proprietary AI inference processors. These chips will be tailored for specific use cases and business needs, primarily impacting Intel Xeon, and will create tremendous headwinds for startups.
- The Qualcomm AI100 platform’s performance and power efficiency will secure at least one hyperscale data center win, despite the trend noted above. Should this fail to transpire, Qualcomm could shut down its data center efforts.
- Google will launch the TPU4, which was teased in July when the company published MLPerf benchmarks that more than doubled the previous design’s results. Google will also announce a second generation of the Edge TPU; the edge is too significant for Google to miss.
- Intel will land at least one major design win for Habana Gaudi, leveraging the AWS win announced in December. If this occurs, Gaudi will arguably earn the pole position to compete with NVIDIA.
- Intel will either release an updated Habana Goya inference chip or will quietly let Goya die, focusing instead on Xeon processors. I’d bet on the latter.
- Graphcore will announce at least one significant design win, perhaps Microsoft. Others that look promising include Tenstorrent, Blaize, SambaNova and Groq. However, 2021 is a make-or-break year for many startups.
- The last one is an easy one: someone will buy someone else. Seriously, startups with promising technology, such as Cerebras, SambaNova, Tenstorrent, Blaize and Graphcore, may look expensive but are quite valuable. Companies like AMD, NVIDIA, Facebook and Google can afford to pay up for a platform that may represent a durable breakthrough.
In 2020, the AI Cambrian Explosion moved from the drawing board into data centers and edge devices. I expect 2021 to usher in scores of new chips to accelerate AI, from startups and large semiconductor vendors alike. The edge AI landscape will fill with dozens of companies offering platforms tailored to specific models at various performance levels, power envelopes and costs. In contrast, the data center will remain the domain of the largest semis, with NVIDIA in the lead and a few others nipping at Jensen’s heels. I will review this list of predictions next December to see how I did.
Buckle up: 2021 looks to be one heck of a ride!