Covering Disruptive Technology Powering Business in The Digital Age

Home > Archives > News > Huawei and PCL develop Cloud Brain II, embarking on a new chapter of 1000-scale AI PFLOPS clusters
Huawei and PCL develop Cloud Brain II, embarking on a new chapter of 1000-scale AI PFLOPS clusters
December 4, 2019 News


On November 29, 2019, Huawei and Peng Cheng Laboratory (PCL) presented together Peng Cheng Brain Cloud II Phase 1, officially launching the journey to clusters of 1000 petaFLOPS scale AI (PFLOPS). This sets a new milestone in the field of scientific research for the Kunpeng computer industry. The clusterHuawei’s IA Atlas 900 operates at the base of Cloud Brain II, powered by Huawei’s Kunpeng and Ascend processors. Atlas 900 infuses robust computing power into Cloud Brain II, supporting basic AI research and exploration such as computer vision, natural language, autonomous driving, smart transportation, and smart healthcare. The computing power of the Peng Cheng Cloud Brain is currently 100 PFLOPS, planned to scale to 1000 PFLOPS and higher next year.

“In September, Huawei embarked on the dual-engine computing strategy with Kunpeng and Ascend. Inspired by this strategy, we committed to making the computing power available to the world. We also launched Atlas 900, the AI training cluster . world’s fastest, “said Huawei Senior Vice President and Huawei Cloud & AI Products and Services President Hou Jinlong.

“Today, we are proud to see the Atlas 900 chosen for the Peng Cheng Cloud Brain II project. It lays the foundation for Cloud Brain II. Cloud Brain II is an industry-leading AI research platform. PCL has brought together many talents. AI will join PCL in leading cutting-edge scientific research into a smart world, “Hou added.

Hou further said: “We are currently developing Cloud Brain II Phase 1. I believe that with our joint effort this will pave the way for Cloud Brain II on a 1000 PFLOPS scale in the near future. We are confident that it will become the world’s leading AI research platform. ”

Huawei’s Intelligent Computing Business Department chairman Michael Ma said: “Huawei develops the Ascend processor-based AI computing platform Atlas, providing a broad portfolio of modules, boards, edge stations, AI servers and clusters . Our AI infrastructure for all scenarios covers the cloud-edge-device (cloud-edge-device), supporting the inference flow and complete training for deep learning. ”

“Our flagship Atlas product, Atlas 900, stands as the pinnacle of AI computing in the world. The combination of Atlas 900 and Cloud Brain II will embark on a new chapter for 1000 PFLOPS scale AI clusters and unleash the magnificent power of computing to drive smart transformation faster across industries, “said Ma.

The Peng Cheng Cloud Brain is an essential technological instrument in the field of AI. It is a basic research platform for exploring the frontier of AI technology. It currently boasts 100 PFLOPS AI computing power and is expected to reach the 1000 PFLOPS scale in the Cloud Brain II project next year.

Cloud Brain II is jointly built by PCL and Huawei. Operating on Huawei’s Kunpeng and Ascend processors, the Atlas 900 AI cluster provides superior computing power. PCL develops technologies for Cloud Brain in 1000 PFLOPS.

The Atlas 900 AI cluster has inherited Huawei’s technology know-how from over a decade. Composed of thousands of Ascend 910 AI processors, the Atlas 900 completes the training of a ResNet image classification model in 59.8 seconds, 10 seconds faster than the previous world record, with the same accuracy. Atlas 900’s powerful computing power makes the difference in scientific research and technological innovation, such as astronomical exploration, weather forecasting, autonomous driving and oil exploration. Atlas 900 Highlights:

  • Powerful Computing: Combining thousands of Ascend 910 AI processors, the Atlas 900 delivers 256–1024 PFLOPS at half precision (FP16), which equals the computing power of 500,000 PCs. The SoC design integrates AI computing, general purpose computing. and I / O functionality to effectively improve training efficiency.
  • High-speed clusternetwork : It supports three types of high-speed network interfaces: Huawei Cache Coherence System (HCCS), PCIe 4.0, and 100G RoCE, reducing gradient synchronization latency from 10% to 70% for one jump. in model training efficiency. It leverages an innovative iLossless intelligent switching algorithm to enable real-time learning and training of network-wide traffic, achieving zero packet loss and end-to-end microsecond latency.
  • Complete heat dissipation: Atlas 900 uses a cabinet-level adiabatic system, achieving a net cooling coefficient of over 95% and a system power usageefficiency (PUE ) of less than 1, 1 (an ideal PUE is 1.0).

So far, based on Ascend 910 and 310 AI processors, Huawei has launched the IA Atlas 900 cluster , Atlas 800 IA server, Atlas 500 IA edge station, Atlas 300 IA accelerator card and accelerator module. IA Atlas 200. Atlas’s holistic portfolio provides powerful computing for training and inference in all cloud-edge-device scenarios.

Going forward, Huawei will continue to increase investment and innovation in infrastructure such as processors, operating systems, databases for the Kunpeng computing industry and provide AI computing to Peng Cheng Cloud Brain to together, make AI technology faster for a broader scope of applications.