AMD CEO Lisa Su, Forrest Norrod chief of data center and Dan McNamara, head of servers, will speak at the Accelerated Data Center Premiere on Monday, November 8, to introduce some news. It has not been officially announced what the products will be, but according to leakers, AMD is going to introduce the CDNA 2 architecture and the accelerators built on it from the Instinct MI200 family (MI250 and MI250X), as well as some of the processor innovations based on Zen 3 – Either Milan-X (ie Zen 3 with V-cache for servers) or Trento (server Zen 3 s podporou Unified Memory Architecture).
Additional information on Instinct accelerators has emerged in connection with these plans. The first is more of a marketing one: AMD is abandoning the use of the term GPU in computing GPUs. It goes to accelerators. However, this change is not justified: CDNA 2 no longer contains any fixed units used to accelerate 3D graphics. Missing rasterizer, missing texturing units, missing ROP, missing accelerators for ray-tracing. Only the multimedia circuit for video acceleration remains. Some sources claimed that fixed units for 3D graphics were already missing from CDNA (1), while others said at least some were present. Anyway, with CDNA 2 they are a thing of the past.
|architecture||GCN 4||CDNA||CDNA 2||CDNA 3||Ampere|
|format||PCIe||PCIe||OAM||OAM||SXM4 / PCIe|
|CU / SM||60||120||220|
|You have. Colors||–||?||?||?||432|
|rate||1800 MHz||1502 MHz||≤1700 MHz||?||1410 MHz|
|↓↓↓ T(FL)OPS ↓↓↓|
|↑↑↑ T(FL)OPS ↑↑↑|
|32 GB||32 GB||128 GB||?||40 GB|
|HBM2||2,0 GHz||2,4 GHz||3,2 GHz||HBM3?||2,43 GHz|
|1024 GB/s||1229 GB/s||3277 GB/s||?||1555 GB/s|
|TDP||300 W||300 W||500W||~600W?||400 / 250 W|
|transistorů||13.2 billion||50.0 billion||>100 mld.?||?||54.2 billion|
|GPU area||331 mm²||750 mm²||?||?||826 mm²|
|process||7 nm||7 nm||7nm?||?||7 nm|
Newly (though still unofficially) “confirmed” values are highlighted in bold, more significant changes in red
The situation with support for the FP32 format is further clarified. CDNA is the first GPU-based architecture to natively support the full-speed FP64 format. However, the sources were inconsistent in terms of the speed of FP32 support. CDNA 2 supports packed-FP32, which means FP32 processing: FP64 2: 1, in other words the Instinct MI250X will reach up to 95.7 TFLOPS in FP32. The performance in FP64, FP32 and FP16 is therefore five times higher than that of the Nvidia A100, the performance in BF16 format is probably ten times.
Paradoxically, although all the essentials are known about the MI200 series accelerators, practically nothing has been leaked about the processors so far, so either AMD manages to keep these innovations under wraps better, or the emphasis of the action will be on accelerators.
Source: Diit.cz by diit.cz.
*The article has been translated based on the content of Diit.cz by diit.cz. If there is any problem regarding the content, copyright, please leave a report below the article. We will try to process as quickly as possible to protect the rights of the author. Thank you very much!
*We just want readers to access information more quickly and easily with other multilingual content, instead of information only available in a certain language.
*We always respect the copyright of the content of the author and always include the original link of the source article.If the author disagrees, just leave the report below the article, the article will be edited or deleted at the request of the author. Thanks very much! Best regards!