InfiniBand Networks and Fat-tree Topology in HPC/AI: Accelerating High-Performance Computing and Artificial Intelligence

2024-09-12 10:45

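HPC and AI are widely used, and the performance of an HPC cluster depends on its CPU/GPU resources and on the network that connects them.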


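Common topologies for an HPC InfiniBand (IB) network include:
Fat tree: a multi-root tree.
2D mesh: each node is connected to four other nodes, in the positive and negative directions of the X and Y axes.
3D mesh: each node is connected to six other nodes, in the positive and negative directions of the X, Y, and Z axes.
2D/3D torus: a 2D/3D mesh whose X, Y, and Z ends are wrapped around and connected back to the first node.
Dragonfly+: switches are arranged in groups, and the spine switches of each group connect to the spine switches of the other groups.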
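The difference between a mesh and a torus is easiest to see by listing the direct neighbors of a node. The following is a minimal sketch, assuming nodes are addressed by integer coordinates; the function name `neighbors` and its `wrap` parameter are hypothetical and chosen only for illustration.

```python
def neighbors(coord, dims, wrap=False):
    """Return the nodes directly linked to `coord`.

    coord: tuple of integer coordinates, e.g. (x, y) or (x, y, z)
    dims:  tuple with the size of each axis
    wrap:  False models a mesh, True models a torus (ends wrapped around)
    """
    result = []
    for axis, size in enumerate(dims):
        for step in (-1, 1):  # negative and positive direction on this axis
            pos = coord[axis] + step
            if wrap:
                pos %= size  # torus: the last node links back to the first
            elif not 0 <= pos < size:
                continue  # mesh: no link past the edge
            candidate = coord[:axis] + (pos,) + coord[axis + 1:]
            # Skip duplicates that appear on very small wrapped axes.
            if candidate != coord and candidate not in result:
                result.append(candidate)
    return result


if __name__ == "__main__":
    print(len(neighbors((1, 1), (4, 4))))             # interior node of a 2D mesh
    print(len(neighbors((0, 0), (4, 4))))             # corner node of a 2D mesh
    print(len(neighbors((0, 0), (4, 4), wrap=True)))  # same node in a 2D torus
```

In a mesh, edge and corner nodes have fewer links than interior nodes; in a torus every node has the same number of links.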

1.1 Fat Tree

Fat-tree is the most widely used topology in HPC/AI clusters.

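A fat-tree cluster typically uses the same bandwidth for all links and, in most cases, the same number of ports in all switches.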

1.2 Fat-Tree

The rules for designing a fat-tree cluster are as follows.

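A non-blocking cluster must be balanced: the same number of links must connect a Level-2 (L2) switch to every Level-1 (L1) switch. Whether over-subscription is acceptable depends on the HPC application and its network requirements.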

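If the L2 switch is a director switch (a switch with leaf and spine cards), all links from an L1 switch to that L2 switch must be distributed evenly among the leaf cards. For example, six links between an L1 and an L2 switch can be distributed as 1:1:1:1:1:1, 2:2:2, 3:3, or 6 on a single leaf card. They must never be mixed unevenly, for example 4:2 or 5:1.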
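The even-distribution rule can be checked mechanically. The sketch below is only an illustration: the function names are hypothetical, and it assumes that leaf cards carrying no link from the L1 switch are simply ignored.

```python
def is_even_distribution(links_per_leaf_card):
    """True if every leaf card that is used carries the same number of links."""
    used = [n for n in links_per_leaf_card if n > 0]  # assumption: unused cards are ignored
    return len(used) > 0 and len(set(used)) == 1


def even_distributions(total_links):
    """List every even split as (number_of_leaf_cards, links_per_card)."""
    return [
        (total_links // per_card, per_card)
        for per_card in range(1, total_links + 1)
        if total_links % per_card == 0
    ]


if __name__ == "__main__":
    print(even_distributions(6))         # the even splits of six links
    print(is_even_distribution([2, 2, 2]))
    print(is_even_distribution([4, 2]))  # a mixed split
```

For six links, the splits returned correspond to 1:1:1:1:1:1, 2:2:2, 3:3, and 6.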

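Do not create routes that travel up the tree, back down, and then up again. Such routes create credit loops, which can show up as traffic deadlocks in the cluster. Credit loops are avoided by using a routing algorithm such as up-down.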
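The up-down constraint can be stated as a simple check on a route: once a route has taken a downward hop, it may not take an upward hop again. The sketch below assumes a route is given as the list of tree levels it visits (0 for a host, 1 for an L1 switch, 2 for an L2 switch); this representation and the name `is_up_down` are illustrative assumptions, not part of any routing engine.

```python
def is_up_down(levels):
    """True if the route climbs first and then only descends.

    levels: tree level of each hop in order, e.g. [0, 1, 2, 1, 0]
    """
    going_down = False
    for prev, cur in zip(levels, levels[1:]):
        if cur > prev:        # upward hop
            if going_down:    # up after down: the pattern behind credit loops
                return False
        elif cur < prev:      # downward hop
            going_down = True
    return True


if __name__ == "__main__":
    print(is_up_down([0, 1, 2, 1, 0]))        # host -> L1 -> L2 -> L1 -> host
    print(is_up_down([0, 1, 2, 1, 2, 1, 0]))  # up, down, then up again
```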

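Try to use 32-port (NDR) or 40-port (HDR) switches at L1 and director-class switches at L2. If this is not possible, contact 025-86595105 or 15895983233, or see http://www.njwdr.com/, to make sure the cluster design contains no credit loops.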

1.3

CLOS-3

1.3.1 80 nodes (HDR)

Built with 1U QM8700 switches.

image.png


1.3.2 800 nodes (HDR)

Built with 1U QM8700 switches and CS8500 director switches.

image.png


1.3.3 1024 nodes (NDR)

image.png
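As a rough aid for sizing examples like these, the following sketch estimates the switch count of a non-blocking two-level fat-tree built from identical fixed-port switches. It assumes half of each L1 switch's ports face the nodes and half face L2, and that every L1 switch has the same number of links to every L2 switch. The function name and the returned keys are hypothetical, and director switches are not modeled.

```python
import math


def two_level_fat_tree(nodes, ports_per_switch):
    """Estimate a balanced, non-blocking two-level fat-tree.

    Assumption: all switches have `ports_per_switch` ports, and each L1
    switch uses half of them for nodes and half for uplinks.
    """
    half = ports_per_switch // 2
    max_nodes = ports_per_switch * half
    if nodes < 1 or nodes > max_nodes:
        raise ValueError(f"two levels support 1 to {max_nodes} nodes")
    l1 = math.ceil(nodes / half)
    # Each L2 switch can terminate the uplinks of at most two L1 switches.
    l2 = math.ceil(l1 / 2)
    # Balance rule: the uplinks of an L1 switch must split evenly over L2.
    while half % l2 != 0:
        l2 += 1
    return {
        "l1_switches": l1,
        "l2_switches": l2,
        "links_per_l1_l2_pair": half // l2,
        "total_switches": l1 + l2,
    }


if __name__ == "__main__":
    print(two_level_fat_tree(80, 40))   # 80 nodes on 40-port switches
    print(two_level_fat_tree(648, 36))  # 648 nodes on 36-port switches
```

Under these assumptions, 80 nodes on 40-port switches need 4 L1 and 2 L2 switches, and the largest two-level tree of 40-port switches holds 800 nodes.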

1.4

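Cluster performance is measured in FLOPS. The theoretical peak of one node is:

Node FLOPS = CPU clock speed (Hz) x number of cores per CPU x CPU instructions per cycle x number of CPUs per node

For example, for a dual-CPU Intel server based on Intel E5-2690 (2.9 GHz, 8-core) CPUs: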

2.9 x 8 x 8 x 2 = 371.2 GFLOPS

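Note that E5-2600 series CPUs execute 8 instructions per cycle.

To get the peak of the HPC cluster, multiply the node figure by the number of nodes. A 72-node fat-tree (using 6 switches) has:

371.2 GFLOPS x 72 = 26,726 GFLOPS = ~27 TFLOPS

A 648-node fat-tree (using 54 switches) has:

371.2 GFLOPS x 648 = 240,537 GFLOPS = ~241 TFLOPS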
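The arithmetic above can be reproduced with a few lines of code. This is a minimal sketch; the function and parameter names are chosen here for illustration.

```python
def node_peak_gflops(clock_ghz, cores_per_cpu, instructions_per_cycle, cpus_per_node):
    """Theoretical peak of one node in GFLOPS (clock given in GHz)."""
    return clock_ghz * cores_per_cpu * instructions_per_cycle * cpus_per_node


def cluster_peak_gflops(node_gflops, nodes):
    """Theoretical peak of the whole cluster in GFLOPS."""
    return node_gflops * nodes


if __name__ == "__main__":
    # Dual-CPU server with Intel E5-2690 CPUs: 2.9 GHz, 8 cores, 8 instructions per cycle.
    node = node_peak_gflops(2.9, 8, 8, 2)
    print(f"node: {node:.1f} GFLOPS")
    for nodes in (72, 648):
        peak = cluster_peak_gflops(node, nodes)
        print(f"{nodes} nodes: {peak:,.0f} GFLOPS = ~{peak / 1000:.0f} TFLOPS")
```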

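A fat-tree larger than 648 nodes requires at least 3 levels of hierarchy in the HPC cluster. The calculation above covers CPUs only; GPUs must be accounted for separately.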

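Actual performance is lower than the theoretical peak and depends on the interconnect. When 1 Gb Ethernet (GbE) connects the nodes, the expected performance loss is about 50%; with 10GbE it is about 30%. InfiniBand delivers about 90% efficiency, a loss of only about 10% (see www.top500.org).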
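To see what these percentages mean for a concrete cluster, the sketch below scales a theoretical peak by an interconnect efficiency. It reads the figures in the paragraph above as a 50% loss for 1GbE, a 30% loss for 10GbE, and 90% efficiency for InfiniBand; that reading, the table `EFFICIENCY`, and the function name are assumptions made for illustration, and real efficiency varies by application.

```python
# Assumed efficiency per interconnect: 1 - loss for the Ethernet rows,
# and the quoted 90% for InfiniBand.
EFFICIENCY = {
    "1GbE": 0.50,
    "10GbE": 0.70,
    "InfiniBand": 0.90,
}


def sustained_gflops(peak_gflops, interconnect):
    """Scale a theoretical peak by the assumed efficiency of the interconnect."""
    return peak_gflops * EFFICIENCY[interconnect]


if __name__ == "__main__":
    peak = 371.2 * 72  # theoretical peak of the 72-node example, in GFLOPS
    for name in EFFICIENCY:
        print(f"{name}: ~{sustained_gflops(peak, name) / 1000:.1f} TFLOPS")
```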

image.png

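InfiniBand is a common interconnect for HPC clusters. Using InfiniBand gives an HPC cluster capabilities such as:

Remote Direct Memory Access (RDMA)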

2