HLRS - Services - Parallel Computing - Programming Models - MPI (original) (raw)

size

beff

beff/size

bandwidth per PE at Lmax

PingPong latency

PingPong bandwidth

maximal message length Lmax

#nodes * #PEs

summary

full protocol

MByte/s

MByte/s

MByte/s

microsec

MByte/s

MByte

explicitly allocated PEs, i.e. contiguous ranks on each node:

24

1805.675

75.236

400.133

11.728

954.936

8.000

3 * 8

result_3.3_SR8000_1GB_003nodes_024PEs_c.shrt

result_3.3_SR8000_1GB_003nodes_024PEs_c.gz

18

1565.703

86.983

427.860

11.525

1202.586

8.000

3 * 6

result_3.3_SR8000_1GB_003nodes_018PEs_c.shrt

result_3.3_SR8000_1GB_003nodes_018PEs_c.gz

12

1257.728

104.811

489.445

11.475

1204.480

8.000

3 * 4

result_3.3_SR8000_1GB_003nodes_012PEs_c.shrt

result_3.3_SR8000_1GB_003nodes_012PEs_c.gz

6

758.508

126.418

477.788

11.437

1224.976

8.000

3 * 2

result_3.3_SR8000_1GB_003nodes_006PEs_c.shrt

result_3.3_SR8000_1GB_003nodes_006PEs_c.gz

3

396.829

132.276

447.107

23.307

791.866

8.000

3 * 1

result_3.3_SR8000_1GB_003nodes_003PEs_c.shrt

result_3.3_SR8000_1GB_003nodes_003PEs_c.gz

16

1530.664

95.667

411.060

11.811

969.781

8.000

2 * 8

result_3.3_SR8000_1GB_002nodes_016PEs_c.shrt

result_3.3_SR8000_1GB_002nodes_016PEs_c.gz

12

1287.352

107.279

439.721

11.527

1208.742

8.000

2 * 6

result_3.3_SR8000_1GB_002nodes_012PEs_c.shrt

result_3.3_SR8000_1GB_002nodes_012PEs_c.gz

8

989.464

123.683

504.567

11.521

1213.191

8.000

2 * 4

result_3.3_SR8000_1GB_002nodes_008PEs_c.shrt

result_3.3_SR8000_1GB_002nodes_008PEs_c.gz

6

766.605

127.768

499.667

11.555

1222.560

8.000

2 * 3

result_3.3_SR8000_1GB_002nodes_006PEs_c.shrt

result_3.3_SR8000_1GB_002nodes_006PEs_c.gz

4

574.523

143.631

519.596

11.484

1226.043

8.000

2 * 2

result_3.3_SR8000_1GB_002nodes_004PEs_c.shrt

result_3.3_SR8000_1GB_002nodes_004PEs_c.gz

2

306.570

153.285

521.074

22.923

799.677

8.000

2 * 1

result_3.3_SR8000_1GB_002nodes_002PEs_c.shrt

result_3.3_SR8000_1GB_002nodes_002PEs_c.gz

8

1218.994

152.374

455.575

11.570

916.839

8.000

1 * 8

result_3.3_SR8000_1GB_001nodes_008PEs_c.shrt

result_3.3_SR8000_1GB_001nodes_008PEs_c.gz

7

1118.625

159.804

488.660

11.528

1207.508

8.000

1 * 7

result_3.3_SR8000_1GB_001nodes_007PEs_c.shrt

result_3.3_SR8000_1GB_001nodes_007PEs_c.gz

6

974.033

162.339

506.776

11.361

1211.698

8.000

1 * 6

result_3.3_SR8000_1GB_001nodes_006PEs_c.shrt

result_3.3_SR8000_1GB_001nodes_006PEs_c.gz

5

848.999

169.800

515.719

11.506

1211.176

8.000

1 * 5

result_3.3_SR8000_1GB_001nodes_005PEs_c.shrt

result_3.3_SR8000_1GB_001nodes_005PEs_c.gz

4

714.477

178.619

527.187

11.321

1216.537

8.000

1 * 4

result_3.3_SR8000_1GB_001nodes_004PEs_c.shrt

result_3.3_SR8000_1GB_001nodes_004PEs_c.gz

3

541.446

180.482

537.551

11.390

1222.115

8.000

1 * 3

result_3.3_SR8000_1GB_001nodes_003PEs_c.shrt

result_3.3_SR8000_1GB_001nodes_003PEs_c.gz

2

410.553

205.276

552.597

11.462

1230.266

8.000

1 * 2

result_3.3_SR8000_1GB_001nodes_002PEs_c.shrt

result_3.3_SR8000_1GB_001nodes_002PEs_c.gz

default round-robin order, i.e. ranks 0,3,6,... are on node 0, ranks 1,4,7,... on node 1, ranks 2,5,8,... on node 2:

24

915.478

38.145

110.275

23.077

741.535

8.000

3 * 8

result_3.3_SR8000_1GB_003nodes_024PEs.shrt

result_3.3_SR8000_1GB_003nodes_024PEs.gz

24

922.392

38.433

110.291

23.302

741.305

8.000

3 * 8

result_3.3_SR8000_1GB_003nodes_024PEs_b.shrt

result_3.3_SR8000_1GB_003nodes_024PEs_b.gz

18

895.539

49.752

138.199

23.185

752.172

8.000

3 * 6

result_3.3_SR8000_1GB_003nodes_018PEs.shrt

result_3.3_SR8000_1GB_003nodes_018PEs.gz

12

819.624

68.302

221.940

23.075

773.075

8.000

3 * 4

result_3.3_SR8000_1GB_003nodes_012PEs.shrt

result_3.3_SR8000_1GB_003nodes_012PEs.gz

6

618.331

103.055

361.906

23.158

785.927

8.000

3 * 2

result_3.3_SR8000_1GB_003nodes_006PEs.shrt

result_3.3_SR8000_1GB_003nodes_006PEs.gz

3

429.108

143.036

464.218

22.883

797.131

8.000

3 * 1

result_3.3_SR8000_1GB_003nodes_003PEs.shrt

result_3.3_SR8000_1GB_003nodes_003PEs.gz

16

775.710

48.482

115.840

36.781

103.655

8.000

2 * 8

result_3.3_SR8000_1GB_002nodes_016PEs.shrt

result_3.3_SR8000_1GB_002nodes_016PEs.gz

16

768.112

48.007

115.286

29.816

128.435

8.000

2 * 8

result_3.3_SR8000_1GB_002nodes_016PEs_b.shrt

result_3.3_SR8000_1GB_002nodes_016PEs_b.gz

12

768.140

64.012

158.576

23.544

770.873

8.000

2 * 6

result_3.3_SR8000_1GB_002nodes_012PEs.shrt

result_3.3_SR8000_1GB_002nodes_012PEs.gz

8

659.282

82.410

230.119

23.220

784.569

8.000

2 * 4

result_3.3_SR8000_1GB_002nodes_008PEs.shrt

result_3.3_SR8000_1GB_002nodes_008PEs.gz

8

680.633

85.079

230.015

23.430

775.896

8.000

2 * 4

result_3.3_SR8000_1GB_002nodes_008PEs_b.shrt

result_3.3_SR8000_1GB_002nodes_008PEs_b.gz

6

583.527

97.254

278.203

23.182

789.365

8.000

2 * 3

result_3.3_SR8000_1GB_002nodes_006PEs.shrt

result_3.3_SR8000_1GB_002nodes_006PEs.gz

4

495.397

123.849

390.279

23.160

792.499

8.000

2 * 2

result_3.3_SR8000_1GB_002nodes_004PEs.shrt

result_3.3_SR8000_1GB_002nodes_004PEs.gz

2

306.623

153.311

522.781

23.004

799.908

8.000

2 * 1

result_3.3_SR8000_1GB_002nodes_002PEs.shrt

result_3.3_SR8000_1GB_002nodes_002PEs.gz

8

1245.136

155.642

470.941

11.650

970.791

8.000

1 * 8

result_3.3_SR8000_1GB_001nodes_008PEs.shrt

result_3.3_SR8000_1GB_001nodes_008PEs.gz

6

971.215

161.869

505.246

11.563

1213.715

8.000

1 * 6

result_3.3_SR8000_1GB_001nodes_006PEs.shrt

result_3.3_SR8000_1GB_001nodes_006PEs.gz

4

706.801

176.700

526.974

11.521

1205.780

8.000

1 * 4

result_3.3_SR8000_1GB_001nodes_004PEs.shrt

result_3.3_SR8000_1GB_001nodes_004PEs.gz

2

410.471

205.236

552.259

11.549

1226.577

8.000

1 * 2

result_3.3_SR8000_1GB_001nodes_002PEs.shrt

result_3.3_SR8000_1GB_001nodes_002PEs.gz

explicitly allocated PEs, but using special additional options:

options

24

1805.675

75.236

400.133

11.728

954.936

8.000

---

result_3.3_SR8000_1GB_003nodes_024PEs_c.shrt

result_3.3_SR8000_1GB_003nodes_024PEs_c.gz

24

1806.033

75.251

381.057

12.003

1014.339

8.000

SS

result_3.3_SR8000_1GB_003nodes_024PEs_with_SS.shrt

result_3.3_SR8000_1GB_003nodes_024PEs_with_SS.gz

24

1280.068

53.336

353.586

29.933

161.925

8.000

SS, 64

result_3.3_SR8000_1GB_003nodes_024PEs_with_SS_lp64.shrt

result_3.3_SR8000_1GB_003nodes_024PEs_with_SS_lp64.gz

24

1225.848

51.077

295.441

27.604

144.019

8.000

64

result_3.3_SR8000_1GB_003nodes_024PEs_with_lp64.shrt

result_3.3_SR8000_1GB_003nodes_024PEs_with_lp64.gz