HLRS - Services - Parallel Computing - Programming Models - MPI (original) (raw)
size
beff
beff/size
bandwidth per PE at Lmax
PingPong latency
PingPong bandwidth
maximal message length Lmax
#nodes * #PEs
summary
full protocol
MByte/s
MByte/s
MByte/s
microsec
MByte/s
MByte
explicitly allocated PEs, i.e. contiguous ranks on each node:
24
1805.675
75.236
400.133
11.728
954.936
8.000
3 * 8
result_3.3_SR8000_1GB_003nodes_024PEs_c.shrt
result_3.3_SR8000_1GB_003nodes_024PEs_c.gz
18
1565.703
86.983
427.860
11.525
1202.586
8.000
3 * 6
result_3.3_SR8000_1GB_003nodes_018PEs_c.shrt
result_3.3_SR8000_1GB_003nodes_018PEs_c.gz
12
1257.728
104.811
489.445
11.475
1204.480
8.000
3 * 4
result_3.3_SR8000_1GB_003nodes_012PEs_c.shrt
result_3.3_SR8000_1GB_003nodes_012PEs_c.gz
6
758.508
126.418
477.788
11.437
1224.976
8.000
3 * 2
result_3.3_SR8000_1GB_003nodes_006PEs_c.shrt
result_3.3_SR8000_1GB_003nodes_006PEs_c.gz
3
396.829
132.276
447.107
23.307
791.866
8.000
3 * 1
result_3.3_SR8000_1GB_003nodes_003PEs_c.shrt
result_3.3_SR8000_1GB_003nodes_003PEs_c.gz
16
1530.664
95.667
411.060
11.811
969.781
8.000
2 * 8
result_3.3_SR8000_1GB_002nodes_016PEs_c.shrt
result_3.3_SR8000_1GB_002nodes_016PEs_c.gz
12
1287.352
107.279
439.721
11.527
1208.742
8.000
2 * 6
result_3.3_SR8000_1GB_002nodes_012PEs_c.shrt
result_3.3_SR8000_1GB_002nodes_012PEs_c.gz
8
989.464
123.683
504.567
11.521
1213.191
8.000
2 * 4
result_3.3_SR8000_1GB_002nodes_008PEs_c.shrt
result_3.3_SR8000_1GB_002nodes_008PEs_c.gz
6
766.605
127.768
499.667
11.555
1222.560
8.000
2 * 3
result_3.3_SR8000_1GB_002nodes_006PEs_c.shrt
result_3.3_SR8000_1GB_002nodes_006PEs_c.gz
4
574.523
143.631
519.596
11.484
1226.043
8.000
2 * 2
result_3.3_SR8000_1GB_002nodes_004PEs_c.shrt
result_3.3_SR8000_1GB_002nodes_004PEs_c.gz
2
306.570
153.285
521.074
22.923
799.677
8.000
2 * 1
result_3.3_SR8000_1GB_002nodes_002PEs_c.shrt
result_3.3_SR8000_1GB_002nodes_002PEs_c.gz
8
1218.994
152.374
455.575
11.570
916.839
8.000
1 * 8
result_3.3_SR8000_1GB_001nodes_008PEs_c.shrt
result_3.3_SR8000_1GB_001nodes_008PEs_c.gz
7
1118.625
159.804
488.660
11.528
1207.508
8.000
1 * 7
result_3.3_SR8000_1GB_001nodes_007PEs_c.shrt
result_3.3_SR8000_1GB_001nodes_007PEs_c.gz
6
974.033
162.339
506.776
11.361
1211.698
8.000
1 * 6
result_3.3_SR8000_1GB_001nodes_006PEs_c.shrt
result_3.3_SR8000_1GB_001nodes_006PEs_c.gz
5
848.999
169.800
515.719
11.506
1211.176
8.000
1 * 5
result_3.3_SR8000_1GB_001nodes_005PEs_c.shrt
result_3.3_SR8000_1GB_001nodes_005PEs_c.gz
4
714.477
178.619
527.187
11.321
1216.537
8.000
1 * 4
result_3.3_SR8000_1GB_001nodes_004PEs_c.shrt
result_3.3_SR8000_1GB_001nodes_004PEs_c.gz
3
541.446
180.482
537.551
11.390
1222.115
8.000
1 * 3
result_3.3_SR8000_1GB_001nodes_003PEs_c.shrt
result_3.3_SR8000_1GB_001nodes_003PEs_c.gz
2
410.553
205.276
552.597
11.462
1230.266
8.000
1 * 2
result_3.3_SR8000_1GB_001nodes_002PEs_c.shrt
result_3.3_SR8000_1GB_001nodes_002PEs_c.gz
default round-robin order, i.e. ranks 0,3,6,... are on node 0, ranks 1,4,7,... on node 1, ranks 2,5,8,... on node 2:
24
915.478
38.145
110.275
23.077
741.535
8.000
3 * 8
result_3.3_SR8000_1GB_003nodes_024PEs.shrt
result_3.3_SR8000_1GB_003nodes_024PEs.gz
24
922.392
38.433
110.291
23.302
741.305
8.000
3 * 8
result_3.3_SR8000_1GB_003nodes_024PEs_b.shrt
result_3.3_SR8000_1GB_003nodes_024PEs_b.gz
18
895.539
49.752
138.199
23.185
752.172
8.000
3 * 6
result_3.3_SR8000_1GB_003nodes_018PEs.shrt
result_3.3_SR8000_1GB_003nodes_018PEs.gz
12
819.624
68.302
221.940
23.075
773.075
8.000
3 * 4
result_3.3_SR8000_1GB_003nodes_012PEs.shrt
result_3.3_SR8000_1GB_003nodes_012PEs.gz
6
618.331
103.055
361.906
23.158
785.927
8.000
3 * 2
result_3.3_SR8000_1GB_003nodes_006PEs.shrt
result_3.3_SR8000_1GB_003nodes_006PEs.gz
3
429.108
143.036
464.218
22.883
797.131
8.000
3 * 1
result_3.3_SR8000_1GB_003nodes_003PEs.shrt
result_3.3_SR8000_1GB_003nodes_003PEs.gz
16
775.710
48.482
115.840
36.781
103.655
8.000
2 * 8
result_3.3_SR8000_1GB_002nodes_016PEs.shrt
result_3.3_SR8000_1GB_002nodes_016PEs.gz
16
768.112
48.007
115.286
29.816
128.435
8.000
2 * 8
result_3.3_SR8000_1GB_002nodes_016PEs_b.shrt
result_3.3_SR8000_1GB_002nodes_016PEs_b.gz
12
768.140
64.012
158.576
23.544
770.873
8.000
2 * 6
result_3.3_SR8000_1GB_002nodes_012PEs.shrt
result_3.3_SR8000_1GB_002nodes_012PEs.gz
8
659.282
82.410
230.119
23.220
784.569
8.000
2 * 4
result_3.3_SR8000_1GB_002nodes_008PEs.shrt
result_3.3_SR8000_1GB_002nodes_008PEs.gz
8
680.633
85.079
230.015
23.430
775.896
8.000
2 * 4
result_3.3_SR8000_1GB_002nodes_008PEs_b.shrt
result_3.3_SR8000_1GB_002nodes_008PEs_b.gz
6
583.527
97.254
278.203
23.182
789.365
8.000
2 * 3
result_3.3_SR8000_1GB_002nodes_006PEs.shrt
result_3.3_SR8000_1GB_002nodes_006PEs.gz
4
495.397
123.849
390.279
23.160
792.499
8.000
2 * 2
result_3.3_SR8000_1GB_002nodes_004PEs.shrt
result_3.3_SR8000_1GB_002nodes_004PEs.gz
2
306.623
153.311
522.781
23.004
799.908
8.000
2 * 1
result_3.3_SR8000_1GB_002nodes_002PEs.shrt
result_3.3_SR8000_1GB_002nodes_002PEs.gz
8
1245.136
155.642
470.941
11.650
970.791
8.000
1 * 8
result_3.3_SR8000_1GB_001nodes_008PEs.shrt
result_3.3_SR8000_1GB_001nodes_008PEs.gz
6
971.215
161.869
505.246
11.563
1213.715
8.000
1 * 6
result_3.3_SR8000_1GB_001nodes_006PEs.shrt
result_3.3_SR8000_1GB_001nodes_006PEs.gz
4
706.801
176.700
526.974
11.521
1205.780
8.000
1 * 4
result_3.3_SR8000_1GB_001nodes_004PEs.shrt
result_3.3_SR8000_1GB_001nodes_004PEs.gz
2
410.471
205.236
552.259
11.549
1226.577
8.000
1 * 2
result_3.3_SR8000_1GB_001nodes_002PEs.shrt
result_3.3_SR8000_1GB_001nodes_002PEs.gz
explicitly allocated PEs, but using special additional options:
options
24
1805.675
75.236
400.133
11.728
954.936
8.000
---
result_3.3_SR8000_1GB_003nodes_024PEs_c.shrt
result_3.3_SR8000_1GB_003nodes_024PEs_c.gz
24
1806.033
75.251
381.057
12.003
1014.339
8.000
SS
result_3.3_SR8000_1GB_003nodes_024PEs_with_SS.shrt
result_3.3_SR8000_1GB_003nodes_024PEs_with_SS.gz
24
1280.068
53.336
353.586
29.933
161.925
8.000
SS, 64
result_3.3_SR8000_1GB_003nodes_024PEs_with_SS_lp64.shrt
result_3.3_SR8000_1GB_003nodes_024PEs_with_SS_lp64.gz
24
1225.848
51.077
295.441
27.604
144.019
8.000
64