Table 4. CPU Time Consumption (in Minutes) of Each Test Job by the AMD Opteron System and IBM P690.

System

 

K8-32 a

 

P690

Compiler

 

ifc 7.1

ifc 7.1

ifc 7.1

pgf77 5.0-1

pgf77 5.0-1

pgf90 5.0-1

pgf77 5.0-1

pgf90 5.0-1

pgf77 5.0-1

pgf90 5.0-1

 

xlf 8.0

Arch. opt.b

 

(a)

(a)

(a)

(b)

(b)

(b)

(c)

(c)

(d)

(d)

 

g98 default

Library

 

ATLAS 3.4.7

MKL 6.0

ACML 1.0

BLAS

ATLAS 3.4.7

ACML 1.0

ATLAS 3.4.7

ACML 1.0

ATLAS 3.4.7

ACML 1.0

 

ESSL

322-1

 

2.10

1.92

2.45

2.66

2.72

3.44

2.36

3.1

2.36

3.1

 

2.08

322-2

 

49

49

49

72

72

70

62

60

62

60

 

38

322-3

 

76

74

74

124

125

123

94

93

94

93

 

90

338

 

12

12

12

20

19

19

15

15

15

15

 

12

339

 

14

14

14

23

22

23

17

18

17

18

 

15

364

 

17

17

17

24

24

25

18

19

18 d

19 d

 

15

397

 

196

196

197

297

297

296

232

231

232

231

 

246

415-1

 

9

10

10

14

13

14

11

12

11

12

 

9

415-2

 

20

21

22

30

26

29

22

25

23

25

 

17

420-1

 

43

43

43

64

62

61

49

49

49

49

 

47

420-2

 

43

43

43

64

62

61

49

49

49

49

 

56

424-1

 

27

27

29

52

33

39

30

36

30

35

 

25

424-2

 

26

27

29

52

33

39

30

36

30

35

 

25

438

 

18

18

22

36

22

38

19

37

19

36

 

18

439-1 c

 

39(2)

39(2)

39(2)

68(2)

64(2)

64(2)

89(4)

48(2)

89(4)

49(2)

 

91(4)

439-2

 

30

31

30

49

49

49

38

33

38

33

 

39

447-1

 

64

64

65

109

98

102

74

77

74

78

 

72

447-2

 

40

41

41

74

60

62

47

49

47

49

 

38

559-4

 

8

9

8

8

8

9

8

8

8

8

 

13

560-4

 

9

10

9

9

9

10

9 d

9 d

9

9

 

15

561-4

 

25

25

25

30

29

31

25

27

25

27

 

38

Sum

 

728

733

741

1154

1066

1103

851 d

886 d

852 d

884 d

 

830

a The 32-bit compilation under 64-bit operating system.  b The global options of ifc is set to "-unroll -O2", plus: (a) "-tpp7 -axW -ipo -ipo_obj". The global optimization to pgf77/pgf90 is set to "-Munroll -O2", plus: (b) "-tp k8-32 -Mvect=cachesize:1048576"; (c) "-tp k8-32 -Mvect=cachesize:1048576 -fastsse"; (d) "-fastsse  -time -Mreentrant -Mrecursive -Mnosave -Minfo -Mneginfo -Mscalarsse -Mvect=assoc,recog,prefetch,sse,cachesize:1048576". c The numbers in the parentheses are the step numbers required in the geometrical optimization calculations; this CPU time is excluded from sum. d Estimated.