Bench report : BaseP1vsOthersWithoutGPU (2012-11-28-20-41)
Tuesday 18 th December, 2012
Contents
1 benchs : MassAssembling ( P 1 -Lagrange) 2
1.1 mesh : disque4-1 . . . . 2 1.2 mesh : carre4-1 . . . . 3
2 benchs : MassWAssembling ( P 1 -Lagrange) 4
2.1 mesh : disque4-1 . . . . 4 2.2 mesh : carre4-1 . . . . 5
3 benchs : StiAssembling ( P 1 -Lagrange) 6
3.1 mesh : disque4-1 . . . . 6
3.2 mesh : carre4-1 . . . . 7
1.1 mesh : disque4-1
N n q n df P1base P1OptV0 P1OptV1 P1OptV2
50 3576 3576 1.242 (s)
x 1.00 0.533 (s)
x 2.33 0.069 (s)
x 18.05 0.009 (s) x 134.93 100 14222 14222 10.875 (s)
x 1.00 5.634 (s)
x 1.93 0.287 (s)
x 37.92 0.041 (s) x 263.16 150 31575 31575 44.259 (s)
x 1.00 29.042 (s)
x 1.52 0.629 (s)
x 70.39 0.097 (s) x 458.23 200 55919 55919 129.188 (s)
x 1.00 101.046 (s)
x 1.28 1.124 (s)
x 114.96 0.177 (s) x 730.40 250 86488 86488 305.606 (s)
x 1.00 250.771 (s)
x 1.22 1.739 (s)
x 175.70 0.280 (s) x 1092.52 300 125010 125010 693.431 (s)
x 1.00 562.307 (s)
x 1.23 2.582 (s)
x 268.60 0.426 (s) x 1626.86 350 170355 170355 1313.800 (s)
x 1.00 1120.008 (s)
x 1.17 3.530 (s)
x 372.18 0.589 (s) x 2231.84 400 225547 225547 3071.727 (s)
x 1.00 2074.929 (s)
x 1.48 4.485 (s)
x 684.83 0.787 (s) x 3902.29 450 281769 281769 3655.551 (s)
x 1.00 3054.103 (s)
x 1.20 5.587 (s)
x 654.27 0.960 (s) x 3807.49 500 343082 343082 5701.736 (s)
x 1.00 4459.816 (s)
x 1.28 6.894 (s)
x 827.04 1.212 (s) x 4703.74 550 424178 424178 8162.677 (s)
x 1.00 7638.798 (s)
x 1.07 8.893 (s)
x 917.92 1.584 (s) x 5154.11
Table 1: MassAssembling - Mesh le : disque4-1-< N >.msh
10
310
410
510
610
−310
−210
−110
010
110
210
310
4MassAssemblingP1 : computational time with disk mesh
Sparse Matrix size (n
q)
time in seconde
P1base P1OptV0 P1OptV1 P1OptV2 O(n
q) O(n
q2)
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
x 10
50
1000 2000 3000 4000 5000 6000
Speed Up compare to MassAssemblingP1base (order 1)
Sparse Matrix size (n
df)
Speed Up X
P1OptV0 P1OptV1 P1OptV2
Figure 1: Functions MassAssembling computation times and speedup
2
1.2 mesh : carre4-1
N n q n df P1base P1OptV0 P1OptV1 P1OptV2
50 3043 3043 0.996 (s)
x 1.00 0.424 (s)
x 2.35 0.058 (s)
x 17.31 0.008 (s) x 127.77 100 12139 12139 7.363 (s)
x 1.00 3.855 (s)
x 1.91 0.233 (s)
x 31.55 0.034 (s) x 215.32 150 26910 26910 36.811 (s)
x 1.00 21.366 (s)
x 1.72 0.533 (s)
x 69.02 0.081 (s) x 455.36 200 47834 47834 88.445 (s)
x 1.00 67.143 (s)
x 1.32 0.965 (s)
x 91.62 0.153 (s) x 579.16 250 74326 74326 248.745 (s)
x 1.00 182.475 (s)
x 1.36 1.468 (s)
x 169.44 0.236 (s) x 1054.39 300 107625 107625 467.922 (s)
x 1.00 399.437 (s)
x 1.17 2.178 (s)
x 214.88 0.366 (s) x 1278.28 350 145828 145828 709.196 (s)
x 1.00 724.398 (s)
x 0.98 2.941 (s)
x 241.13 0.483 (s) x 1467.42 400 189794 189794 2068.883 (s)
x 1.00 1669.436 (s)
x 1.24 3.776 (s)
x 547.91 0.650 (s) x 3184.28 450 241024 241024 2688.138 (s)
x 1.00 2912.279 (s)
x 0.92 4.811 (s)
x 558.70 0.827 (s) x 3249.30 500 297290 297290 4119.834 (s)
x 1.00 3563.674 (s)
x 1.16 5.952 (s)
x 692.19 1.032 (s) x 3992.12 550 360040 360040 5711.801 (s)
x 1.00 5130.092 (s)
x 1.11 7.360 (s)
x 776.01 1.245 (s) x 4588.14
Table 2: MassAssembling - Mesh le : carre4-1-< N >.msh
10
310
410
510
610
−310
−210
−110
010
110
210
310
4MassAssemblingP1 : computational time with square mesh
Sparse Matrix size (n
q)
time in seconde
P1base P1OptV0 P1OptV1 P1OptV2 O(n
q) O(n
q 2)
0 0.5 1 1.5 2 2.5 3 3.5 4
x 10
50
500 1000 1500 2000 2500 3000 3500 4000 4500 5000
Speed Up compare to MassAssemblingP1base (order 1)
Sparse Matrix size (n
df)
Speed Up X
P1OptV0 P1OptV1 P1OptV2
Figure 2: Functions MassAssembling computation times and speedup
2.1 mesh : disque4-1
N n q n df P1base P1OptV0 P1OptV1 P1OptV2
50 3576 3576 1.333 (s)
x 1.00 0.638 (s)
x 2.09 0.183 (s)
x 7.29 0.013 (s) x 100.76 100 14222 14222 11.341 (s)
x 1.00 6.447 (s)
x 1.76 0.523 (s)
x 21.68 0.045 (s) x 254.80 150 31575 31575 47.831 (s)
x 1.00 36.182 (s)
x 1.32 1.169 (s)
x 40.92 0.101 (s) x 475.41 200 55919 55919 144.649 (s)
x 1.00 125.339 (s)
x 1.15 1.997 (s)
x 72.45 0.178 (s) x 813.52 250 86488 86488 341.704 (s)
x 1.00 339.268 (s)
x 1.01 3.099 (s)
x 110.25 0.279 (s) x 1225.19 300 125010 125010 715.268 (s)
x 1.00 584.245 (s)
x 1.22 4.567 (s)
x 156.61 0.432 (s) x 1654.09 350 170355 170355 1480.894 (s)
x 1.00 1304.881 (s)
x 1.13 6.139 (s)
x 241.24 0.597 (s) x 2479.80 400 225547 225547 3349.900 (s)
x 1.00 2394.946 (s)
x 1.40 8.150 (s)
x 411.06 0.792 (s) x 4228.52 450 281769 281769 4022.335 (s)
x 1.00 3620.519 (s)
x 1.11 10.140 (s)
x 396.70 1.006 (s) x 3999.49 500 343082 343082 5901.041 (s)
x 1.00 5111.303 (s)
x 1.15 12.441 (s)
x 474.32 1.239 (s) x 4764.29 550 424178 424178 8342.178 (s)
x 1.00 8352.331 (s)
x 1.00 15.486 (s)
x 538.71 1.658 (s) x 5031.80
Table 3: MassWAssembling - Mesh le : disque4-1-< N >.msh
10
310
410
510
610
−210
−110
010
110
210
310
4MassWAssemblingP1 : computational time with disk mesh
Sparse Matrix size (n
q)
time in seconde
P1base P1OptV0 P1OptV1 P1OptV2 O(n
q) O(n
q2)
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
x 10
50
1000 2000 3000 4000 5000 6000
Speed Up compare to MassWAssemblingP1base (order 1)
Sparse Matrix size (n
df)
Speed Up X
P1OptV0 P1OptV1 P1OptV2
Figure 3: Functions MassWAssembling computation times and speedup
4
2.2 mesh : carre4-1
N n q n df P1base P1OptV0 P1OptV1 P1OptV2
50 3043 3043 1.072 (s)
x 1.00 0.500 (s)
x 2.15 0.103 (s)
x 10.40 0.010 (s) x 108.59 100 12139 12139 7.683 (s)
x 1.00 4.278 (s)
x 1.80 0.427 (s)
x 18.01 0.038 (s) x 201.84 150 26910 26910 39.385 (s)
x 1.00 26.918 (s)
x 1.46 0.951 (s)
x 41.41 0.085 (s) x 461.95 200 47834 47834 96.608 (s)
x 1.00 88.992 (s)
x 1.09 1.704 (s)
x 56.69 0.151 (s) x 638.83 250 74326 74326 282.724 (s)
x 1.00 236.944 (s)
x 1.19 2.658 (s)
x 106.36 0.228 (s) x 1239.33 300 107625 107625 497.749 (s)
x 1.00 518.584 (s)
x 0.96 3.863 (s)
x 128.85 0.348 (s) x 1432.34 350 145828 145828 741.543 (s)
x 1.00 835.461 (s)
x 0.89 5.251 (s)
x 141.23 0.494 (s) x 1502.61 400 189794 189794 2220.458 (s)
x 1.00 1718.449 (s)
x 1.29 6.921 (s)
x 320.83 0.660 (s) x 3363.52 450 241024 241024 2397.012 (s)
x 1.00 2315.954 (s)
x 1.03 8.793 (s)
x 272.62 0.837 (s) x 2863.30 500 297290 297290 4225.142 (s)
x 1.00 3515.611 (s)
x 1.20 10.755 (s)
x 392.85 1.043 (s) x 4052.61 550 360040 360040 6168.840 (s)
x 1.00 5731.954 (s)
x 1.08 13.922 (s)
x 443.09 1.288 (s) x 4789.74
Table 4: MassWAssembling - Mesh le : carre4-1-< N >.msh
10
310
410
510
610
−310
−210
−110
010
110
210
310
4MassWAssemblingP1 : computational time with square mesh
Sparse Matrix size (n
q)
time in seconde
P1base P1OptV0 P1OptV1 P1OptV2 O(n
q) O(n
q 2)
0 0.5 1 1.5 2 2.5 3 3.5 4
x 10
50
500 1000 1500 2000 2500 3000 3500 4000 4500 5000
Speed Up compare to MassWAssemblingP1base (order 1)
Sparse Matrix size (n
df)
Speed Up X
P1OptV0 P1OptV1 P1OptV2
Figure 4: Functions MassWAssembling computation times and speedup
3.1 mesh : disque4-1
N n q n df P1base P1OptV0 P1OptV1 P1OptV2
50 3576 3576 1.508 (s)
x 1.00 0.738 (s)
x 2.04 0.287 (s)
x 5.26 0.012 (s) x 122.54 100 14222 14222 12.294 (s)
x 1.00 6.864 (s)
x 1.79 1.215 (s)
x 10.12 0.066 (s) x 186.02 150 31575 31575 47.791 (s)
x 1.00 32.143 (s)
x 1.49 2.644 (s)
x 18.07 0.105 (s) x 455.01 200 55919 55919 135.202 (s)
x 1.00 99.828 (s)
x 1.35 4.724 (s)
x 28.62 0.197 (s) x 687.20 250 86488 86488 314.966 (s)
x 1.00 259.689 (s)
x 1.21 8.636 (s)
x 36.47 0.300 (s) x 1049.04 300 125010 125010 812.572 (s)
x 1.00 737.888 (s)
x 1.10 11.126 (s)
x 73.04 0.453 (s) x 1793.78 350 170355 170355 1342.657 (s)
x 1.00 1166.721 (s)
x 1.15 15.644 (s)
x 85.82 0.638 (s) x 2104.43 400 225547 225547 3268.987 (s)
x 1.00 2107.213 (s)
x 1.55 18.423 (s)
x 177.44 0.833 (s) x 3923.82 450 281769 281769 3797.105 (s)
x 1.00 3485.933 (s)
x 1.09 23.514 (s)
x 161.48 1.062 (s) x 3576.73 500 343082 343082 6085.713 (s)
x 1.00 5703.957 (s)
x 1.07 28.669 (s)
x 212.27 1.331 (s) x 4572.12 550 424178 424178 8462.518 (s)
x 1.00 8774.701 (s)
x 0.96 35.186 (s)
x 240.51 1.725 (s) x 4904.97
Table 5: StiAssembling - Mesh le : disque4-1-< N >.msh
10
310
410
510
610
−210
−110
010
110
210
310
4StiffAssemblingP1 : computational time with disk mesh
Sparse Matrix size (n
q)
time in seconde
P1base P1OptV0 P1OptV1 P1OptV2 O(n
q) O(n
q2)
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5
x 10
50
500 1000 1500 2000 2500 3000 3500 4000 4500 5000
Speed Up compare to StiffAssemblingP1base (order 1)
Sparse Matrix size (n
df)
Speed Up X
P1OptV0 P1OptV1 P1OptV2
Figure 5: Functions StiffAssembling computation times and speedup
6
3.2 mesh : carre4-1
N n q n df P1base P1OptV0 P1OptV1 P1OptV2
50 3043 3043 1.204 (s)
x 1.00 0.576 (s)
x 2.09 0.240 (s)
x 5.01 0.008 (s) x 152.13 100 12139 12139 8.063 (s)
x 1.00 4.493 (s)
x 1.79 0.972 (s)
x 8.29 0.044 (s) x 183.58 150 26910 26910 38.764 (s)
x 1.00 26.565 (s)
x 1.46 2.161 (s)
x 17.94 0.091 (s) x 423.89 200 47834 47834 96.483 (s)
x 1.00 88.635 (s)
x 1.09 3.880 (s)
x 24.86 0.167 (s) x 576.53 250 74326 74326 271.991 (s)
x 1.00 244.591 (s)
x 1.11 6.050 (s)
x 44.96 0.259 (s) x 1051.40 300 107625 107625 536.832 (s)
x 1.00 519.053 (s)
x 1.03 8.748 (s)
x 61.37 0.376 (s) x 1428.63 350 145828 145828 831.011 (s)
x 1.00 1012.707 (s)
x 0.82 12.063 (s)
x 68.89 0.517 (s) x 1608.25 400 189794 189794 2383.408 (s)
x 1.00 1845.082 (s)
x 1.29 15.862 (s)
x 150.26 0.718 (s) x 3317.31 450 241024 241024 2401.234 (s)
x 1.00 2485.502 (s)
x 0.97 20.074 (s)
x 119.62 0.902 (s) x 2662.29 500 297290 297290 4578.485 (s)
x 1.00 4166.603 (s)
x 1.10 24.958 (s)
x 183.45 1.124 (s) x 4073.78 550 360040 360040 6505.867 (s)
x 1.00 6286.496 (s)
x 1.03 29.963 (s)
x 217.13 1.362 (s) x 4777.34
Table 6: StiAssembling - Mesh le : carre4-1-< N >.msh
10
310
410
510
610
−310
−210
−110
010
110
210
310
4StiffAssemblingP1 : computational time with square mesh
Sparse Matrix size (n
q)
time in seconde
P1base P1OptV0 P1OptV1 P1OptV2 O(n
q) O(n
q 2)
0 0.5 1 1.5 2 2.5 3 3.5 4
x 10
50
500 1000 1500 2000 2500 3000 3500 4000 4500 5000
Speed Up compare to StiffAssemblingP1base (order 1)
Sparse Matrix size (n
df)
Speed Up X