# Quickstep Polarized Atomic Orbital - TiO2

Large-scale benchmark for PAO ML (and/or LS DFT in general).

## How to Run the Benchmark

Decompress all files with `bunzip2` and make sure [`BASIS_MOLOPT`](../../data/BASIS_MOLOPT) and [`GTH_POTENTIALS`](../../data/GTH_POTENTIALS) are available (both are in [cp2k/data](../../data/)).

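If `bunzip2` is not installed, the decompression step can be approximated with Python's standard `bz2` module. This is a minimal sketch: the helper name `decompress_bz2_files` is hypothetical, it assumes the compressed files end in `.bz2`, and unlike `bunzip2` it leaves the archives in place.

```python
import bz2
import shutil
from pathlib import Path


def decompress_bz2_files(directory="."):
    """Decompress every *.bz2 file in `directory`, keeping the archives."""
    written = []
    for archive in sorted(Path(directory).glob("*.bz2")):
        target = archive.with_suffix("")  # drop the trailing ".bz2"
        # Stream the data so large files are never held fully in memory.
        with bz2.open(archive, "rb") as src, open(target, "wb") as dst:
            shutil.copyfileobj(src, dst)
        written.append(target)
    return written
```

Calling `decompress_bz2_files()` in the benchmark directory returns the list of files it wrote.
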
For tuning purposes, the full benchmark can be shortened in the following ways:

- by reducing the number of MD steps (`STEPS 20` -> `STEPS 5`)
- by doing only an energy calculation (`RUN_TYPE MD` -> `RUN_TYPE ENERGY`)
- by doing an energy calculation with few SCF steps (`MAX_SCF 50` -> `MAX_SCF 5`)

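The substitutions above can be scripted. The following is a minimal sketch, not part of the benchmark: the helper `shorten_input` and the option names are hypothetical, and it assumes each keyword appears on its own line with exactly the original value listed above. If the same keyword and value occur in more than one input section, every occurrence is replaced, so check the result before running.

```python
import re

# Keyword substitutions listed above: option -> (keyword, original, reduced).
REDUCTIONS = {
    "fewer_md_steps": ("STEPS", "20", "5"),
    "energy_only": ("RUN_TYPE", "MD", "ENERGY"),
    "fewer_scf_steps": ("MAX_SCF", "50", "5"),
}


def shorten_input(text, *options):
    """Return the input text with the selected keyword values replaced."""
    for option in options:
        keyword, old, new = REDUCTIONS[option]
        # Match "<indent>KEYWORD <old>" on a line of its own, keeping the indent.
        pattern = rf"^([ \t]*{re.escape(keyword)}[ \t]+){re.escape(old)}([ \t]*)$"
        text, count = re.subn(
            pattern, rf"\g<1>{new}\g<2>", text, flags=re.MULTILINE | re.IGNORECASE
        )
        if count == 0:
            raise ValueError(f"'{keyword} {old}' not found in input")
    return text
```

For the third variant, pass both `"energy_only"` and `"fewer_scf_steps"`, since it combines an energy calculation with a reduced SCF limit.
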
## Results Archive

Reference energies and timings were obtained with CP2K svn:17405 on Piz Daint (Cray XC30), 1024 nodes, SB + K20X, with approximately 8 GB of memory used per node.

### Output File

The last column gives the time per SCF step.

```
> grep SCF out | head -n 10
 ------------------------------ Linear scaling SCF -----------------------------
 SCF     1   -552962.443966503   -552962.443966503   28.188702
 SCF     2   -553391.806569889      -429.362603386   32.410283
 SCF     3   -553756.740393045      -794.296426542   12.872345
 SCF     4   -554517.794343487     -1555.350376984   13.497826
 SCF     5   -554728.247042532      -210.452699045   33.469438
 SCF     6   -554849.946612308      -332.152268821   13.510741
 SCF     7   -554873.984235220      -356.189891734   17.914532
 SCF     8   -554915.784987915       -41.800752695   38.287248
 SCF     9   -554918.728930513       -44.744695293   14.047717
```

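To compare a run against these reference values, the `SCF` lines can be parsed programmatically. The sketch below assumes the five-field layout shown in the listing (`SCF`, step, energy, energy change, time); the function names are hypothetical and the sample lines are copied from the reference output.

```python
def parse_scf_lines(lines):
    """Return (step, energy, time) tuples from 'SCF <step> <E> <dE> <t>' lines."""
    records = []
    for line in lines:
        fields = line.split()
        # Data lines have five fields and start with 'SCF'; the banner line
        # '--- Linear scaling SCF ---' starts with dashes and is skipped.
        if len(fields) == 5 and fields[0] == "SCF" and fields[1].isdigit():
            records.append((int(fields[1]), float(fields[2]), float(fields[4])))
    return records


def summarize_scf(records):
    """Summarize step count, accumulated time, mean time, and last energy."""
    times = [time for _, _, time in records]
    return {
        "steps": len(records),
        "total_time": sum(times),
        "mean_time": sum(times) / len(times),
        "final_energy": records[-1][1],
    }


sample = [
    " ------------------------------ Linear scaling SCF -----------------------------",
    " SCF     1   -552962.443966503   -552962.443966503   28.188702",
    " SCF     2   -553391.806569889      -429.362603386   32.410283",
    " SCF     3   -553756.740393045      -794.296426542   12.872345",
]
summary = summarize_scf(parse_scf_lines(sample))
print(summary["steps"], summary["total_time"], summary["final_energy"])
```

In practice the lines would come from the CP2K output file rather than from an inline list.
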
### MD Energy File

```
> cat pao_ml_md-1.ener
#     Step Nr.          Time[fs]        Kin.[a.u.]          Temp[K]            Pot.[a.u.]        Cons Qty[a.u.]        UsedTime[s]
         0            0.000000       110.495412573       300.000000000   -554937.745690580   -554827.250278007         0.000000000
         1            0.500000       106.078997716       288.009235623   -554933.183067304   -554827.104069588      1606.181380667
         2            1.000000       102.069290733       277.122701359   -554929.060740767   -554826.991450034       437.939860202
         3            1.500000       100.198165911       272.042513562   -554927.153307400   -554826.955141488       449.515135792
         4            2.000000       101.022853760       274.281578054   -554928.022891735   -554827.000037975       375.188747253
         5            2.500000       103.892195371       282.071969193   -554930.981726713   -554827.089531342       378.129010544
         6            3.000000       107.422614470       291.657215360   -554934.608040349   -554827.185425879       363.554193239
         7            3.500000       110.115891922       298.969584414   -554937.373128860   -554827.257236939       365.110804793
         8            4.000000       110.802924876       300.834909692   -554938.098364813   -554827.295439937       367.848488379
         9            4.500000       108.870951325       295.589514868   -554936.164765656   -554827.293814332       366.621533569
        10            5.000000       104.372470458       283.375937591   -554931.619703179   -554827.247232721       371.757427932
        11            5.500000        98.092255253       266.324871693   -554925.248720303   -554827.156465049       366.436341383
        12            6.000000        91.519001482       248.478192945   -554918.555918355   -554827.036916873       438.594749461
        13            6.500000        86.552351413       234.993515291   -554913.479003540   -554826.926652127       363.525124027
        14            7.000000        84.878892773       230.450000040   -554911.758150807   -554826.879258034       361.602635782
        15            7.500000        87.178039466       236.692286411   -554914.109975162   -554826.931935696       362.720366624
        16            8.000000        92.619988766       251.467422791   -554919.692436207   -554827.072447441       360.480368018
        17            8.500000        99.082357198       269.013042870   -554926.321931528   -554827.239574330       362.877061196
        18            9.000000       104.007271910       282.384407159   -554931.375035372   -554827.367763462       364.628673398
        19            9.500000       105.405738067       286.181305486   -554932.815285468   -554827.409547401       365.768221021
        20           10.000000       102.522367308       278.352824576   -554929.882289568   -554827.359922261       364.275226861
```

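Two quantities are convenient when checking a run against this file: the drift of the conserved quantity and the typical wall time per MD step. The sketch below assumes the seven-column layout shown in the header line; the function names and dictionary keys are hypothetical. Step 1 takes far longer than the later steps in the reference data, so the mean excludes it by default.

```python
ENER_KEYS = (
    "step", "time_fs", "kinetic", "temperature",
    "potential", "conserved", "used_time",
)


def parse_ener(lines):
    """Parse .ener rows into dicts; blank lines and '#' comments are skipped."""
    rows = []
    for line in lines:
        stripped = line.strip()
        if not stripped or stripped.startswith("#"):
            continue
        values = stripped.split()
        row = dict(zip(ENER_KEYS, map(float, values)))
        row["step"] = int(values[0])
        rows.append(row)
    return rows


def conserved_drift(rows):
    """Conserved quantity of the last row minus that of the first row (a.u.)."""
    return rows[-1]["conserved"] - rows[0]["conserved"]


def mean_step_time(rows, skip=1):
    """Mean UsedTime[s] over rows whose step number is greater than `skip`."""
    times = [row["used_time"] for row in rows if row["step"] > skip]
    return sum(times) / len(times)
```

Typical usage is `rows = parse_ener(open("pao_ml_md-1.ener"))` followed by `conserved_drift(rows)`. For the reference data, the conserved quantity changes by about -0.11 a.u. between step 0 and step 20.
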
### Timings

```
 SUBROUTINE                       CALLS  ASD         SELF TIME        TOTAL TIME
                                MAXIMUM       AVERAGE  MAXIMUM  AVERAGE  MAXIMUM
 CP2K                                 1  1.0    0.182    0.228 8802.156 8802.158
 qs_mol_dyn_low                       1  2.0    0.002    0.002 8793.361 8796.208
 qs_forces                           21  4.0    0.300    0.313 8731.917 8731.935
 qs_energies                         21  5.0    0.001    0.001 8299.259 8316.296
 ls_scf                              21  6.0    0.000    0.000 8140.304 8157.313
 velocity_verlet                     20  3.0    0.014    0.024 7689.100 7689.373
 ls_scf_main                         21  7.0    0.006    6.929 6214.282 6221.060
 dbcsr_multiply_internal           6268 10.6    5.425    5.502 5456.668 5499.570
 multiply_cannon                   6268 11.6    9.185    9.602 4797.967 4876.009
 dm_ls_curvy_optimization           234  7.8    0.002    0.002 3756.519 3757.137
 optimization_step                  234  8.8    0.001    0.046 3056.137 3059.618
 multiply_cannon_multrec         401152 12.6  845.768 1008.921 2235.185 2348.598
 mp_waitall_1                   3329028 13.7 2028.515 2273.110 2028.515 2273.110
 compute_direction_newton            78  9.8    0.101    0.977 2206.565 2206.643
 ls_scf_dm_to_ks                    255  7.9    0.003    0.004 2046.532 2047.273
 multiply_cannon_metrocomm3      401152 12.6    0.957    1.092 1280.957 1642.411
 commutator_symm                   1536 10.8    0.024    0.026 1393.033 1401.600
 dbcsr_mm_accdrv_process        7955916 13.9  862.352  933.296 1051.028 1136.256
 rebuild_ks_matrix                  277  9.6    0.002    0.004 1013.434 1013.460
 qs_ks_build_kohn_sham_matrix       277 10.6    0.037    0.051 1013.433 1013.458
 pao_post_scf                        21  7.0    0.000    0.000  979.076  979.097
 multiply_cannon_metrocomm1      401152 12.6    1.188    1.293  666.837  966.589
 pao_add_forces                      21  8.0    0.022    0.036  939.777  941.085
 qs_ks_update_qs_env                256  8.9    0.002    0.002  915.333  915.356
 update_p_exp                       234  9.8    0.013    0.032  849.491  852.997
 qs_rho_update_rho                  256  8.9    0.003    0.004  732.701  747.571
 calculate_rho_elec                 256  9.9  486.068  514.736  732.698  747.568
 sum_up_and_integrate               277 11.6    0.488    0.516  732.856  733.703
 integrate_v_rspace                 277 12.6  518.242  550.497  732.367  733.214
 matrix_qs_to_ls                    276  8.0    0.142    0.144  701.236  702.223
 pao_calc_outer_grad_lnv             21  9.0    0.100    0.114  657.554  660.960
 transform_matrix_orth              333  8.8    0.010    0.011  647.406  651.982
 purify_mcweeny_orth                255 10.7    0.009    0.493  635.637  639.508
 make_m2s                         12536 11.6    5.422    5.562  568.779  607.900
 make_images                      12536 12.6   19.448   20.772  560.314  599.309
 make_images_sizes                12536 13.6    0.028    0.032  428.487  491.794
 mp_alltoall_i44                  12536 14.6  428.459  491.763  428.459  491.763
 matrix_ls_to_qs                    276  8.9    0.154    0.157  441.958  469.591
 mp_alltoall_d11v                 10853 12.8  223.253  388.573  223.253  388.573
 dbcsr_new_transposed              3211 11.1    9.536   10.585  333.787  347.908
 rs_distribute_matrix               554 12.2   26.205   29.397  187.192  341.592
 dbcsr_redistribute                2470 12.3   31.904   35.233  319.945  334.210
```
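
To see which routines dominate the runtime, the timing rows can be parsed and expressed as a share of the top-level `CP2K` entry. This is a minimal sketch under the assumption that each data row has the seven whitespace-separated fields shown in the listing; the function names and dictionary keys are hypothetical.

```python
def parse_timing_rows(lines):
    """Parse timing-report rows into dicts; header lines are skipped."""
    rows = []
    for line in lines:
        fields = line.split()
        if len(fields) != 7:
            continue
        try:
            calls = int(fields[1])
            asd, self_avg, self_max, total_avg, total_max = map(float, fields[2:])
        except ValueError:
            continue  # the 'SUBROUTINE CALLS ASD ...' header also has 7 fields
        rows.append({
            "name": fields[0], "calls": calls, "asd": asd,
            "self_avg": self_avg, "self_max": self_max,
            "total_avg": total_avg, "total_max": total_max,
        })
    return rows


def total_time_share(rows, name, reference="CP2K"):
    """Maximum total time of `name` divided by that of the `reference` row."""
    by_name = {row["name"]: row for row in rows}
    return by_name[name]["total_max"] / by_name[reference]["total_max"]


def top_by_self_time(rows, count=3):
    """Names of the `count` routines with the largest maximum self time."""
    ranked = sorted(rows, key=lambda row: row["self_max"], reverse=True)
    return [row["name"] for row in ranked[:count]]
```

Total times include the time spent in callees, so the shares of nested routines overlap and do not add up to one; the self-time ranking avoids that double counting.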