1# Quickstep Polarized Atomic Orbital - TiO2 2 3Large scale benchmark for PAO ML (and/or LS DFT in general). 4 5## How to Run the Benchmark 6 7Bunzip2 all files, have [`BASIS_MOLOPT`](../../data/BASIS_MOLOPT) and [`GTH_POTENTIALS`](../../data/GTH_POTENTIALS) available (from [cp2k/data](../../data/). 8 9For tuning purposes, the length of the full benchmark can be reduced in the following ways: 10 11- by reducing the number of MD steps (STEPS 20 -> STEPS 5) 12- by doing only an energy calculation (RUN_TYPE MD -> RUN_TYPE ENERGY) 13- by doing an energy calculation with few SCF steps (MAX_SCF 50 -> MAX_SCF 5) 14 15## Results Archive 16 17Reference energies and timings using CP2K svn:17405, Piz Daint, Cray XC30, 1024 nodes, SB + K20X, +- 8Gb mem used per node 18 19### Output File 20 21(last column timings per step) 22 23``` 24> grep SCF out | head -n 10 25 ------------------------------ Linear scaling SCF ----------------------------- 26 SCF 1 -552962.443966503 -552962.443966503 28.188702 27 SCF 2 -553391.806569889 -429.362603386 32.410283 28 SCF 3 -553756.740393045 -794.296426542 12.872345 29 SCF 4 -554517.794343487 -1555.350376984 13.497826 30 SCF 5 -554728.247042532 -210.452699045 33.469438 31 SCF 6 -554849.946612308 -332.152268821 13.510741 32 SCF 7 -554873.984235220 -356.189891734 17.914532 33 SCF 8 -554915.784987915 -41.800752695 38.287248 34 SCF 9 -554918.728930513 -44.744695293 14.047717 35``` 36 37### MD Energy File 38 39``` 40> cat pao_ml_md-1.ener 41# Step Nr. Time[fs] Kin.[a.u.] Temp[K] Pot.[a.u.] Cons Qty[a.u.] UsedTime[s] 42 0 0.000000 110.495412573 300.000000000 -554937.745690580 -554827.250278007 0.000000000 43 1 0.500000 106.078997716 288.009235623 -554933.183067304 -554827.104069588 1606.181380667 44 2 1.000000 102.069290733 277.122701359 -554929.060740767 -554826.991450034 437.939860202 45 3 1.500000 100.198165911 272.042513562 -554927.153307400 -554826.955141488 449.515135792 46 4 2.000000 101.022853760 274.281578054 -554928.022891735 -554827.000037975 375.188747253 47 5 2.500000 103.892195371 282.071969193 -554930.981726713 -554827.089531342 378.129010544 48 6 3.000000 107.422614470 291.657215360 -554934.608040349 -554827.185425879 363.554193239 49 7 3.500000 110.115891922 298.969584414 -554937.373128860 -554827.257236939 365.110804793 50 8 4.000000 110.802924876 300.834909692 -554938.098364813 -554827.295439937 367.848488379 51 9 4.500000 108.870951325 295.589514868 -554936.164765656 -554827.293814332 366.621533569 52 10 5.000000 104.372470458 283.375937591 -554931.619703179 -554827.247232721 371.757427932 53 11 5.500000 98.092255253 266.324871693 -554925.248720303 -554827.156465049 366.436341383 54 12 6.000000 91.519001482 248.478192945 -554918.555918355 -554827.036916873 438.594749461 55 13 6.500000 86.552351413 234.993515291 -554913.479003540 -554826.926652127 363.525124027 56 14 7.000000 84.878892773 230.450000040 -554911.758150807 -554826.879258034 361.602635782 57 15 7.500000 87.178039466 236.692286411 -554914.109975162 -554826.931935696 362.720366624 58 16 8.000000 92.619988766 251.467422791 -554919.692436207 -554827.072447441 360.480368018 59 17 8.500000 99.082357198 269.013042870 -554926.321931528 -554827.239574330 362.877061196 60 18 9.000000 104.007271910 282.384407159 -554931.375035372 -554827.367763462 364.628673398 61 19 9.500000 105.405738067 286.181305486 -554932.815285468 -554827.409547401 365.768221021 62 20 10.000000 102.522367308 278.352824576 -554929.882289568 -554827.359922261 364.275226861 63``` 64 65### Timings 66 67``` 68 SUBROUTINE CALLS ASD SELF TIME TOTAL TIME 69 MAXIMUM AVERAGE MAXIMUM AVERAGE MAXIMUM 70 CP2K 1 1.0 0.182 0.228 8802.156 8802.158 71 qs_mol_dyn_low 1 2.0 0.002 0.002 8793.361 8796.208 72 qs_forces 21 4.0 0.300 0.313 8731.917 8731.935 73 qs_energies 21 5.0 0.001 0.001 8299.259 8316.296 74 ls_scf 21 6.0 0.000 0.000 8140.304 8157.313 75 velocity_verlet 20 3.0 0.014 0.024 7689.100 7689.373 76 ls_scf_main 21 7.0 0.006 6.929 6214.282 6221.060 77 dbcsr_multiply_internal 6268 10.6 5.425 5.502 5456.668 5499.570 78 multiply_cannon 6268 11.6 9.185 9.602 4797.967 4876.009 79 dm_ls_curvy_optimization 234 7.8 0.002 0.002 3756.519 3757.137 80 optimization_step 234 8.8 0.001 0.046 3056.137 3059.618 81 multiply_cannon_multrec 401152 12.6 845.768 1008.921 2235.185 2348.598 82 mp_waitall_1 3329028 13.7 2028.515 2273.110 2028.515 2273.110 83 compute_direction_newton 78 9.8 0.101 0.977 2206.565 2206.643 84 ls_scf_dm_to_ks 255 7.9 0.003 0.004 2046.532 2047.273 85 multiply_cannon_metrocomm3 401152 12.6 0.957 1.092 1280.957 1642.411 86 commutator_symm 1536 10.8 0.024 0.026 1393.033 1401.600 87 dbcsr_mm_accdrv_process 7955916 13.9 862.352 933.296 1051.028 1136.256 88 rebuild_ks_matrix 277 9.6 0.002 0.004 1013.434 1013.460 89 qs_ks_build_kohn_sham_matrix 277 10.6 0.037 0.051 1013.433 1013.458 90 pao_post_scf 21 7.0 0.000 0.000 979.076 979.097 91 multiply_cannon_metrocomm1 401152 12.6 1.188 1.293 666.837 966.589 92 pao_add_forces 21 8.0 0.022 0.036 939.777 941.085 93 qs_ks_update_qs_env 256 8.9 0.002 0.002 915.333 915.356 94 update_p_exp 234 9.8 0.013 0.032 849.491 852.997 95 qs_rho_update_rho 256 8.9 0.003 0.004 732.701 747.571 96 calculate_rho_elec 256 9.9 486.068 514.736 732.698 747.568 97 sum_up_and_integrate 277 11.6 0.488 0.516 732.856 733.703 98 integrate_v_rspace 277 12.6 518.242 550.497 732.367 733.214 99 matrix_qs_to_ls 276 8.0 0.142 0.144 701.236 702.223 100 pao_calc_outer_grad_lnv 21 9.0 0.100 0.114 657.554 660.960 101 transform_matrix_orth 333 8.8 0.010 0.011 647.406 651.982 102 purify_mcweeny_orth 255 10.7 0.009 0.493 635.637 639.508 103 make_m2s 12536 11.6 5.422 5.562 568.779 607.900 104 make_images 12536 12.6 19.448 20.772 560.314 599.309 105 make_images_sizes 12536 13.6 0.028 0.032 428.487 491.794 106 mp_alltoall_i44 12536 14.6 428.459 491.763 428.459 491.763 107 matrix_ls_to_qs 276 8.9 0.154 0.157 441.958 469.591 108 mp_alltoall_d11v 10853 12.8 223.253 388.573 223.253 388.573 109 dbcsr_new_transposed 3211 11.1 9.536 10.585 333.787 347.908 110 rs_distribute_matrix 554 12.2 26.205 29.397 187.192 341.592 111 dbcsr_redistribute 2470 12.3 31.904 35.233 319.945 334.210 112``` 113