Lulesh 2.0 -- Jetson TX (2 threads, affinity scatter,1)

Self (%)  Cumulative (%)  Codelet Name                                Error (%)
5.08      5.1             __cere__lulesh__ZL23IntegrateStressForE...  25.64
3.9       4.0             __cere__lulesh__ZL18CalcEnergyForElemsP...  36.97
2.9       2.9             __cere__lulesh__ZL18CalcEnergyForElemsP...  41.77
8.16      8.2             __cere__lulesh__ZL28CalcHourglassContro...  11.65
2.5       2.5             __cere__lulesh__ZL28CalcFBHourglassForc...  2.34
1.6       1.6             __cere__lulesh__ZL18CalcEnergyForElemsP...  17.46
13.6      13.5            __cere__lulesh__ZL28CalcFBHourglassForc...  2.92
5.1       5.1             __cere__lulesh__Z22CalcKinematicsForEle...  4.79
1.1       1.1             __cere__lulesh__ZL20CalcPressureForElem...  6.4
4.4       4.4             __cere__lulesh__ZL20CalcPressureForElem...  8.26
3.95      3.9             __cere__lulesh__ZL31CalcMonotonicQGradi...  1.26
2.3       2.3             __cere__lulesh__ZL28CalcMonotonicQRegio...  40.14
0.9       0.9             __cere__lulesh__ZL18CalcEnergyForElemsP...  Too small
5.9       5.9             __cere__lulesh__ZL15EvalEOSForElemsR6Do...  13.13
3.2       3.2             __cere__lulesh__ZL23IntegrateStressForE...  11.86

Exec Time : 2.004303e+09 cycles

Matching : 47.91%
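
The Matching figure is consistent with summing the Self (%) of every codelet whose replay error falls below a cutoff. The sketch below reproduces it from the summary table. The 15% cutoff is an assumption inferred from the numbers (any value between 13.13 and 17.46 gives the same result), not something the report states, and the name `matching` is illustrative.

```python
# (Self %, Error %) pairs copied from the summary table.
# None marks the codelet reported as "Too small".
ROWS = [
    (5.08, 25.64), (3.9, 36.97), (2.9, 41.77), (8.16, 11.65),
    (2.5, 2.34), (1.6, 17.46), (13.6, 2.92), (5.1, 4.79),
    (1.1, 6.4), (4.4, 8.26), (3.95, 1.26), (2.3, 40.14),
    (0.9, None), (5.9, 13.13), (3.2, 11.86),
]

# Assumed cutoff: inferred from the data, not documented in the report.
ERROR_THRESHOLD = 15.0


def matching(rows, threshold=ERROR_THRESHOLD):
    """Sum the Self (%) of codelets whose error is below the threshold."""
    return sum(
        self_pct
        for self_pct, err in rows
        if err is not None and err < threshold
    )


print(f"{matching(ROWS):.2f}")  # 47.91
```

Read this way, roughly half of the measured execution time is covered by codelets that replay accurately; the rest comes from codelets with errors between 17% and 42% or from the one too small to measure.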


__cere__lulesh__ZL28CalcHourglassControlForElemsR6DomainPdd_1037


Real time: 1.127100e+08 -- Predicted time: 9.958430e+07

Callcount: 932

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
268 ### 1 928.72 1.072270e+05 1.213600e+05 11.65
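
The figures in this section appear to be related as follows: the per-cluster error is the absolute difference between the in vitro and in vivo cycle counts divided by the larger of the two, and the predicted time is the in vitro cycle count multiplied by the weight in the Part column (928.72 here), summed over clusters. This is an inference from the numbers in the report, not a documented formula, and the function names below are hypothetical.

```python
def relative_error(a, b):
    """Absolute difference relative to the larger value, in percent."""
    return abs(a - b) / max(a, b) * 100.0


def predicted_time(clusters):
    """Sum of part * invitro cycles over (part, invitro) cluster pairs."""
    return sum(part * invitro for part, invitro in clusters)


# Values for CalcHourglassControlForElems (_1037), copied from the report.
invitro, invivo = 1.072270e+05, 1.213600e+05
part = 928.72
real = 1.127100e+08

# Per-cluster error between the replayed and the original invocation.
print(round(relative_error(invitro, invivo), 2))  # 11.65

# Predicted time; lands close to the reported 9.958430e+07
# (the Part value is rounded in the report).
pred = predicted_time([(part, invitro)])
print(f"{pred:.4e}")

# Codelet-level error between real and predicted time.
print(round(relative_error(real, pred), 2))  # 11.65
```

The same two functions apply to the sections that list two clusters: pass one `(part, invitro)` pair per row to `predicted_time`.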

__cere__lulesh__ZL28CalcFBHourglassForceForElemsR6DomainPdS1_S1_S1_S1_S1_S1_dii_810


Real time: 2.533511e+08 -- Predicted time: 2.459457e+08

Callcount: 932

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
450 ### 1 932.44 2.637660e+05 2.717080e+05 2.92

__cere__lulesh__Z22CalcKinematicsForElemsR6DomainPddi_1538


Real time: 9.499813e+07 -- Predicted time: 9.978029e+07

Callcount: 932

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
527 ### 1 929.48 1.073510e+05 1.022060e+05 4.79

__cere__lulesh__ZL15EvalEOSForElemsR6DomainPdiPii_2269


Real time: 2.455068e+08 -- Predicted time: 2.826024e+08

Callcount: 32620

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
9011 ### 1 37968.88 7.443000e+03 6.466000e+03 13.13

__cere__lulesh__ZL31CalcMonotonicQGradientsForElemsR6DomainPd_1646


Real time: 5.944152e+07 -- Predicted time: 5.869230e+07

Callcount: 932

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
655 ### 1 927.25 6.329700e+04 6.410500e+04 1.26

__cere__lulesh__ZL18CalcEnergyForElemsPdS_S_S_S_S_S_S_S_S_S_S_S_dddddS_S_ddiPi_2129


Real time: 0.000000e+00 -- Predicted time: 0.000000e+00

Callcount: 0

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)

__cere__lulesh__ZL18CalcEnergyForElemsPdS_S_S_S_S_S_S_S_S_S_S_S_dddddS_S_ddiPi_2104


Real time: 1.057583e+08 -- Predicted time: 1.677804e+08

Callcount: 32620

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
16203 ### 1 31151.2 5.386000e+03 3.395000e+03 36.97

__cere__lulesh__ZL23IntegrateStressForElemsR6DomainPdS1_S1_S1_ii_593


Real time: 5.398875e+07 -- Predicted time: 6.125511e+07

Callcount: 932

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
498 ### 1 939.9 6.517200e+04 5.744100e+04 11.86

__cere__lulesh__ZL23IntegrateStressForElemsR6DomainPdS1_S1_S1_ii_549


Real time: 1.001763e+08 -- Predicted time: 7.449486e+07

Callcount: 932

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
213 ### 1 919.22 7.990000e+04 1.075230e+05 25.69
576 ### 2 12.06 8.703400e+04 1.110740e+05 21.64

__cere__lulesh__ZL20CalcPressureForElemsPdS_S_S_S_S_dddiPi_2051


Real time: 1.436395e+08 -- Predicted time: 1.534556e+08

Callcount: 97860

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
64324 ### 1 73091.1 1.486000e+03 2.870000e+02 80.69
59613 ### 2 31893.5 1.406000e+03 3.846000e+03 63.44

__cere__lulesh__ZL28CalcFBHourglassForceForElemsR6DomainPdS1_S1_S1_S1_S1_S1_dii_997


Real time: 5.356139e+07 -- Predicted time: 5.484690e+07

Callcount: 932

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
107 ### 1 938.32 5.845200e+04 5.708200e+04 2.34

__cere__lulesh__ZL28CalcMonotonicQRegionForElemsR6DomainiPdd_1798


Real time: 5.666185e+07 -- Predicted time: 3.391902e+07

Callcount: 10252

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
9880 ### 1 12201.09 2.780000e+03 4.644000e+03 40.14

__cere__lulesh__ZL18CalcEnergyForElemsPdS_S_S_S_S_S_S_S_S_S_S_S_dddddS_S_ddiPi_2091


Real time: 5.198056e+07 -- Predicted time: 6.297793e+07

Callcount: 32620

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
4537 ### 1 18941.78 2.472000e+03 9.810000e+02 60.32
29148 ### 2 7273.23 2.221000e+03 4.592000e+03 51.63

__cere__lulesh__ZL20CalcPressureForElemsPdS_S_S_S_S_dddiPi_2058


Real time: 1.464540e+08 -- Predicted time: 1.596320e+08

Callcount: 97860

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
60032 ### 1 20841.63 2.289000e+03 4.557000e+03 49.77
35760 ### 2 58233.86 1.922000e+03 8.840000e+02 54.01

__cere__lulesh__ZL18CalcEnergyForElemsPdS_S_S_S_S_S_S_S_S_S_S_S_dddddS_S_ddiPi_2182


Real time: 9.848493e+07 -- Predicted time: 1.691292e+08

Callcount: 32620

Invocation Cluster Part Invitro (cycles) Invivo (cycles) Error (%)
25688 ### 1 40882.08 4.137000e+03 2.409000e+03 41.77