Lulesh 2.0 -- Jetson TX (2 threads, affinity scatter,1)

Self (%)  Cumulative (%)  Codelet Name                                Error (%)
5.08      5.1             __cere__lulesh__ZL23IntegrateStressForE...  25.64
3.9       4.0             __cere__lulesh__ZL18CalcEnergyForElemsP...  36.97
2.9       2.9             __cere__lulesh__ZL18CalcEnergyForElemsP...  41.77
8.16      8.2             __cere__lulesh__ZL28CalcHourglassContro...  11.65
2.5       2.5             __cere__lulesh__ZL28CalcFBHourglassForc...  2.34
1.6       1.6             __cere__lulesh__ZL18CalcEnergyForElemsP...  17.46
13.6      13.5            __cere__lulesh__ZL28CalcFBHourglassForc...  2.92
5.1       5.1             __cere__lulesh__Z22CalcKinematicsForEle...  4.79
1.1       1.1             __cere__lulesh__ZL20CalcPressureForElem...  6.4
4.4       4.4             __cere__lulesh__ZL20CalcPressureForElem...  8.26
3.95      3.9             __cere__lulesh__ZL31CalcMonotonicQGradi...  1.26
2.3       2.3             __cere__lulesh__ZL28CalcMonotonicQRegio...  40.14
0.9       0.9             __cere__lulesh__ZL18CalcEnergyForElemsP...  Too small
5.9       5.9             __cere__lulesh__ZL15EvalEOSForElemsR6Do...  13.13
3.2       3.2             __cere__lulesh__ZL23IntegrateStressForE...  11.86

Exec Time : 2.004303e+09 cycles

Matching : 47.91%
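
The matching figure appears to be the share of execution time covered by codelets whose replay error is acceptable. As a rough sketch, summing the Self (%) values of the rows in the table above whose Error (%) is below 15 reproduces 47.91. The 15% cutoff and the names `rows` and `matching_coverage` are assumptions made for illustration and are not stated in the report; any cutoff between 13.13 and 17.46 gives the same sum.

```python
# (self_percent, error_percent) per codelet, copied from the table above.
# None marks the row whose error is reported as "Too small".
rows = [
    (5.08, 25.64), (3.9, 36.97), (2.9, 41.77), (8.16, 11.65), (2.5, 2.34),
    (1.6, 17.46), (13.6, 2.92), (5.1, 4.79), (1.1, 6.4), (4.4, 8.26),
    (3.95, 1.26), (2.3, 40.14), (0.9, None), (5.9, 13.13), (3.2, 11.86),
]


def matching_coverage(rows, max_error=15.0):
    """Sum Self (%) over codelets with a numeric error below max_error.

    The default cutoff of 15% is an assumption inferred from the data.
    """
    return sum(s for s, e in rows if e is not None and e < max_error)


print(f"{matching_coverage(rows):.2f}")  # prints 47.91
```

Rows without a numeric error (the "Too small" codelet) are left out of the sum in this sketch.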


__cere__lulesh__ZL28CalcHourglassControlForElemsR6DomainPdd_1037


Real time: 1.127100e+08 -- Predicted time: 9.958430e+07

Callcount: 932

Invocation  Cluster  Part    Invitro (cycles)  Invivo (cycles)  Error (%)
268         ### 1    928.72  1.072270e+05      1.213600e+05     11.65

__cere__lulesh__ZL28CalcFBHourglassForceForElemsR6DomainPdS1_S1_S1_S1_S1_S1_dii_810


Real time: 2.533511e+08 -- Predicted time: 2.459457e+08

Callcount: 932

Invocation  Cluster  Part    Invitro (cycles)  Invivo (cycles)  Error (%)
450         ### 1    932.44  2.637660e+05      2.717080e+05     2.92

__cere__lulesh__Z22CalcKinematicsForElemsR6DomainPddi_1538


Real time: 9.499813e+07 -- Predicted time: 9.978029e+07

Callcount: 932

Invocation  Cluster  Part    Invitro (cycles)  Invivo (cycles)  Error (%)
527         ### 1    929.48  1.073510e+05      1.022060e+05     4.79
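
The per-codelet figures are consistent with a simple extrapolation model. The predicted time matches the cluster's Part weight multiplied by the in vitro cycles of its representative invocation. The error matches the gap between real and predicted time, taken relative to the larger of the two. The sketch below checks both relationships against the three codelets. The formulas are inferred from the reported numbers rather than taken from the tool's documentation, and the helper names and shortened codelet labels are hypothetical.

```python
# Figures copied from the three codelet sections:
# (label, part, invitro_cycles, real_time, predicted_time, reported_error_percent)
codelets = [
    ("CalcHourglassControlForElems", 928.72, 1.072270e+05, 1.127100e+08, 9.958430e+07, 11.65),
    ("CalcFBHourglassForceForElems", 932.44, 2.637660e+05, 2.533511e+08, 2.459457e+08, 2.92),
    ("CalcKinematicsForElems", 929.48, 1.073510e+05, 9.499813e+07, 9.978029e+07, 4.79),
]


def predict_time(part, invitro_cycles):
    """Assumed model: cluster weight times the representative's in vitro cycles."""
    return part * invitro_cycles


def relative_error(real, predicted):
    """Assumed metric: absolute gap over the larger value, in percent."""
    return abs(real - predicted) / max(real, predicted) * 100.0


for label, part, invitro, real, predicted, reported in codelets:
    est = predict_time(part, invitro)
    err = relative_error(real, predicted)
    print(f"{label}: predicted ~ {est:.6e} (report {predicted:.6e}), "
          f"error ~ {err:.2f}% (report {reported}%)")
```

Small differences in the recomputed predicted time are expected, since Part is printed with only two decimals.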