You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 8 Next »

Timing histograms

Single core processing

80-core processing, timing for runks 0, 30, and 60

Results

Table shows mean time and its statistical uncertainty from histograms for each time point increment.

t pointtime incrementpoint descriptiontime for rank 0/1rank 0/80rank 30/80rank 60/80
1t1 - t0det.raw0.8±0.2 ms4.0 ±0.6 ms3.2±0.4 ms3.5 ±0.8 ms
2t2 - t1det.pedestals15±3 μs36 ±10 μs31±6 μs39 ±17 μs
3t3 - t2det.gain,offset15±2 μs27 ±4 μs26±4 μs27 ±6 μs
4...cmpars25±1 μs50 ±7 μs58±26 μs71 ±33 μs
5
gfac2±0 μs6 ±1 μs7±1 μs7 ±2 μs
6
gr0,1,21.3±0.2 ms10.5 ±1.1 ms7.0±0.9 ms9.7 ±1.6 ms
7
make arrf1.76±0.05 ms9.2 ±0.9 ms6.3±0.7 ms9.0 ±1.5 ms
8
subtract peds93.7±3.1 ms191 ±11 ms181±15 ms259 ±26 ms
9
eval gain factor for gain ranges4.9±0.6 ms20.3 ±1.5 ms14.6±1.2 ms17.3 ±2.0 ms
10
eval offset for gain ranges6.2±0.4 ms18.5 ±1.3 ms18.4±1.4 ms19.2 ±2.1 ms
11
subtract offset1.0±0.2 ms6.0 ±0.7 ms5.3±0.6 ms6.2 ±1.2 ms
12
get mask3±2 μs6 ±2 μs6±2 μs7 ±2 μs
13
common mode turned off7±1 μs15 ±2 μs17±2 μs20 ±3 μs
14t14 - t13apply gain factor and mask4.0±0.7 ms14.9 ±2.0 ms13.9±1.6 ms19.2 ±3.5 ms
99t14 - t0per evt time, inside det.calib109.8±4.2 ms276 ±15 ms247±13 ms345 ±29 ms
0t0 - t0 previous evttime between consecutive det.calib115.4±3.9 ms335 ±16 ms307±14 ms398 ±32 ms

Summary

  • single core processing is faster than per/core time in 80–core case, factor 2.5-3 FOR ALL OPERATIONS
  • in 80-core case: time per core is consistent between cores
  • all constants are cashed and access to constants is fast at sub-milisecond level
  • common mode correction is turned off, as well as mask?
  • most time consuming operation is indexed pedestal subtraction
indexed by gain ranges pedestal subtraction
    t07 = time()

    arrf[gr0] -= peds[0,gr0]
    arrf[gr1] -= peds[1,gr1]
    arrf[gr2] -= peds[2,gr2]

    t08 = time() 
  • bad single-to-multicore scaling issue has nothing to do with particular algorithm, it is common problem for any algorithm

References

  • No labels