Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Content

Table of Contents

previous page: Scaling behavior of psana1 - Part 1 - det.calib method in multicore processing with mpi

2024-02-06 Test of

...

milano216 host with

...

perf

...

Test description

...

titlecommand perf, response, and accumulation of results in dict
collapsetrue

...

Description

Using command:

perf stat -e

...

cache-references,cache-misses,cycles,instructions,branches,branch-misses,faults,migrations,page-faults,L1-dcache-load-misses,L1-icache-load-misses     python test-scaling-subproc.py <parameter>

where parameter defines test for different number of CPUs, e.g. <parameter> = −1,−2,−8,−13,−16,−17,−18 stands for test on single, 8, 16, 32, 56, 64, 128 CPUs.

Code Block
titleCode of time_consuming_algorithm
collapsetrue
import numpy as np
from time import time, sleep

def random_standard(shape=(40,60), mu=200, sigma=25, dtype=np.float64):
    a = mu + sigma*np.random.standard_normal(shape)
    return np.require(a, dtype)

def random_arrays(sh2d = (8*512,1024), dtype=np.float64):
    sh3d = (3,) + sh2d
    return random_standard(shape=sh2d, mu=10, sigma=2, dtype=dtype),\
   sleep 5

 Performance counter stats for 'sleep 5':

            27,322      cache-references:u                                          
             6,778      cache-misses:u         random_standard(shape=sh3d, mu=20,  #   24.808 % of all cache refs    
  sigma=3, dtype=dtype)

def time_consuming_algorithm():
    t01 = time()
    a, b = random_arrays()
    t02 = time()
   473,798 gr1 = a>=11
   cycles:u gr2 = (a>9) & (a<11)
    gr3 = a<=9
    t03 = time()
    a[gr1] -= b[0, gr1]
    a[gr2] -= b[1, gr2]
    a[gr3] -= b[2, gr3]
    t04 = time()
    return (t01, t02, t03, t04)
Code Block
titleCode of the event loop in test-scaling-subproc.py
collapsetrue
def do_algo(cpu=0, cmt='v0'):

    hostname = get_hostname()
     600,974 #cpu_num = psutil.Process().cpu_num()
    print('requested instructionscpu:u%03d' % cpu)

    SAVE_FIGS = True
    #SHOW_FIGS = False
  1.27  insnnevents per= cycle100
    ntpoints =    6
    arrts = np.zeros((nevents,ntpoints), dtype=np.float64)
    140,006  t05_old = time()

    branches:ufor nevt in range(nevents):
        t00 = time()
        times = time_consuming_algorithm()
        cpu_num = psutil.Process().cpu_num()
        #if cpu_num >=16 and cpu_num <=23:
    
    #    print('cpu_num:%03d nevt:%03d time:%.6f CPU_NUM IN WEKA RANGE  64      faults:u[16,23]' % (cpu_num, nevt, dt_sec))
        t05 = time()
        times = (t00,) + times + (t05,)
        arrts[nevt,:] = times
        dt_evt = t05 - t05_old
        
t05_old = t05
        if nevt%10>0: continue
     0   dt_alg =  migrations:u times[4] - times[3]
        dt_in  = times[4] - times[1]
        print('cpu_num:%03d nevt:%03d  times (sec)' % (cpu_num, nevt), \
             ' random arrs: %.6f' % 

(times[2] - times[1]), \
             ' 5.003108172 seconds time elapsed

       0.000833000 seconds user
indeces: %.6f'     % (times[3] - times[2]), \
           0.000250000 seconds sys

{'t_sec'' alg: 1707241305.4141793, 'cache-references': 27322, 'cache-misses': 6778, 'cycles': 473798, 'instructions': 600974, 'branches': 140006, 'faults': 64, 'migrations': 0}

...

%.6f'         % (times[4] - times[3]), \
             ' inside algo: %.6f' % (times[4] - times[1]), \
             ' per event: %.6f'   % dt_evt)
...
further code is ffor saving results and graphics

Results

Code Block
titleResults of the test for 1,8,16,32,56,64,128 CPU - perf response paramteres
collapsetrue
host: sdfmilan216 version: v64
test#     t_sec  ana-4.0.59-py3 [dubrovin@sdfmilan216:~/LCLS/con-py3]$


1 CPU
======
 Performance counter stats for 'python test-scaling-subproc.py -1':

     4,522,410,200      cache-references:u cache-misses cycles       instructions branches     faults       migrations  
000            0        27807(62.49%)
       112,207,635  6624    cache-misses:u   729494       603490  #    2.481 140028% of all cache refs      (62.51%)
 64  224,402,878,245      cycles:u    0
001            5        28247         7140       594354       603471       140023(62.51%)
   428,582,543,872      instructions:u  64          #  0
002  1.91  insn per cycle     10      (62.51%)
  27530  59,430,436,824      branches:u 6794       526625       600130       140006           66            0
003       (62.50%)
    15 2,353,206,592      branch-misses:u 28074         7134 #    3.96% of 584615all branches      603463    (62.50%)
   140022        657,277   67   faults:u         0
004           20        27226         6765       568909       601163 
      140061           650      migrations:u      0
005           25        27563         6878       608405       603761
       140110    657,277      page-faults:u 65            0
006           30        27623         6969      
 585092    2,169,783,808   602011   L1-dcache-load-misses:u    140048           63            0
007           35 (62.50%)
       28016  7,173,374       7208L1-icache-load-misses:u        584639       600074       139998           64      (62.50%)

      0
008   70.762930452 seconds time elapsed

      66.918003000 seconds 40user
       2.380196000 27379seconds sys

8 CPU
======
 Performance counter stats for   6883 'python test-scaling-subproc.py -2':

    35,293,654,947  557356    cache-references:u   600535       140032           65            0
009           45(62.50%)
       675,772,563 28089     cache-misses:u    7155       679594 #    1.915 % 600533of all cache refs    140030  (62.50%)
 1,863,835,416,629      cycles:u  63            0
010           50        27421         6909       546743     (62.50%)
  6034703,408,694,078,315      instructions:u  140022          # 64   1.83  insn per cycle     0
011      (62.50%)
   470,729,321,611  55    branches:u    27773         6839       672210       601042       140016           66       (62.50%)
     0
01218,710,029,709      branch-misses:u      60     #   27316 3.97% of all branches     6914     (62.50%)
  983778       6034614,759,204      faults:u 140020           64            0
013           65        29078         7103
      1105556       600407    0   139998   migrations:u        65            0
014           70        27415         6885
      1289278   4,759,204    601889  page-faults:u     140019           67            0
015           75        27235
    17,164,781,068     6850  L1-dcache-load-misses:u      596144       602625       140021           65        (62.50%)
    0
016    42,407,266       80L1-icache-load-misses:u         27224         6844       595918       600335       140078(62.50%)

      82.107165073 seconds time elapsed

  67   600.726489000 seconds user
      28.169314000 seconds 0sys

017       16 CPU
======
Performance counter stats for 'python test-scaling-subproc.py -8':

    8571,125,012,043      cache-references:u  27391         6826       545500       601040       140014           64 (62.50%)
     2,509,743,885      0
018cache-misses:u           90 #    3.529 % of 27753all cache refs       6913(62.50%)
 4,256,512,072,612      cycles:u 555237       600400       139995           64            0
019           96     (62.50%)
 6,815,210,853,848  27512    instructions:u     6800       546790#    1.60  insn 600342per cycle      140079     (62.50%)
   940,797,592,651   66   branches:u         0
020          101        27818         6651       588694       602190  (62.50%)
    37,401,077,277 140109     branch-misses:u      66     #    3.98% of all 0
021branches          106(62.50%)
        27745 9,874,603      faults:u  6969       603165       603460       140019           63            0
022      
    111        27706     0    6790  migrations:u     553593       603471       140023           65            0
023      
    116     9,874,603   27221      page-faults:u   6801       632050       600059       139993           63            0
024    34,764,585,133      121L1-dcache-load-misses:u        27708         6824       545754       602640       140025  (62.50%)
        82,908,203 64     L1-icache-load-misses:u       0
025          126        27290         6783     (62.50%)

  533678    98.180409648 seconds time 600069elapsed

    1370.175346000 seconds user
 139997    121.864448000 seconds sys

32 CPU
======
 Performance counter stats 65for 'python test-scaling-subproc.py -13':

   140,229,421,945      0
026cache-references:u          131        27309         6880       554409       600977   (62.50%)
    140009 5,022,345,750      cache-misses:u    67        #    0
027     3.582 % of all cache refs     136 (62.50%)
 8,558,410,936,114      28198cycles:u          6748       675396       602626       140022           66            0
028(62.50%)
13,628,360,184,584      instructions:u    141        27231#    1.59  insn per cycle 6980      1156713    (62.50%)
 1,881,291,550,548  602754    branches:u   140054           63            0
029          146        27808        (62.50%)
 6755   74,783,808,615    668027  branch-misses:u     601393      # 140127   3.98% of all branches     64      (62.50%)
      0
030  19,579,143      faults:u  151        27487         6743       618176       600314       140057           64 
           0
031      0    156  migrations:u      27511         6734       610589       600360       140074           64 
        19,579,143   0
032   page-faults:u       161        27664         6937       519508       600397       139995  
    68,615,480,748     65  L1-dcache-load-misses:u           0
033          166        27473         6802 (62.50%)
      583479 163,094,161      601179L1-icache-load-misses:u       140067           67            0
034         (62.50%)

 171     99.279801084 seconds time 27273elapsed

    2763.979749000 seconds user
   6761  246.852789000 seconds sys

56 CPU
======
 Performance 592098counter stats for 'python test-scaling-subproc.py -16':

  600124 245,664,589,385      140003cache-references:u           64            0
035          176        27060   (62.50%)
     5,986,128,102 6672     cache-misses:u  533480       600397   #    1399952.437 % of all cache refs      65(62.50%)
13,462,198,820,573      cycles:u      0
036          181        27516         6836       836405       600068       139996(62.50%)
23,847,765,747,744      instructions:u     64       #    1.77 0
037 insn per cycle       186    (62.50%)
 3,290,927,488,525   26990   branches:u      6592       519081       600135       140007           65            0
038  (62.50%)
   130,897,170,304     192 branch-misses:u       27087    #    3.98% 6599of all branches     578868     (62.50%)
  600395      35,494,247 139993     faults:u      63            0
039          197        27208         6650       502553
       601884       140018   0      migrations:u  67            0
040          202        27340         6935       513824
       600418 35,494,247      140002page-faults:u           66            0
041          207        27134      
   6706119,933,873,577      L1-dcache-load-misses:u  535601       600508       140023           64            0(62.50%)
042       288,403,921   212   L1-icache-load-misses:u     27640         6902       623607       600078       139999    (62.50%)

     108.453630713 seconds 64time elapsed

    5381.177612000       0
043seconds user
     333.903330000 seconds    217        27683 sys

64 CPU
======
Performance counter stats for 'python test-scaling-subproc.py -17':

   281,639,175,978      cache-references:u  7008       696304       602742       140052           65            0
044(62.50%)
     8,968,404,974     222 cache-misses:u       27462     #    67513.184 % of all cache refs  502284    (62.50%)
16,140,364,752,053   602629   cycles:u    140022           65            0
045          227        27673         6964(62.50%)
27,256,133,511,829       693908instructions:u       600078     #  139999  1.69  insn per cycle     64            0
046 (62.50%)
 3,761,710,111,186      branches:u   232        27886         7068       663126       600764       140102           64(62.50%)
   149,569,155,086      branch-misses:u   0
047        #  237  3.98% of all branches   27642       (62.50%)
  7073      39,148,442 601080     faults:u  601041       140015           65            0
048          242        27093  
       6793       491163   0    600176  migrations:u     140017           65            0
049          247        27628  
       6886 39,148,442      557406page-faults:u       603461       140020           65            0
050          252
   137,584,278,754     27014  L1-dcache-load-misses:u        6783       521803       600121       140003          (62.50%)
 65      330,750,296      0
051L1-icache-load-misses:u          257        27689         6783       571768     (62.50%)

  603483   120.688547006 seconds time elapsed

 140028   6274.688233000 seconds user
     484.406164000 67seconds sys

120 CPU
=======
 Performance counter stats for 'python     0
052test-scaling-subproc.py -18':

   532,229,037,371      cache-references:u 262        27850         6952       668413       603464       140023     (62.50%)
    14,227,944,434  67    cache-misses:u        0
053    #    2.673 % 267of all cache refs     27859  (62.50%)
29,404,359,241,173      cycles:u  6789       628276       601903       140023           67            0
054        (62.50%)
51,095,884,028,391  272    instructions:u    27749        # 6733   1.74  insn per 584079cycle       603621    (62.50%)
 7,053,547,766,317  140068    branches:u       65            0
055          277        28189         7124      (62.50%)
 569216  280,479,284,507     601049 branch-misses:u      140017     #    3.98% of 65all branches           0
056(62.50%)
        73,250,012  282    faults:u    27601         6917       574187       603782       140117           66            0
057          287       0 27928     migrations:u    6848       536114       600985       140010           65            0
058        73,250,012  293    page-faults:u    27228         6742       597382       600396       139994           64   
   260,078,672,869      0
059L1-dcache-load-misses:u          298        27858         6830       626275       600418       140002           66            0
060(62.50%)
       618,858,635   303   L1-icache-load-misses:u     27106         6749       592436       600225       140030    (62.50%)

     119.736692035 seconds 66time elapsed

   11628.275939000        0
061seconds user
     843.423292000     308        27146         6908      1148965       601180       140068           67            0
062          313        26963         6688       578839       600975       140007           65            0
063          318        29770         6771       574128       603462       140021           65            0
064          323        27643         6972       539701       600999       140011           66            0
065          328        28824         6874       655647       601880       140014           64            0
066          333        27006         6751       574385       600121       140003           65            0
067          338        27562         6912      1133050       602740       140050           64            0
068          343        27802         7111       672730       602625       140021           65            0
069seconds sys

Summary

number

of CPU

cache-

references

cache-

misses

cyclesinstructionsbranches

branch-

misses

faultspage-faults

L1-dcache-

load-misses

L1-icache-

load-misses

L1-icache

Ratio N/1

cmt
14,522,410,200112,207,635224,402,878,245428,582,543,87259,430,436,8242,353,206,592657,277657,2772,169,783,8087,173,3741
835,293,654,947675,772,563


18,710,029,709

17,164,781,06842,407,2665.9
1671,125,012,0432,509,743,885


37,401,077,277

34,764,585,13382,908,20311.6
32140,229,421,9455,022,345,750


74,783,808,615

68,615,480,748163,094,16122.7
56245,664,589,3855,986,128,102


130,897,170,304

119,933,873,577288,403,92140.2
64281,639,175,9788,968,404,974


149,569,155,086

137,584,278,754330,750,29646.1
120

532,229,037,371

14,227,944,43429,404,359,241,17351,095,884,028,3917,053,547,766,317280,479,284,50773,250,01273,250,012260,078,672,869618,858,63586.2













2024-02-07 Test of milano216 host with command perf

Description

Running perf with mpirun on a single and 80 CPUs:

perf stat -e cache-references,cache-misses,cycles,instructions,branches,branch-misses,faults,migrations,page-faults,L1-dcache-load-misses,L1-icache-load-misses,dTLB-load-misses,iTLB-load-misses     mpirun -n 1 python  Detector/examples/test-scaling-mpi.py

perf stat -e cache-references,cache-misses,cycles,instructions,branches,branch-misses,faults,migrations,page-faults,L1-dcache-load-misses,L1-icache-load-misses,dTLB-load-misses,iTLB-load-misses     mpirun -n 80 python  Detector/examples/test-scaling-mpi.py

Code Block
titleCode of the time_consuming_algorithm
collapsetrue
import numpy as np
from time import time

def random_standard(shape=(40,60), mu=200, sigma=25, dtype=np.float64):
    a = mu + sigma*np.random.standard_normal(shape)
    return np.require(a, dtype)

def random_arrays(sh2d = (8*512,1024), dtype=np.float64):
    sh3d = (3,) + sh2d
    return random_standard(shape=sh2d, mu=10, sigma=2, dtype=dtype),\
          348      random_standard(shape=sh3d, mu=20, sigma=3, dtype=dtype)

def time_consuming_algorithm():
   28163 a, b = random_arrays()
    gr1 7089= a>=11
    gr2 1065588= (a>9) & (a<11)
    601900gr3 = a<=9
    t0_sec 140023= time()
    a[gr1] -= b[0, gr1]
   66 a[gr2] -= b[1, gr2]
    a[gr3]    0
070-= b[2, gr3]
    return time() - t0_sec
Code Block
titleCode of the event loop in Detector/examples/test-scaling-mpi.py
collapsetrue
from mpi4py import 353MPI
comm = MPI.COMM_WORLD
rank = comm.Get_rank()
size = comm.Get_size()

hostname  27878         6874       695158       600071       139999           67            0
071          358= get_hostname()
cpu_num = psutil.Process().cpu_num()
print('rank:%02d cpu_num:%03d size:%02d' % (rank, cpu_num, size))

ranks = (0, 10, 20, 30, 40, 50, 60, 70)
SAVE_FIGS = True
SHOW_FIGS = False
nevents = 100
arrts = np.zeros((nevents, size), dtype=np.float64)

for nevt in range(nevents):
    dt_sec = time_consuming_algorithm()
    arrts[nevt,rank] = dt_sec  # dt_sec = time()-t0_sec
    cpu_num = psutil.Process().cpu_num()
    if cpu_num >=16 and cpu_num <=23:
         27010         6839   print('rank:%02d cpu_num:%03d nevt:%03d time:%.6f CPU_NUM IN WEKA RANGE [16,23]' % (rank, cpu_num, nevt, dt_sec))
    538832if nevt%10>0: continue
    print('rank:%02d cpu_num:%03d nevt:%03d time:%.6f' % (rank, cpu_num, nevt, dt_sec))
...
somme graphics for array arrts

Results

Code Block
titleResults for perf with mpirun on a single and 80 CPUs
collapsetrue
ana-4.0.59-py3 [dubrovin@sdfmilan216:~/LCLS/con-py3]$ perf stat -e cache-references,cache-misses,cycles,instructions,branches,branch-misses,faults,migrations,page-faults,L1-dcache-load-misses,L1-icache-load-misses,dTLB-load-misses,iTLB-load-misses 603493       140031           67      mpirun -n 1 python   0
072          363   Detector/examples/test-scaling-mpi.py 
...
Performance counter stats for 'mpirun -n 1 python Detector/examples/test-scaling-mpi.py':

     4,448,830,552     27371 cache-references:u        6882       536044       602622       140021           66    (50.00%)
        0
07390,374,312      cache-misses:u    368        27421#    2.031 % of all cache 6742refs      (50.00%)
 519800   222,814,516,280    600970  cycles:u     140005           64            0
074          373        26952         6766(50.02%)
   426,700,282,993    943875  instructions:u     601155       140043#    1.92  insn per cycle   63        (50.01%)
    0
07558,876,394,584      branches:u    378        27974         6979       978668       600529       140030          (50.01%)
 65    2,343,687,188      branch-misses:u  0
076         # 383   3.98% of all branches  27233        (50.01%)
 6846       969595   635,183    600305  faults:u     140068           65            0
077          388        27435      
   6864      1002256       601190 0      140055migrations:u           65            0
078          394        26603       
  6687       512388  635,183     600544 page-faults:u      140035           66            0
079          399        27703
     2,158,358,417    7229  L1-dcache-load-misses:u     826644       600334       140064           66            0
080(50.00%)
         5,694,036 404     L1-icache-load-misses:u   30357         7076       697640       600456       140012      (49.99%)
     68    4,282,821        0
081dTLB-load-misses:u           409        27593         7014       542822       600409  (49.99%)
     140000      890,671     67 iTLB-load-misses:u           0
082          414        28354         7250      (50.00%)

 732640     73.297275789 seconds 601181time elapsed

      14006969.795728000 seconds user
       2.318007000  68     seconds sys

ana-4.0.59-py3 [dubrovin@sdfmilan216:~/LCLS/con-py3]$ perf stat -e cache-references,cache-misses,cycles,instructions,branches,branch-misses,faults,migrations,page-faults,L1-dcache-load-misses,L1-icache-load-misses,dTLB-load-misses,iTLB-load-misses     mpirun -n 0
08380 python         419        27387Detector/examples/test-scaling-mpi.py
...
 Performance counter stats for 'mpirun -n 80 python Detector/examples/test-scaling-mpi.py':

   349,526,509,383      cache-references:u    7071       563924       601176       140064           64            0
084(50.01%)
     5,932,480,814     424 cache-misses:u       27977     #    72751.697 % of all cache refs  629946    (50.00%)
18,768,444,974,036   601886   cycles:u    140016           64            0
085          429        27340         6741(50.00%)
33,983,153,714,284      instructions:u 575039       602920    #   140103 1.81  insn per cycle      64     (49.99%)
 4,684,730,635,234      0
086branches:u          434        27795         7084       619973       602620       140019    (49.99%)
   186,649,297,019    64  branch-misses:u          0
087 #    3.98% of all branches  439        27463(50.00%)
        52,121,421 6668     faults:u  557690       600975       140007           65            0
088          444   
     28083         7166   0    663865  migrations:u     600961       140003           65            0
089          449   
     27561   52,121,421      6870page-faults:u       589958       600077       139998           63            0
090   
   171,500,392,922    454  L1-dcache-load-misses:u      27670         7053       616345       601883       140017   (50.00%)
       267,672,856 67     L1-icache-load-misses:u       0
091          459        27216         6903     (50.00%)
  564047     339,145,247  600962    dTLB-load-misses:u   140004           65            0
092          464        26729(50.01%)
        69,780,394 6814     iTLB-load-misses:u 1162124       600065       139996           65            0
093      (50.01%)

    469  92.952500273 seconds time elapsed

   27246 6501.353593000 seconds user
     410.844719000 6698       467172       600068       139996           64            0
094          474        27543         6816       487590       600127       140006           67            0
095          479        27374         6754       526273       600068       139996           64            0
096          485        28285         7204       661196       600094       140005           66            0
097          490        27550         6901       578701       602648       140030           67            0
098          495        28363         7337       649845       600533       140031           64            0
099          500        27322         6778       473798       600974       140006           64            0

Image RemovedImage RemovedImage RemovedImage RemovedImage RemovedImage RemovedImage RemovedImage Removed

Code Block
title120 CPU
collapsetrue
 host: sdfmilan216 version: v120
test#     t_sec        cache-references cache-misses cycles       instructions branches     faults       migrations  
000            0        27931         6807       740397       603462       140021           65            0
001            5        29421         7303       889000       603481       140026           65            0
002           10        28498         7097       754015       601047       140018           66            0
003           15        27902         6810       624262       602619       140018           63            0
004           20        27691         7099       643281       603499       140028           67            0
005           25        26869         6721       613941       600397       139995           65            0
006           30        28939         6813       607238       601888       140018           66            0
007           35        27753         6898       601369       600302       140068           66            0
008           40        28215         6828       560459       600186       140019           64            0
009           45        29477         6852       663561       600537       140034           65            0
010           50        27399         6776       565464       600208       140028           68            0
011           55        27550         6890       638548       600971       140006           65            0
012           60        27572         6834       572234       600249       140041           65            0
013           65        28684         6880       633639       601042       140016           65            0
014           70        27078         6702       577830       602621       140020           65            0
015           75        27944         6998       629551       601880       140014           64            0
016           80        27577         7039       589876       601055       140019           64            0
017           85        28001         6929       603365       600985       140010           65            0
018           90        27183         6926       621575       602631       140024           66            0
019           95        28359         6770       590508       600395       139993           63            0
020          100        27652         6960       595630       603471       140023           65            0
021          106        30539         6764      1159692       600961       140003           65            0
022          112        27242         7972      1147605       600962       140004           66            0
023          118        27991         7139      1206382       601045       140016           65            0
024          123        26929         7209      1520627       603480       140025           64            0
025          129        27643         7187      1183727       600399       139997           66            0
026          135        26617         6810      1351490       600961       140003           64            0
027          141        26366         8105      1419415       600404       139998           66            0
028          146        23734         8431      1063626       601897       140020           65            0
029          152        24920         8596      1202865       602163       140100           65            0
030          158        28609         8131      1380905       601883       140017           66            0
031          163        25033         8436      1031466       600062       139996           66            0
032          169        26950         6785      1238519       603482       140027           66            0
033          175        27594         6894      1036822       601891       140018           65            0
034          181        31002         6904      1043149       600961       140003           64            0
035          186        31024         6777      1041242       600065       139996           65            0
036          192        26870         6779      1134167       600075       139999           65            0
037          197        28381         7076       917260       600619       140064           64            0
038          202        27976         6872       599000       603471       140023           65            0
039          207        28855         7121       702819       602764       140057           64            0
040          212        27711         6912       520624       600323       140059           63            0
041          217        27061         6981       541981       600143       140009           65            0
042          222        27226         6645       575722       602641       140026           65            0
043          227        27115         6713       525871       600127       140006           66            0
044          232        27806         6832      1065500       600127       140003           63            0
045          237        27357         6856      1065612       600060       139994           64            0
046          242        26999         6811      1046968       600975       140007           65            0
047          247        27767         7046       545133       600397       139995           65            0
048          253        27546         6946       839604       600063       139997           66            0
049          258        27782         6750       866929       602632       140025           68            0
050          263        27729         7086       726946       600410       140001           68            0
051          268        27340         6826       976370       602623       140022           66            0
052          273        27450         6646       872728       600120       140002           63            0
053          278        27269         6861       860563       601043       140017           66            0
054          283        27358         6871      1017536       600314       140057           63            0
055          288        27165         6832       805321       601880       140014           63            0
056          293        27455         6875       926200       602652       140030           65            0
057          298        27200         6704       909596       602317       140142           65            0
058          303        27761         6980       859384       602775       140066           65            0
059          308        27801         7020       682316       600065       139996           65            0
060          313        27593         6991       902030       600413       140001           67            0
061          318        26614         6645       792663       600187       140020           65            0
062          323        27427         6647      1081342       602633       140026           68            0
063          328        27047         6730       868243       600063       139997           67            0
064          333        27317         6857       894792       601157       140045           65            0
065          338        28685         6993       608994       600524       140029           64            0
066          344        24688         8446      1258195       603782       140117           65            0
067          350        25507         7527      1314785       602741       140051           64            0
068          355        26946         6880      1288179       603461       140020           65            0
069          361        30221         6737       834593       602622       140021           65            0
070          367        24845         7983      1305169       600975       140007           64            0
071          372        25235         7542      1210487       602641       140026           65            0
072          378        27787         7044      1113910       600413       140001           67            0
073          384        25338         8136      1221054       601050       140018           65            0
074          390        24565         8029      1299879       600327       140063           66            0
075          395        27326         6989      1123870       600334       140064           66            0
076          401        25591         9659      1485770       602636       140025           65            0
077          407        25937         9379      1242144       602766       140059           65            0
078          412        25349         8464      1118847       601049       140016           63            0
079          418        26772         7380      1246516       600764       140102           63            0
080          424        27522         6988      1671984       600214       140027           64            0
081          430        27465         8689      1154090       601044       140015           64            0
082          435        30807         6815      1084313       601900       140020           64            0
083          440        28580         7260       597202       600399       139997           67            0
084          445        28788         7039       767798       601898       140021           66            0
085          450        28948         7032       733130       603463       140022           65            0
086          455        28059         7309       536819       602625       140021           65            0
087          460        27062         6525       630170       601053       140017           63            0
088          466        27254         6773       549007       602622       140021           66            0
089          471        26982         6652       589272       601042       140016           66            0
090          476        26762         6858      1026028       600418       140002           66            0
091          481        27963         7002       658324       601169       140050           66            0
092          486        27989         7013       605852       601891       140018           65            0
093          491        28111         7131       611524       603493       140031           67            0
094          496        27635         7109       598087       602631       140024           67            0
095          501        27312         6889       596459       600071       139999           67            0
096          506        28674         7153       589621       600125       140004           65            0
097          511        27867         7041       603441       601893       140020           67            0
098          516        27463         6819       493625       600067       139998           66            0
099          521        28909         7355       672193       603462       140021           65            0 

Image RemovedImage RemovedImage RemovedImage RemovedImage RemovedImage RemovedImage Removed

References

seconds sys

Summary

number

of mpi cores

cache-

references

cache-

misses

cyclesinstructionsbranches

branch-

misses

faultspage-faults

L1-dcache-

load-misses

L1-icache-

load-misses

dTLB-

load-misses

iTLB-

load-misses

cmt
14,448,830,55290,374,312222,814,516,280426,700,282,99358,876,394,5842,343,687,188635,183635,1832,158,358,4175,694,0364,282,821890,671
80349,526,509,3835,932,480,81418,768,444,974,03633,983,153,714,2844,684,730,635,234186,649,297,01952,121,42152,121,421171,500,392,922267,672,856339,145,24769,780,394
Ratio (80)/(1)79.465.784.179.679.579.782.082.079.347.079.278.4


2024-02-09 Test of milano216 host with command perf

Description

Use commands with changed list of counters like 

perf stat -e stalled-cycles-backend,stalled-cycles-frontend,ls_l1_d_tlb_miss.all,l1_dtlb_misses,l1_data_cache_fills_all,bp_l1_tlb_miss_l2_tlb_miss.if2m,bp_l1_tlb_miss_l2_tlb_miss,l2_dtlb_misses,l2_itlb_misses  python test-scaling-subproc.py -8

Convert perf output to dict, present results in table.

Summary

(*) CPU numbers excludes weka FS.

number of CPU         

stalled-cycles-backend

←Ratio N/1

stalled-cycles-frontend←Ratio N/1

ls_l1_d_tlb_

miss.all

←Ratio N/1

l1_dtlb_

misses

←Ratio N/1

l1_data_cache_

fills_all 

←Ratio N/1

bp_l1_tlb_miss

_l2_tlb_miss.if2m

←Ratio N/1

bp_l1_tlb_miss_

l2_tlb_miss

←Ratio N/1

l2_dtlb_

misses

←Ratio N/1l2_itlb_misses←Ratio N/1
1143,8286141230,987724133,227437132,84519312179,46971413,7011769,30914,8333841719,0261
82105,881833153421,10835915172,7790305.2173,5082125.318216,5648748.325,6066.96124,8977.931,7193006.55591,8217.8
168796,313234  618018,69189035327,8927539.9326,3371839.934551,34106015.855,33114.812467,9761668,2272211410605,35214.7
24*10413,149941  7210519,49087046491,67324814.8490,56609314.951539,38429723.678,4332117889,6212396,9224692015177,11621.1
3217251,05529712013858,55495560671,04724720.2666,23099720.368736,84216831.5105,8742923936,97831135,3222502821599,94030.0
56*

17892,504080

124

24120,4931581041136,77853834.21135,44832534.6120696,77595255.3178,0824842679,84355234,4982544838164,17153
64

27304,844238

190

27697,5220171201258,99972937.91258,03135438.3141469,10904664.9201,3305450957,21866258,6096325343825,04260.9
120*

45388,735746

316

46279,2646612002382,06582071.62376,50710673.3264016,453328121375,69910293410,817121488,30815510178261,952109

References