You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 8 Next »

Update on the HZZ 250 GeV +80/-30 250/fb Analysis for 17 September 2013:


Analysis:

  • divide into 4 jet / 6 jet topologies
  • apply preselection depending on topology
  • train/apply TMVA's
  • validate with cut table
  • check remaining backgrounds

Distributions before preselection with only a cut on the reconstructed Higgs mass:







Preselection:

Evis<140.0: (4 Jet category)
#y34>0.0#hmass>95. && hmass<140.
#PTvisJETS>25.0 && PTvisJETS<70.0
#nTrks>5.
#ej1<120.

Evis>140: (6 jet category)
#y34>0.0
#hmass>90. && hmass<160.
#ej1<120.
#jetthrust<0.98
#nTrks>5

Distributions after preselection:

aaa


(Note: the lepton pair mass plot has the entries at less than 10 GeV randomly set to 10 and 130 GeV so that the signal peak can be seen.)

The TMVA variables:

  1. Evis JETS
  2. Mvis JETS
  3. PTvis JETS
  4. B Likeness
  5. C likeness
  6. Number of HE Electrons
  7. Higgs Mass
  8. Z energy
  9. Zstr energy
  10. Z cos theta
  11. Zstr cos theta
  12. Z mass
  13. Zstr mass
  14. azimuthal separation of Z jets
  15. thrust
  16. Number Charged Tracks
  17. Number of electrons
  18. Number of muons
  19.  
  20. ----NEW VARIABLES:
  1. y34
  2. y56
  3. lepton pair (PDG ID1 = -ID2) mass closest to Mz
  4. jet pair mass closest to mW
  5. jet pair1 - mW

    +

    jet pair2 - mW

Performance of different MVA options:

BEFORE

 [neal@localhost weights]$ grep -A 8 -i optimal- ../hzzv14-presel-v4-new-common0410results.txt
 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
 -- ---------------------------------------------------------------------------------------------
 ---       Cuts:  ( 664.0744,1312202.5)      -0.0050            0         0         0        0        0
 --- Likelihood:  ( 664.0744,1312202.5)       1.0000     0.728991  644.0848  779980.1   0.9699   0.5944
 ---     Fisher:  ( 664.0744,1312202.5)       0.0120      1.70335  258.2099  22721.13   0.3888  0.01732
 ---       BDTG:  ( 664.0744,1312202.5)      -0.9823      1.17754  538.5353  208621.3    0.811    0.159
 ---        BDT:  ( 664.0744,1312202.5)      -0.0462      3.42709  100.6751  762.2891   0.1516 0.0005809
 -- ---------------------------------------------------------------------------------------------

NOW:

     if ((fp = fopen("HZZ-TMVA-vars-ffh_zz_all_SM_background_+80e-_-30e+-v6-mini.txt","r"))==NULL) exit(0);
[neal@localhost test]$ source storeresultsHZZv4.sh hzz-withlep-withmW-n2-Mhiggspresel-plusptntrkp8g16y34nvtx


--- ==================================================================================================
--- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
--- --------------------------------------------------------------------------------------------------
---       Cuts:  (1035.8644,  1196975)      -0.0050            0         0         0        0        0
--- Likelihood:  (1035.8644,  1196975)       1.0000      1.66886  315.8824  35510.98   0.3049  0.02967
---     Fisher:  (1035.8644,  1196975)       0.0047      2.38122  586.8862  60157.76   0.5666  0.05026
---       BDTG:  (1035.8644,  1196975)      -0.9795        2.625  293.8675  12238.81   0.2837  0.01022
---        BDT:  (1035.8644,  1196975)      -0.0427       4.9889  146.0711  711.2005    0.141 0.0005942
--- --------------------------------------------------------------------------------------------------



Spent a lot of time playing with different preselection cuts and then settled for only making cuts where
I was cutting almost no signal away and then letting the MVA's do the rest.

Some signals are just not reasonable to try to select:

1) decays like l+l-H where H->ffnunu
Even in cases where the ff are leptons the initial l+l- are more energetic
and those were being selected.




Changed to checking all opposite signed lepton pairs for consistency with coming
from a Z.

 

 

HZZ generated sample stats

 

 

 

 

 

 

 

 

 

events

weight

 

I108056.001.stdhep

10000

20000

1.29E-004

e1e1h_zz.Gwhizard-

I108055.001.stdhep

10000

20000

4.36E-003

e1e1h_zz.Gwhizard-

I108054.001.stdhep

10000

20000

4.11E-004

e1e1h_zz.Gwhizard-

I108053.001.stdhep

10000

20000

2.70E-005

e1e1h_zz.Gwhizard-

I108058.001.stdhep

10000

20000

4.29E-003

e2e2h_zz.Gwhizard-

I108057.001.stdhep

10000

20000

4.00E-004

e2e2h_zz.Gwhizard-

I108060.001.stdhep

10000

20000

4.28E-003

e3e3h_zz.Gwhizard-

I108059.001.stdhep

10000

20000

4.00E-004

e3e3h_zz.Gwhizard-

 

 

Sum =

1.43E-002

10.40%

I108062.001.stdhep

10000

20000

2.54E-002

nnh_zz.Gwhizard-1_

I108061.001.stdhep

10000

20000

3.01E-003

nnh_zz.Gwhizard-1_

 

 

Sum =

2.84E-002

20.68%

I108064.001.stdhep

10000

20000

8.67E-002

qqh_zz.Gwhizard-1_

I108063.001.stdhep

10000

20000

8.08E-003

qqh_zz.Gwhizard-1_

 

 

Sum =

9.47E-002

68.93%

 

 

 

 

 

 

 

 

 

 

 

 

All =

1.37E-001

 

2) There was also a simple programming error which I've already fixed.


NEW)NEW:">Cut table for BDT: (NEW)NEW:

cuts__ (Cut Name            ):       all   others       2f       4f       6f       aa       1f       3f       5f        q     e+mu      tau       nu  Signif.
cut #0 (all                 ): 7.769e+07   625000 14507062  1019938        0 54073784        0  7464875        0      948       91       45      262      0.15  Delta(sig*BR) = 6.548594 +/- 0.046879
cut #1 (y34>0.y34>0.         ): 7.705e+07   625000 14449800  1018312        0 53568784        0  7387500        0      947       91       45      262      0.15  Delta(sig*BR) = 6.523157 +/- 0.046724
cut #2 (95.<hmass<140. 95.<hmass<140. ): 1.257e+07   150000 10264438   825238        0   387992        0   938000        0      901       86       43      213      0.35  Delta(sig*BR) = 2.854636 +/- 0.021075
cut #3 (25.<PTvisJETS<70 PTvisJETS no cut ): 5.637e+06    25000  4545812   744212        0     8750        0   312125        0      866       84       37      180      0.49  Delta(sig*BR) = 2.033046 +/- 0.015436
cut #4 (nTrks>5 nTrks>5      ): 3.826e+06        0  2957175   679838        0     8750        0   179500        0      866       81       36      176      0.59  Delta(sig*BR) = 1.688536 +/- 0.012688
cut #5 (jetthrust no cut jetthrust < 0.98 ): 3.826e+06        0  2957175   679838        0     8750        0   179500        0      866       81       36      176      0.59  Delta(sig*BR) = 1.688536 +/- 0.012688
cut #6 (ej1<120. ej1<120.   ): 2.144e+06        0  1480162   539312        0     8750        0   114500        0      844       78       35      176      0.77  Delta(sig*BR) = 1.292336 +/- 0.009952
cut #7 (MVA                 ): 9.252e+02        0       50      712        0        0        0        0        0       79       27        3       53      5.35  Delta(sig*BR) = 0.186991 +/- 0.010382

Unknown macro: {if (NF==18) print $3,$7,$15}

' | sort -n | uniq -c | awk '

Unknown macro: {if (NF==4) aa+=($1*$4);print $1*$4"t"$0}

END

Unknown macro: {print "Sum of weights = "aa}

' | sort -k 1,1 -n
Sum of weights = 762,5
12,5          1 106561 4f_sz 12,500000
12,5          1 106564 4f_sw 12,500000
12,5          1 106573 4f_zz 12,500000
12,5          1 106580 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106551 4f_ww 12,500000
25            2 106562 4f_sz 12,500000
37,5          3 106575 4f_zz 12,500000
37,5          3 106608 2f_z_ 12,500000
50            4 106578 4f_ww 12,500000
62,5          5 106572 4f_sz 12,500000
100           8 106574 4f_zz 12,500000
125          10 106577 4f_ww 12,500000
237,5        19 106576 4f_zz 12,500000

OLD)OLD:">Cut table for BDT: (OLD)OLD:

isigall= 1345.984602 isigpresel= 1460.101057 igood= 114.116455 ibad= 762.500000
intot= 77692005.984714 ipre= 876.616455
cuts__ (Cut Name            ):       all   others       2f       4f       6f       aa       1f       3f       5f        q     e+mu      tau       nu  Signif.
cut #0 (all                 ): 7.769e+07   625000 14507062  1019938        0 54073784        0  7464875        0      948       91       45      262      0.15  Delta(sig*BR) = 6.548594 +/- 0.046879
cut #1 (20.<PTvisJETS       ): 1.283e+07    37500  5525512   746188        0  2968250        0  3546875        0      806       69       44      259      0.33  Delta(sig*BR) = 3.042092 +/- 0.024172
cut #2 (EvisJETS<220.       ): 4.637e+06    25000  3287550   514400        0   130196        0   678875        0      608       42       41      219      0.42  Delta(sig*BR) = 2.365748 +/- 0.020248
cut #3 (50.<hmass<140.      ): 3.306e+06        0  2542562   378675        0    60426        0   323875        0      536       38       35      206      0.45  Delta(sig*BR) = 2.233245 +/- 0.019503
cut #4 (||dph|3.14159|>0.15 ): 2.772e+06        0  2083912   364150        0    57097        0   266000        0      514       36       33      202      0.47  Delta(sig*BR) = 2.120439 +/ 0.018965
cut #5 (nTrks>10            ): 1.426e+06        0  1136375   169088        0    30283        0    89875        0      466       10       19      177      0.56  Delta(sig*BR) = 1.777889 +/- 0.017994
cut #6 (jetthrust<0.99      ): 1.325e+06        0  1044700   165588        0    30033        0    84125        0      463       10       19      173      0.58  Delta(sig*BR) = 1.730704 +/- 0.017728
cut #7 (ej1>40.             ): 1.313e+06        0  1035112   164625        0    29215        0    83250        0      462       10       19      173      0.58  Delta(sig*BR) = 1.725415 +/- 0.017704
cut #8 (MVA                 ): 8.766e+02        0      138      625        0        0        0        0        0       51        0        1       62      3.85  Delta(sig*BR) = 0.259452 +/- 0.015456

Remaining backgrounds: (OLD)$ sed 's/\./,/g' zzhpassing.dat | awk '

'|sort-n|uniq-c|awk'
Unknown macro: {if (NF==4) aa+=($1*$4);print $1*$4"t"$0}

END

Unknown macro: {print "Sum of weights = "aa}

' | sort -k 1,1 -n
Sum of weights = 762,5
12,5          1 106562 4f_sz 12,500000
12,5          1 106566 4f_sw 12,500000
12,5          1 106572 4f_sz 12,500000
12,5          1 106575 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106551 4f_ww 12,500000
25            2 106553 4f_zz 12,500000
25            2 106571 4f_sz 12,500000
25            2 106573 4f_zz 12,500000
37,5          3 106574 4f_zz 12,500000
37,5          3 106578 4f_ww 12,500000
125          10 106608 2f_z_ 12,500000
162,5        13 106576 4f_zz 12,500000
237,5        19 106577 4f_ww 12,500000

Background contributions for Evis>140:


[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '

Unknown macro: {if ($18>=140. && NF==18) print $3,$7,$15}

' | sort -n | uniq -c | awk '

ENDbl">Remaining backgrounds: (NEW)[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '
Unknown macro: {if (NF==18) print $3,$7,$15}

' | sort -n | uniq -c | awk '

Unknown macro: {if (NF==4) aa+=($1*$4);print $1*$4"t"$0}

END

Unknown macro: {print "Sum of weights = "aa}

' | sort -k 1,1 -n
Sum of weights = 762,5
12,5          1 106561 4f_sz 12,500000
12,5          1 106564 4f_sw 12,500000
12,5          1 106573 4f_zz 12,500000
12,5          1 106580 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106551 4f_ww 12,500000
25            2 106562 4f_sz 12,500000
37,5          3 106575 4f_zz 12,500000
37,5          3 106608 2f_z_ 12,500000
50            4 106578 4f_ww 12,500000
62,5          5 106572 4f_sz 12,500000
100           8 106574 4f_zz 12,500000
125          10 106577 4f_ww 12,500000
237,5        19 106576 4f_zz 12,500000

OLD)OLD:">Cut table for BDT: (OLD)OLD:

isigall= 1345.984602 isigpresel= 1460.101057 igood= 114.116455 ibad= 762.500000
intot= 77692005.984714 ipre= 876.616455
cuts__ (Cut Name            ):       all   others       2f       4f       6f       aa       1f       3f       5f        q     e+mu      tau       nu  Signif.
cut #0 (all                 ): 7.769e+07   625000 14507062  1019938        0 54073784        0  7464875        0      948       91       45      262      0.15  Delta(sig*BR) = 6.548594 +/- 0.046879
cut #1 (20.<PTvisJETS       ): 1.283e+07    37500  5525512   746188        0  2968250        0  3546875        0      806       69       44      259      0.33  Delta(sig*BR) = 3.042092 +/- 0.024172
cut #2 (EvisJETS<220.       ): 4.637e+06    25000  3287550   514400        0   130196        0   678875        0      608       42       41      219      0.42  Delta(sig*BR) = 2.365748 +/- 0.020248
cut #3 (50.<hmass<140.      ): 3.306e+06        0  2542562   378675        0    60426        0   323875        0      536       38       35      206      0.45  Delta(sig*BR) = 2.233245 +/- 0.019503
cut #4 (||dph|3.14159|>0.15 ): 2.772e+06        0  2083912   364150        0    57097        0   266000        0      514       36       33      202      0.47  Delta(sig*BR) = 2.120439 +/ 0.018965
cut #5 (nTrks>10            ): 1.426e+06        0  1136375   169088        0    30283        0    89875        0      466       10       19      177      0.56  Delta(sig*BR) = 1.777889 +/- 0.017994
cut #6 (jetthrust<0.99      ): 1.325e+06        0  1044700   165588        0    30033        0    84125        0      463       10       19      173      0.58  Delta(sig*BR) = 1.730704 +/- 0.017728
cut #7 (ej1>40.             ): 1.313e+06        0  1035112   164625        0    29215        0    83250        0      462       10       19      173      0.58  Delta(sig*BR) = 1.725415 +/- 0.017704
cut #8 (MVA                 ): 8.766e+02        0      138      625        0        0        0        0        0       51        0        1       62      3.85  Delta(sig*BR) = 0.259452 +/- 0.015456

Remaining backgrounds: (OLD)$ sed 's/\./,/g' zzhpassing.dat | awk '

' | sort -n | uniq -c | awk '
Unknown macro: {if (NF==4) aa+=($1*$4);print $1*$4"t"$0}

END

Unknown macro: {print "Sum of weights = "aa}

' | sort -k 1,1 -n
Sum of weights = 762,5
12,5          1 106562 4f_sz 12,500000
12,5          1 106566 4f_sw 12,500000
12,5          1 106572 4f_sz 12,500000
12,5          1 106575 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106551 4f_ww 12,500000
25            2 106553 4f_zz 12,500000
25            2 106571 4f_sz 12,500000
25            2 106573 4f_zz 12,500000
37,5          3 106574 4f_zz 12,500000
37,5          3 106578 4f_ww 12,500000
125          10 106608 2f_z_ 12,500000
162,5        13 106576 4f_zz 12,500000
237,5        19 106577 4f_ww 12,500000

Background contributions for Evis>140:


[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '

Unknown macro: {if ($18>=140. && NF==18) print $3,$7,$15}

' | sort -n | uniq -c | awk '

END
Unknown macro: {print "Sum of weights = "aa}

' | sort -k 1,1 -n
Sum of weights = 187,5
12,5          1 106551 4f_ww 12,500000
12,5          1 106577 4f_ww 12,500000
25            2 106553 4f_zz 12,500000
25            2 106573 4f_zz 12,500000
25            2 106576 4f_zz 12,500000
37,5          3 106574 4f_zz 12,500000
50            4 106608 2f_z_ 12,500000



h2. Background contributions for Evis<140: (OLD)



!variables_id_c2-hzzrecheck1-evislt140.png|border=1!

[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '

Unknown macro: {if ($18<140. && NF==18) print $3,$7,$15}

' | sort -n | uniq -c | awk '

Unknown macro: {if (NF==4) aa+=($1*$4);print $1*$4"t"$0}

END

' | sort -k 1,1 -n

Sum of weights = 575
12,5 1 106551 4f_ww 12,500000
12,5 1 106562 4f_sz 12,500000
12,5 1 106566 4f_sw 12,500000
12,5 1 106572 4f_sz 12,500000
12,5 1 106575 4f_zz 12,500000
12,5 1 106607 2f_z_ 12,500000
25 2 106571 4f_sz 12,500000
37,5 3 106578 4f_ww 12,500000
75 6 106608 2f_z_ 12,500000
137,5 11 106576 4f_zz 12,500000
225 18 106577 4f_ww 12,500000 <***************


— Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
— --------------------------------------------------------------------------------------------------
---       Cuts:  (309.89453,  1128865)      -0.0050            0         0         0        0        0
— Likelihood:  (309.89453,  1128865)       1.0000     0.378385  299.2573  625191.7   0.9657   0.5538
---     Fisher:  (309.89453,  1128865)       0.0088      1.63217  150.0501  8301.587   0.4842 0.007354
---       BDTG:  (309.89453,  1128865)      -0.9950      1.40141  134.4158  9065.234   0.4337  0.00803
---        BDT:  (309.89453,  1128865)       0.0177      3.63521   38.5486  73.90136   0.1244 6.547e-05
— --------------------------------------------------------------------------------------------------

after raising reco higgs mass cut from 50 GeV to 100 GeV:

 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
— --------------------------------------------------------------------------------------------------
---       Cuts:  (209.30833,    51800)      -0.0050            0         0         0        0        0
— Likelihood:  (209.30833,    51800)       0.9105      1.08931  191.9554  30860.64   0.9171   0.5958
---     Fisher:  (209.30833,    51800)       0.0176      1.78558  116.8125  4162.945   0.5581  0.08037
---       BDTG:  (209.30833,    51800)      -0.5098      1.00598  204.0195  40926.24   0.9747   0.7901
---        BDT:  (209.30833,    51800)       0.0361      2.92564   72.9263  548.4119   0.3484  0.01059
— --------------------------------------------------------------------------------------------------


Evis<140 GeV and only idrup = 106577 for the background: (OLD)



Plans:

  • No labels