Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin
Table of Contents
outlinetrue

Update on the HZZ 250 GeV +80/-30 250/fb Analysis for 17-19 September 2013:

...

...

Analysis Procedure:

  • divide into 4 jet / 6 jet topologies
    • this is ZH with H->ZZ* , Z-> nn,ll,qq
  • apply preselection depending on topology
  • train/apply TMVA'sTMVAs
  • validate with cut table
  • check remaining backgrounds

Distributions before preselection with only a cut on the reconstructed Higgs mass:

...





...

Preselection:

Simplified and making only clean cuts ...

Code Block
Evis<140.0: (4 Jet category)
#y34>0.0#hmass>95. && hmass<140.
#PTvisJETS>25.0 && PTvisJETS<70.0
#nTrks>5.
#ej1<120.

Evis>140: (6 jet category)
#y34>0.0
#hmass>90. && hmass<160.
#ej1<120.
#jetthrust<0.98
#nTrks>5

Distributions after preselection:

aaa


(Note: the lepton pair mass plot has the entries at less than 10 GeV randomly set to 10 and 130 GeV so that the signal peak can be seen.)

The TMVA variables:

  1. Evis JETS
  2. Mvis JETS
  3. PTvis JETS
  4. B Likeness
  5. C likeness
  6. Number of HE Electrons
  7. Higgs Mass
  8. Z energy
  9. Zstr energy
  10. Z cos theta
  11. Zstr cos theta
  12. Z mass
  13. Zstr mass
  14. azimuthal separation of Z jets
  15. thrust
  16. Number Charged Tracks
  17. Number of electrons
  18. Number of muons 

----NEW VARIABLES:

  1. y34
  2. y56
  3. lepton pair (PDG ID1 = -ID2) mass closest to Mz
  4. jet pair mass closest to mW
  5. jet pair1 - mW

    +

    jet pair2 - mW

Performance of different MVA options:

BEFORE

Some signals are just not reasonable to try to select:

1) decays like l+l-H where H->ffnunu
Even in cases where the ff are leptons the initial l+l- are more energetic
and those were being selected. Image Added
Image Added
Image Added

eeH, H -> cc ee
Image Added

nu nu H, H -> nu nu nu nu
Image Added

nu nu H, H -> nu nu nu nu
Image Added

 

 

HZZ generated sample stats

 

 

 

 

 

 

 

 

 

events

weight

 

I108056.001.stdhep

10000

20000

1.29E-004

e1e1h_zz.Gwhizard-

I108055.001.stdhep

10000

20000

4.36E-003

e1e1h_zz.Gwhizard-

I108054.001.stdhep

10000

20000

4.11E-004

e1e1h_zz.Gwhizard-

I108053.001.stdhep

10000

20000

2.70E-005

e1e1h_zz.Gwhizard-

I108058.001.stdhep

10000

20000

4.29E-003

e2e2h_zz.Gwhizard-

I108057.001.stdhep

10000

20000

4.00E-004

e2e2h_zz.Gwhizard-

I108060.001.stdhep

10000

20000

4.28E-003

e3e3h_zz.Gwhizard-

I108059.001.stdhep

10000

20000

4.00E-004

e3e3h_zz.Gwhizard-

 

 

Sum =

1.43E-002

10.40%

I108062.001.stdhep

10000

20000

2.54E-002

nnh_zz.Gwhizard-1_

I108061.001.stdhep

10000

20000

3.01E-003

nnh_zz.Gwhizard-1_

 

 

Sum =

2.84E-002

20.68%

I108064.001.stdhep

10000

20000

8.67E-002

qqh_zz.Gwhizard-1_

I108063.001.stdhep

10000

20000

8.08E-003

qqh_zz.Gwhizard-1_

 

 

Sum =

9.47E-002

68.93%

 

 

 

 

 

 

 

 

 

 

 

 

All =

1.37E-001

 

Mass of lepton pair with mass closest to mZ:

Image Added

The following also shows the jet pair mass for those that have mass closest to mW:

Image Added

Note: Compared to last week plots and tables there was also a simple programming error which I've already fixed.

...

Performance of different MVA options:

BEFORE

Code Block

 [neal@localhost weights]$ grep -A 8 -i optimal- ../hzzv14-presel-v4-new-common0410results.txt
 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
 -- -------------
Code Block

 [neal@localhost weights]$ grep -A 8 -i optimal- ../hzzv14-presel-v4-new-common0410results.txt
 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
 -- ---------------------------------------------------------------------------------------------
 ---       Cuts:  ( 664.0744,1312202.5)      -0.0050            0         0         0        0        0
 --- Likelihood:  ( 664.0744,1312202.5)       1.0000     0.728991  644.0848  779980.1   0.9699   0.5944
 ---     Fisher:  ( 664.0744,1312202.5)       0.0120      1.70335  258.2099  22721.13   0.3888  0.01732
 ---       BDTG:  ( 664.0744,1312202.5)      -0.9823      1.17754  538.5353  208621.3    0.811    0.159
 ---        BDT:  ( 664.0744,1312202.5)      -0.0462      3.42709  100.6751  762.2891   0.1516 0.0005809
 -- --------------------------------------------------------------------------------
 -------------

NOW:

Code Block

     if ((fp = fopen("HZZ-TMVA-vars-ffh_zz_all_SM_background_+80e-_-30e+-v6-mini.txt","r"))==NULL) exit(0);
[neal@localhost test]$ source storeresultsHZZv4.sh hzz-withlep-withmW-n2-Mhiggspresel-plusptntrkp8g16y34nvtx


--- ==================================================================================================
--- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
---        Cuts:  ( 664.0744,1312202.5)      -0.0050            0         0         0        0        0
 --- Likelihood:  ( 664.0744,1312202.5)       1.0000     0.728991  644.0848  779980.1   0.9699   0.5944
 ---     Fisher:  ( 664.0744,1312202.5)       0.0120      1.70335  258.2099  22721.13   0.3888  0.01732
 ---       BDTG:  ( 664.0744,1312202.5)      -0.9823      1.17754  538.5353  208621.3    0.811    0.159
 ---        BDT:  ( 664.0744,1312202.5)      -0.0462      3.42709  100.6751  762.2891   0.1516 0.0005809
 -- ---------------------------------------------------------------------------------------------

NOW:

Code Block

-----------
---     if ((fp Cuts:=  (1035.8644,  1196975)      -0.0050            0         0         0        0        0
--- Likelihood:  (1035.8644,  1196975)       1.0000      1.66886  315.8824  35510.98   0.3049  0.02967
---     Fisher:  (1035.8644,  1196975)       0.0047      2.38122  586.8862  60157.76   0.5666  0.05026
---       BDTG:  (1035.8644,  1196975)      -0.9795        2.625  293.8675  12238.81   0.2837  0.01022
---        BDT:  (1035.8644,  1196975)      -0.0427       4.9889  146.0711  711.2005    0.141 0.0005942
--- fopen("HZZ-TMVA-vars-ffh_zz_all_SM_background_+80e-_-30e+-v6-mini.txt","r"))==NULL) exit(0);
[neal@localhost test]$ source storeresultsHZZv4.sh hzz-withlep-withmW-n2-Mhiggspresel-plusptntrkp8g16y34nvtx


--- ==================================================================================================
--- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
--- --------------------------------------------------------------------------------------------------
---------------------

Image Removed

Image Removed

Image Removed
Image Removed
Image Removed Image Removed

Spent a lot of time playing with different preselection cuts and then settled for only making cuts where
I was cutting almost no signal away and then letting the MVA's do the rest.

Some signals are just not reasonable to try to select:

1) decays like l+l-H where H->ffnunu
Even in cases where the ff are leptons the initial l+l- are more energetic
and those were being selected. Image Removed
Image Removed
Image Removed

Image Removed
Image Removed
Image Removed
Changed to checking all opposite signed lepton pairs for consistency with coming
from a Z.

Image Removed

 

 

HZZ generated sample stats

 

 

 

 

 

 

 

 

 

events

weight

 

I108056.001.stdhep

10000

20000

1.29E-004

e1e1h_zz.Gwhizard-

I108055.001.stdhep

10000

20000

4.36E-003

e1e1h_zz.Gwhizard-

I108054.001.stdhep

10000

20000

4.11E-004

e1e1h_zz.Gwhizard-

I108053.001.stdhep

10000

20000

2.70E-005

e1e1h_zz.Gwhizard-

I108058.001.stdhep

10000

20000

4.29E-003

e2e2h_zz.Gwhizard-

I108057.001.stdhep

10000

20000

4.00E-004

e2e2h_zz.Gwhizard-

I108060.001.stdhep

10000

20000

4.28E-003

e3e3h_zz.Gwhizard-

I108059.001.stdhep

10000

20000

4.00E-004

e3e3h_zz.Gwhizard-

 

 

Sum =

1.43E-002

10.40%

I108062.001.stdhep

10000

20000

2.54E-002

nnh_zz.Gwhizard-1_

I108061.001.stdhep

10000

20000

3.01E-003

nnh_zz.Gwhizard-1_

 

 

Sum =

2.84E-002

20.68%

I108064.001.stdhep

10000

20000

8.67E-002

qqh_zz.Gwhizard-1_

I108063.001.stdhep

10000

20000

8.08E-003

qqh_zz.Gwhizard-1_

 

 

Sum =

9.47E-002

68.93%

 

 

 

 

 

 

 

 

 

 

 

 

All =

1.37E-001

 

2) There was also a simple programming error which I've already fixed.

Cut table for BDT: (OLD)

       Cuts:  (1035.8644,  1196975)      -0.0050            0         0         0        0        0
--- Likelihood:  (1035.8644,  1196975)       1.0000      1.66886  315.8824  35510.98   0.3049  0.02967
---     Fisher:  (1035.8644,  1196975)       0.0047      2.38122  586.8862  60157.76   0.5666  0.05026
---       BDTG:  (1035.8644,  1196975)      -0.9795        2.625  293.8675  12238.81   0.2837  0.01022
---        BDT:  (1035.8644,  1196975)      -0.0427       4.9889  146.0711  711.2005    0.141 0.0005942
--- --------------------------------------------------------------------------------------------------
                                                          ^^^^^^

BDT plots:

Image Added

Image Added

Image Added
Image Added
Image Added Image Added

...

Cut table for BDT: (NEW)NEW:

Code Block

 cuts__ (Cut Name                          ):       all   others        2f       4f       6f       aa       1f       3f       5f        q      e+mu      tau       nu  Signif.
 cut #0 (all                               ): 7.769e+07   625000  14507062  1019938        0 54073784        0  7464875        0       948       91       45      262      0.15 
Code Block

OLD:

isigall= 1345.984602 isigpresel= 1460.101057 igood= 114.116455 ibad= 762.500000
intot= 77692005.984714 ipre= 876.616455
cuts__ (Cut Name            ):       all   others       2f       4f       6f       aa       1f       3f       5f        q     e+mu      tau       nu  Signif.
cut #0 (all                 ): 7.769e+07   625000 14507062  1019938        0 54073784        0  7464875        0      948       91       45      262      0.15  Delta(sig*BR) = 6.548594 +/- 0.046879
cut #1 (20.<PTvisJETS       ): 1.283e+07    37500  5525512   746188        0  2968250        0  3546875        0      806       69       44      259      0.33  Delta(sig*BR) = 3.042092 +/- 0.024172
cut #2 (EvisJETS<220.       ): 4.637e+06    25000  3287550   514400        0   130196        0   678875        0      608       42       41      219      0.42  Delta(sig*BR) = 2.365748 +/- 0.020248
cut #3 (50.<hmass<140.      ): 3.306e+06        0  2542562   378675        0    60426        0   323875        0      536       38       35      206      0.45  Delta(sig*BR) = 26.233245548594 +/-  0.019503046879
 cut #4#1 (||dph|-3.14159|>0.15y34>0.y34>0.                      ): 27.772e705e+06       07   625000  2083912  14449800  364150       1018312        0   0 57097       53568784        0   266000       7387500        0       514      947       36      91       33     45      202     262      0.47 15  Delta(sig*BR) = 26.120439523157 +/-  0.018965046724
 cut #5#2 (nTrks>10           95.<hmass<140. 95.<hmass<140.     ): 1.426e+06        0  1136375   169088        0    30283        0    89875        0      466       10       19      177      0.56 257e+07    150000 10264438   825238        0   387992        0   938000         0      901       86       43      213      0.35  Delta(sig*BR) =  12.777889854636 +/- 0.017994021075
 cut #6#3 (jetthrust<0.99     25.<PTvisJETS<70 PTvisJETS no cut ): 15.325e637e+06       06    25000   1044700  4545812   165588       744212        0   0     30033       8750        0   0   84125       312125        0       463      866       10      84       19     37      173     180      0.58 49  Delta(sig*BR) = 12.730704033046 +/-  0.017728015436
 cut #7#4 (ej1>40.             ): 1.313e+06        0  1035112   164625        0    29215        0    83250        0      462       10       19      173      0.58 nTrks>5 nTrks>5                   ): 3.826e+06        0   2957175   679838        0     8750        0   179500        0       866       81       36      176      0.59  Delta(sig*BR) = 1.725415688536 +/-  0.017704012688
 cut #8#5 (MVA                jetthrust ): 8.766e+02        0      138      625        0        0        0        0        0       51        0        1       62      3.85 no cut jetthrust < 0.98 ): 3.826e+06        0   2957175   679838        0     8750        0   179500        0       866       81       36      176      0.59  Delta(sig*BR) = 01.259452688536 +/-  0.015456

Remaining backgrounds: (OLD)

Code Block

$ sed 's/\./,/g' zzhpassing.dat | awk '{if (NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 762,5
12,5          1 106562 4f_sz 12,500000
12,5          1 106566 4f_sw 12,500000
12,5          1 106572 4f_sz 12,500000
12,5          1 106575 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106551 4f_ww 12,500000
25            2 106553 4f_zz 12,500000
25            2 106571 4f_sz 12,500000
25            2 106573 4f_zz 12,500000
37,5          3 106574 4f_zz 12,500000
37,5          3 106578 4f_ww 12,500000
125          10 106608 2f_z_ 12,500000
162,5        13 106576 4f_zz 12,500000
237,5        19 106577 4f_ww 12,500000

Background contributions for Evis>140:

012688
 cut #6 (ej1<120. ej1<120.                 ): 2.144e+06        0   1480162   539312        0     8750        0   114500        0       844       78       35      176      0.77  Delta(sig*BR) = 1.292336 +/-  0.009952
 cut #7 (MVA                               ): 9.252e+02        0        50      712        0        0        0        0        0       79        27        3       53      5.35  Delta(sig*BR) = 0.186991 +/- 0.010382

Remaining backgrounds: (NEW)

Code Block

$ sed 's/\./,/g' zzhpassing.dat | awk '{if (NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 762,5
12,5          1 106561 4f_sz 12,500000
12,5          1 106564 4f_sw 12,500000
12,5          1 106573 4f_zz 12,500000
12,5          1 106580 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106551
Code Block

[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '{if ($18>=140. && NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 187,5
12,5          1 106551 4f_ww 12,500000
12,5          1 106577 4f_ww 12,500000
25            2 106553106562 4f_zzsz 12,500000
25           37,5          23 106573106575 4f_zz 12,500000
37,5          3 106608 2f_z_ 12,500000
25           50            24 106576106578 4f_zzww 12,500000
3762,5          35 106574106572 4f_zzsz 12,500000
50           100           48 106608106574 2f4f_z_zz 12,500000

Background contributions for Evis<140: (OLD)

Image Removed

Code Block

 [neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '{if ($18<140. && NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 575
12,5          1 106551 4f_ww 12,500000
12,5          1 106562 4f_sz 12,500000
12,5          1 106566 4f_sw 12,500000
12,5          1 106572 4f_sz 12,500000
12,5          1 106575 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106571 4f_sz 12,500000
37,5          3 106578 4f_ww 12,500000
75            6 106608 2f_z_ 12,500000
137,5        11 106576 4f_zz 12,500000
225          18 106577 4f_ww 12,500000 <***************
Code Block

 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
--- --------------------------------------------------------------------------------------------------
---       Cuts:  (309.89453,  1128865)      -0.0050            0         0         0        0        0
--- Likelihood:  (309.89453,  1128865)       1.0000     0.378385  299.2573  625191.7   0.9657   0.5538
---     Fisher:  (309.89453,  1128865)       0.0088      1.63217  150.0501  8301.587   0.4842 0.007354
---       BDTG:  (309.89453,  1128865)      -0.9950      1.40141  134.4158  9065.234   0.4337  0.00803
---        BDT:  (309.89453,  1128865)       0.0177      3.63521   38.5486  73.90136   0.1244 6.547e-05
--- --------------------------------------------------------------------------------------------------

after raising reco higgs mass cut from 50 GeV to 100 GeV:

 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
--- --------------------------------------------------------------------------------------------------
---       Cuts:  (209.30833,    51800)      -0.0050            0         0         0        0        0
--- Likelihood:  (209.30833,    51800)       0.9105      1.08931  191.9554  30860.64   0.9171   0.5958
---     Fisher:  (209.30833,    51800)       0.0176      1.78558  116.8125  4162.945   0.5581  0.08037
---       BDTG:  (209.30833,    51800)      -0.5098      1.00598  204.0195  40926.24   0.9747   0.7901
---        BDT:  (209.30833,    51800)       0.0361      2.92564   72.9263  548.4119   0.3484  0.01059
--- --------------------------------------------------------------------------------------------------

Evis<140 GeV and only idrup = 106577 for the background: (OLD)

Image Removed
Image Removed
Image Removed

125          10 106577 4f_ww 12,500000
237,5        19 106576 4f_zz 12,500000  <---- 

Plans:

  • just noticed that the preselection nTrks cut is on the PFO count and not the #charged tracks distribution shown so I will switch to
    using the charged tracks count in the preselection
  • may still be able to do better with the leptonic Z decay selection
    • high evis events with well reconstructed leptonic Z decays and a clear 6 jet topology (using y56 cut)
  • write report

...