You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 5 Next »

Update on the HZZ 250 GeV +80/-30 250/fb Analysis for 17 September 2013:


Analysis:

  • divide into 4 jet / 6 jet topologies
  • apply preselection depending on topology
  • train/apply TMVA's
  • validate with cut table
  • check remaining backgrounds

Distributions before preselection with only a cut on the reconstructed Higgs mass:



Preselection:

Evis<140.0: (4 Jet category)
#y34>0.0#hmass>95. && hmass<140.
#PTvisJETS>25.0 && PTvisJETS<70.0
#nTrks>5.
#ej1<120.

Evis>140: (6 jet category)
#y34>0.0
#hmass>90. && hmass<160.
#ej1<120.
#jetthrust<0.98
#nTrks>5

Distributions after preselection:

aaa


(Note: the lepton pair mass plot has the entries at less than 10 GeV randomly set to 10 and 130 GeV so that the signal peak can be seen.)

The TMVA variables:

  1. Evis JETS
  2. Mvis JETS
  3. PTvis JETS
  4. B Likeness
  5. C likeness
  6. Number of HE Electrons
  7. Higgs Mass
  8. Z energy
  9. Zstr energy
  10. Z cos theta
  11. Zstr cos theta
  12. Z mass
  13. Zstr mass
  14. azimuthal separation of Z jets
  15. thrust
  16. Number Charged Tracks
  17. Number of electrons
  18. Number of muons
  19.  
  20. ----NEW VARIABLES:
  1. y34
  2. y56
  3. lepton pair (PDG ID1 = -ID2) mass closest to Mz
  4. jet pair mass closest to mW
  5. jet pair1 - mW

    +

    jet pair2 - mW

Performance of different MVA options:

BEFORE

 [neal@localhost weights]$ grep -A 8 -i optimal- ../hzzv14-presel-v4-new-common0410results.txt
 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
 -- ---------------------------------------------------------------------------------------------
 ---       Cuts:  ( 664.0744,1312202.5)      -0.0050            0         0         0        0        0
 --- Likelihood:  ( 664.0744,1312202.5)       1.0000     0.728991  644.0848  779980.1   0.9699   0.5944
 ---     Fisher:  ( 664.0744,1312202.5)       0.0120      1.70335  258.2099  22721.13   0.3888  0.01732
 ---       BDTG:  ( 664.0744,1312202.5)      -0.9823      1.17754  538.5353  208621.3    0.811    0.159
 ---        BDT:  ( 664.0744,1312202.5)      -0.0462      3.42709  100.6751  762.2891   0.1516 0.0005809
 -- ---------------------------------------------------------------------------------------------

NOW:

     if ((fp = fopen("HZZ-TMVA-vars-ffh_zz_all_SM_background_+80e-_-30e+-v6-mini.txt","r"))==NULL) exit(0);
[neal@localhost test]$ source storeresultsHZZv4.sh hzz-withlep-withmW-n2-Mhiggspresel-plusptntrkp8g16y34nvtx


--- ==================================================================================================
--- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
--- --------------------------------------------------------------------------------------------------
---       Cuts:  (1035.8644,  1196975)      -0.0050            0         0         0        0        0
--- Likelihood:  (1035.8644,  1196975)       1.0000      1.66886  315.8824  35510.98   0.3049  0.02967
---     Fisher:  (1035.8644,  1196975)       0.0047      2.38122  586.8862  60157.76   0.5666  0.05026
---       BDTG:  (1035.8644,  1196975)      -0.9795        2.625  293.8675  12238.81   0.2837  0.01022
---        BDT:  (1035.8644,  1196975)      -0.0427       4.9889  146.0711  711.2005    0.141 0.0005942
--- --------------------------------------------------------------------------------------------------



Spent a lot of time playing with different preselection cuts and then settled for only making cuts where
I was cutting almost no signal away and the letting the MVA's do the rest.
1) decays like l+l-H where H->ffnunu
Even in cases where the ff are leptons the initial l+l- are more energetic
and those were being selected.

Changed to checking all opposite signed lepton pairs for consistency with coming
from a Z.


2) There was also a simple programming error which I've already fixed.


Cut table for BDT:

OLD:

isigall= 1345.984602 isigpresel= 1460.101057 igood= 114.116455 ibad= 762.500000
intot= 77692005.984714 ipre= 876.616455
cuts__ (Cut Name            ):       all   others       2f       4f       6f       aa       1f       3f       5f        q     e+mu      tau       nu  Signif.
cut #0 (all                 ): 7.769e+07   625000 14507062  1019938        0 54073784        0  7464875        0      948       91       45      262      0.15  Delta(sig*BR) = 6.548594 +/- 0.046879
cut #1 (20.<PTvisJETS       ): 1.283e+07    37500  5525512   746188        0  2968250        0  3546875        0      806       69       44      259      0.33  Delta(sig*BR) = 3.042092 +/- 0.024172
cut #2 (EvisJETS<220.       ): 4.637e+06    25000  3287550   514400        0   130196        0   678875        0      608       42       41      219      0.42  Delta(sig*BR) = 2.365748 +/- 0.020248
cut #3 (50.<hmass<140.      ): 3.306e+06        0  2542562   378675        0    60426        0   323875        0      536       38       35      206      0.45  Delta(sig*BR) = 2.233245 +/- 0.019503
cut #4 (||dph|-3.14159|>0.15 ): 2.772e+06        0  2083912   364150        0    57097        0   266000        0      514       36       33      202      0.47  Delta(sig*BR) = 2.120439 +/- 0.018965
cut #5 (nTrks>10            ): 1.426e+06        0  1136375   169088        0    30283        0    89875        0      466       10       19      177      0.56  Delta(sig*BR) = 1.777889 +/- 0.017994
cut #6 (jetthrust<0.99      ): 1.325e+06        0  1044700   165588        0    30033        0    84125        0      463       10       19      173      0.58  Delta(sig*BR) = 1.730704 +/- 0.017728
cut #7 (ej1>40.             ): 1.313e+06        0  1035112   164625        0    29215        0    83250        0      462       10       19      173      0.58  Delta(sig*BR) = 1.725415 +/- 0.017704
cut #8 (MVA                 ): 8.766e+02        0      138      625        0        0        0        0        0       51        0        1       62      3.85  Delta(sig*BR) = 0.259452 +/- 0.015456

Remaining backgrounds:

$ sed 's/\./,/g' zzhpassing.dat | awk '{if (NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 762,5
12,5          1 106562 4f_sz 12,500000
12,5          1 106566 4f_sw 12,500000
12,5          1 106572 4f_sz 12,500000
12,5          1 106575 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106551 4f_ww 12,500000
25            2 106553 4f_zz 12,500000
25            2 106571 4f_sz 12,500000
25            2 106573 4f_zz 12,500000
37,5          3 106574 4f_zz 12,500000
37,5          3 106578 4f_ww 12,500000
125          10 106608 2f_z_ 12,500000
162,5        13 106576 4f_zz 12,500000
237,5        19 106577 4f_ww 12,500000

Background contributions for Evis>140:

[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '{if ($18>=140. && NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 187,5
12,5          1 106551 4f_ww 12,500000
12,5          1 106577 4f_ww 12,500000
25            2 106553 4f_zz 12,500000
25            2 106573 4f_zz 12,500000
25            2 106576 4f_zz 12,500000
37,5          3 106574 4f_zz 12,500000
50            4 106608 2f_z_ 12,500000

Background contributions for Evis<140:

 [neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '{if ($18<140. && NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 575
12,5          1 106551 4f_ww 12,500000
12,5          1 106562 4f_sz 12,500000
12,5          1 106566 4f_sw 12,500000
12,5          1 106572 4f_sz 12,500000
12,5          1 106575 4f_zz 12,500000
12,5          1 106607 2f_z_ 12,500000
25            2 106571 4f_sz 12,500000
37,5          3 106578 4f_ww 12,500000
75            6 106608 2f_z_ 12,500000
137,5        11 106576 4f_zz 12,500000
225          18 106577 4f_ww 12,500000 <***************
 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
--- --------------------------------------------------------------------------------------------------
---       Cuts:  (309.89453,  1128865)      -0.0050            0         0         0        0        0
--- Likelihood:  (309.89453,  1128865)       1.0000     0.378385  299.2573  625191.7   0.9657   0.5538
---     Fisher:  (309.89453,  1128865)       0.0088      1.63217  150.0501  8301.587   0.4842 0.007354
---       BDTG:  (309.89453,  1128865)      -0.9950      1.40141  134.4158  9065.234   0.4337  0.00803
---        BDT:  (309.89453,  1128865)       0.0177      3.63521   38.5486  73.90136   0.1244 6.547e-05
--- --------------------------------------------------------------------------------------------------

after raising reco higgs mass cut from 50 GeV to 100 GeV:

 --- Classifier   (  #signal, #backgr.)  Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig   EffBkg
--- --------------------------------------------------------------------------------------------------
---       Cuts:  (209.30833,    51800)      -0.0050            0         0         0        0        0
--- Likelihood:  (209.30833,    51800)       0.9105      1.08931  191.9554  30860.64   0.9171   0.5958
---     Fisher:  (209.30833,    51800)       0.0176      1.78558  116.8125  4162.945   0.5581  0.08037
---       BDTG:  (209.30833,    51800)      -0.5098      1.00598  204.0195  40926.24   0.9747   0.7901
---        BDT:  (209.30833,    51800)       0.0361      2.92564   72.9263  548.4119   0.3484  0.01059
--- --------------------------------------------------------------------------------------------------

Evis<140 GeV and only idrup = 106577 for the background:



Potential tools for reducing the 4f_WW background:

  • higher cut on number of charged tracks (unlikely to help S)
  • extra cuts but keep any event that has high B-likeness
  • redoing n-tuples adding variables for jet combos consistent with W decays (IN PROGRESS)

Note concerning ZZ background: wider cut on acoplanarity should help

  • No labels