Update on the HZZ 250 GeV +80/-30 250/fb Analysis for 17 September 2013:
Analysis:
- divide into 4 jet / 6 jet topologies
- apply preselection depending on topology
- train/apply TMVA's
- validate with cut table
- check remaining backgrounds
Distributions before preselection with only a cut on the reconstructed Higgs mass:
Preselection:
Evis<140.0 (4 jet category):
- y34>0.0
- hmass>95. && hmass<140.
- PTvisJETS>25.0 && PTvisJETS<70.0
- nTrks>5.
- ej1<120.
Evis>140 (6 jet category):
- y34>0.0
- hmass>90. && hmass<160.
- ej1<120.
- jetthrust<0.98
- nTrks>5
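The preselection above can be written as one function that branches on the visible energy. This is a minimal sketch, not the analysis code: it assumes each event is a Python dict keyed by the variable names in the cut list, and it assigns the boundary case Evis = 140 to the 6 jet category, which the cut list itself leaves open.

```python
def passes_preselection(ev):
    """Return True if the event passes the topology-dependent preselection.

    `ev` is a dict keyed by the variable names used in the cut list
    (an assumed event format, chosen only for illustration).
    """
    if ev["Evis"] < 140.0:
        # 4 jet category
        return (
            ev["y34"] > 0.0
            and 95.0 < ev["hmass"] < 140.0
            and 25.0 < ev["PTvisJETS"] < 70.0
            and ev["nTrks"] > 5
            and ev["ej1"] < 120.0
        )
    # 6 jet category (Evis == 140 lands here by assumption)
    return (
        ev["y34"] > 0.0
        and 90.0 < ev["hmass"] < 160.0
        and ev["ej1"] < 120.0
        and ev["jetthrust"] < 0.98
        and ev["nTrks"] > 5
    )


# Hypothetical events, not taken from the n-tuples.
ev4 = {"Evis": 120.0, "y34": 0.01, "hmass": 125.0, "PTvisJETS": 40.0,
       "nTrks": 12, "ej1": 60.0, "jetthrust": 0.90}
ev6 = {**ev4, "Evis": 200.0, "hmass": 92.0}
print(passes_preselection(ev4), passes_preselection(ev6))
```

Note that the same reconstructed Higgs mass of 92 GeV passes in the 6 jet category but would fail the tighter 4 jet window.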
Distributions after preselection:
(Note: the lepton pair mass plot has the entries at less than 10 GeV randomly set to 10 and 130 GeV so that the signal peak can be seen.)
The TMVA variables:
- Evis JETS
- Mvis JETS
- PTvis JETS
- B Likeness
- C likeness
- Number of HE Electrons
- Higgs Mass
- Z energy
- Zstr energy
- Z cos theta
- Zstr cos theta
- Z mass
- Zstr mass
- azimuthal separation of Z jets
- thrust
- Number Charged Tracks
- Number of electrons
- Number of muons
- ----NEW VARIABLES:
- y34
- y56
- lepton pair (PDG ID1 = -ID2) mass closest to mZ
- jet pair mass closest to mW
- jet pair1 - mW + jet pair2 - mW
Performance of different MVA options:
BEFORE
[neal@localhost weights]$ grep -A 8 -i optimal- ../hzzv14-presel-v4-new-common0410results.txt
--- Classifier    ( #signal, #backgr.)   Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig     EffBkg
--- ---------------------------------------------------------------------------------------------
--- Cuts:         ( 664.0744,1312202.5)      -0.0050            0         0         0        0          0
--- Likelihood:   ( 664.0744,1312202.5)       1.0000     0.728991  644.0848  779980.1   0.9699     0.5944
--- Fisher:       ( 664.0744,1312202.5)       0.0120      1.70335  258.2099  22721.13   0.3888    0.01732
--- BDTG:         ( 664.0744,1312202.5)      -0.9823      1.17754  538.5353  208621.3    0.811      0.159
--- BDT:          ( 664.0744,1312202.5)      -0.0462      3.42709  100.6751  762.2891   0.1516  0.0005809
--- ---------------------------------------------------------------------------------------------
NOW:
if ((fp = fopen("HZZ-TMVA-vars-ffh_zz_all_SM_background_+80e-_-30e+-v6-mini.txt","r"))==NULL) exit(0);
[neal@localhost test]$ source storeresultsHZZv4.sh hzz-withlep-withmW-n2-Mhiggspresel-plusptntrkp8g16y34nvtx
--- ==================================================================================================
--- Classifier    ( #signal, #backgr.)   Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig     EffBkg
--- --------------------------------------------------------------------------------------------------
--- Cuts:         (1035.8644, 1196975)       -0.0050            0         0         0        0          0
--- Likelihood:   (1035.8644, 1196975)        1.0000      1.66886  315.8824  35510.98   0.3049    0.02967
--- Fisher:       (1035.8644, 1196975)        0.0047      2.38122  586.8862  60157.76   0.5666    0.05026
--- BDTG:         (1035.8644, 1196975)       -0.9795        2.625  293.8675  12238.81   0.2837    0.01022
--- BDT:          (1035.8644, 1196975)       -0.0427       4.9889  146.0711  711.2005    0.141  0.0005942
--- --------------------------------------------------------------------------------------------------
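The S/sqrt(S+B) column can be cross-checked directly from the NSig and NBkg columns. The sketch below recomputes it for the BEFORE and NOW tables and picks the best classifier; the dictionaries are hand-copied from those tables, and the function and variable names are only illustrative.

```python
import math


def significance(n_sig, n_bkg):
    """S / sqrt(S + B); returns 0.0 when no events are selected."""
    total = n_sig + n_bkg
    return n_sig / math.sqrt(total) if total > 0 else 0.0


# (NSig, NBkg) at the optimal cut, copied from the two tables above.
before = {
    "Likelihood": (644.0848, 779980.1),
    "Fisher": (258.2099, 22721.13),
    "BDTG": (538.5353, 208621.3),
    "BDT": (100.6751, 762.2891),
}
now = {
    "Likelihood": (315.8824, 35510.98),
    "Fisher": (586.8862, 60157.76),
    "BDTG": (293.8675, 12238.81),
    "BDT": (146.0711, 711.2005),
}

for name in now:
    print(f"{name:12s} before={significance(*before[name]):.3f} "
          f"now={significance(*now[name]):.3f}")

# Classifier with the highest significance in the NOW table.
best = max(now, key=lambda name: significance(*now[name]))
print("best:", best)
```

In both tables the BDT gives the highest significance, and it improves from about 3.43 to about 4.99.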
Spent a lot of time playing with different preselection cuts, then settled on only making cuts where
almost no signal was cut away and letting the MVAs do the rest.
1) Decays like l+l-H where H->ffnunu:
Even in cases where the ff are leptons, the initial l+l- are more energetic,
and those were the ones being selected.
Changed to checking all opposite-sign lepton pairs for consistency with coming
from a Z.
2) There was also a simple programming error which I've already fixed.
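The change in item 1 amounts to looping over all opposite-sign, same-flavour lepton pairs and keeping the one whose invariant mass is closest to the Z mass, rather than taking the most energetic leptons. The following is a minimal sketch under assumptions: leptons are given as hypothetical (PDG ID, (E, px, py, pz)) tuples in GeV, and 91.1876 GeV is used for mZ. It is not the n-tuple code.

```python
import math
from itertools import combinations

M_Z = 91.1876  # GeV, assumed reference value for the Z mass


def invariant_mass(p1, p2):
    """Invariant mass of two four-vectors given as (E, px, py, pz)."""
    e, px, py, pz = (a + b for a, b in zip(p1, p2))
    m2 = e * e - px * px - py * py - pz * pz
    return math.sqrt(max(m2, 0.0))  # guard against rounding below zero


def best_z_candidate(leptons):
    """Return (mass, i, j) for the pair with PDG ID1 = -ID2 closest to M_Z.

    Returns None if no such pair exists.
    """
    best = None
    for (i, (id1, p1)), (j, (id2, p2)) in combinations(enumerate(leptons), 2):
        if id1 != -id2:
            continue  # not an opposite-sign, same-flavour pair
        mass = invariant_mass(p1, p2)
        if best is None or abs(mass - M_Z) < abs(best[0] - M_Z):
            best = (mass, i, j)
    return best


# Hypothetical event: an energetic e+e- pair and a softer mu+mu- pair.
leptons = [
    (11, (50.0, 50.0, 0.0, 0.0)),    # e-
    (-11, (50.0, 0.0, 50.0, 0.0)),   # e+
    (13, (45.6, 0.0, 0.0, 45.6)),    # mu-
    (-13, (45.6, 0.0, 0.0, -45.6)),  # mu+
]
print(best_z_candidate(leptons))
```

Here the electron pair is more energetic but has a mass near 70.7 GeV, so the muon pair at 91.2 GeV is chosen.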
Cut table for BDT:
OLD:
isigall= 1345.984602 isigpresel= 1460.101057 igood= 114.116455 ibad= 762.500000 intot= 77692005.984714 ipre= 876.616455
cuts__ (Cut Name             ):       all  others        2f       4f  6f        aa  1f       3f  5f    q  e+mu  tau   nu  Signif.
cut #0 (all                  ): 7.769e+07  625000  14507062  1019938   0  54073784   0  7464875   0  948    91   45  262     0.15
    Delta(sig*BR) = 6.548594 +/- 0.046879
cut #1 (20.<PTvisJETS        ): 1.283e+07   37500   5525512   746188   0   2968250   0  3546875   0  806    69   44  259     0.33
    Delta(sig*BR) = 3.042092 +/- 0.024172
cut #2 (EvisJETS<220.        ): 4.637e+06   25000   3287550   514400   0    130196   0   678875   0  608    42   41  219     0.42
    Delta(sig*BR) = 2.365748 +/- 0.020248
cut #3 (50.<hmass<140.       ): 3.306e+06       0   2542562   378675   0     60426   0   323875   0  536    38   35  206     0.45
    Delta(sig*BR) = 2.233245 +/- 0.019503
cut #4 (||dph|-3.14159|>0.15 ): 2.772e+06       0   2083912   364150   0     57097   0   266000   0  514    36   33  202     0.47
    Delta(sig*BR) = 2.120439 +/- 0.018965
cut #5 (nTrks>10             ): 1.426e+06       0   1136375   169088   0     30283   0    89875   0  466    10   19  177     0.56
    Delta(sig*BR) = 1.777889 +/- 0.017994
cut #6 (jetthrust<0.99       ): 1.325e+06       0   1044700   165588   0     30033   0    84125   0  463    10   19  173     0.58
    Delta(sig*BR) = 1.730704 +/- 0.017728
cut #7 (ej1>40.              ): 1.313e+06       0   1035112   164625   0     29215   0    83250   0  462    10   19  173     0.58
    Delta(sig*BR) = 1.725415 +/- 0.017704
cut #8 (MVA                  ): 8.766e+02       0       138      625   0         0   0        0   0   51     0    1   62     3.85
    Delta(sig*BR) = 0.259452 +/- 0.015456
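The numbers in the cut table can be cross-checked against each other. In the sketch below, two things are inferences from the printed values rather than statements from the analysis code: that the q, e+mu, tau and nu columns are the signal categories (their sum in the last row matches igood), and that Delta(sig*BR) equals sqrt(S+B)/S. The helper names are made up for illustration.

```python
import math

COLUMNS = ["all", "others", "2f", "4f", "6f", "aa", "1f", "3f", "5f",
           "q", "e+mu", "tau", "nu", "Signif."]
SIGNAL = ["q", "e+mu", "tau", "nu"]  # assumption: these are the signal columns


def parse_row(values):
    """Map the whitespace-separated numbers of one cut row onto COLUMNS."""
    return dict(zip(COLUMNS, (float(v) for v in values.split())))


def relative_uncertainty(n_sig, n_tot):
    """sqrt(S + B) / S, the inverse of the significance S / sqrt(S + B)."""
    return math.sqrt(n_tot) / n_sig


# Last row of the table (cut #8, MVA).
row8 = parse_row("8.766e+02 0 138 625 0 0 0 0 0 51 0 1 62 3.85")
n_sig = sum(row8[c] for c in SIGNAL)
print("signal events:", n_sig)                         # compare with igood
print("significance:", n_sig / math.sqrt(row8["all"]))  # compare with Signif.

# Using the unrounded igood and ipre values printed above the table.
print("after MVA:", relative_uncertainty(114.116455, 876.616455))
# Using isigall and intot for the row without cuts.
print("no cuts:  ", relative_uncertainty(1345.984602, 77692005.984714))
```

Both values reproduce the printed Delta(sig*BR) entries for cut #8 and cut #0, which supports the assumed relation. The quoted +/- errors are not reproduced here.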
Remaining backgrounds:
$ sed 's/\./,/g' zzhpassing.dat | awk '{if (NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 762,5
12,5    1  106562 4f_sz 12,500000
12,5    1  106566 4f_sw 12,500000
12,5    1  106572 4f_sz 12,500000
12,5    1  106575 4f_zz 12,500000
12,5    1  106607 2f_z_ 12,500000
25      2  106551 4f_ww 12,500000
25      2  106553 4f_zz 12,500000
25      2  106571 4f_sz 12,500000
25      2  106573 4f_zz 12,500000
37,5    3  106574 4f_zz 12,500000
37,5    3  106578 4f_ww 12,500000
125     10 106608 2f_z_ 12,500000
162,5   13 106576 4f_zz 12,500000
237,5   19 106577 4f_ww 12,500000
Background contributions for Evis>140:
[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '{if ($18>=140. && NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 187,5
12,5    1  106551 4f_ww 12,500000
12,5    1  106577 4f_ww 12,500000
25      2  106553 4f_zz 12,500000
25      2  106573 4f_zz 12,500000
25      2  106576 4f_zz 12,500000
37,5    3  106574 4f_zz 12,500000
50      4  106608 2f_z_ 12,500000
Background contributions for Evis<140:
[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '{if ($18<140. && NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 575
12,5    1  106551 4f_ww 12,500000
12,5    1  106562 4f_sz 12,500000
12,5    1  106566 4f_sw 12,500000
12,5    1  106572 4f_sz 12,500000
12,5    1  106575 4f_zz 12,500000
12,5    1  106607 2f_z_ 12,500000
25      2  106571 4f_sz 12,500000
37,5    3  106578 4f_ww 12,500000
75      6  106608 2f_z_ 12,500000
137,5   11 106576 4f_zz 12,500000
225     18 106577 4f_ww 12,500000 <***************
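The sed/awk/sort/uniq pipelines above group the passing events by process and sum their weights. The same bookkeeping can be sketched in Python. The field positions (3 = idrup, 7 = process name, 15 = weight, 18 = Evis, out of 18 fields) are taken from the awk commands; the `make_line` helper and the sample rows are placeholders for illustration, not real contents of zzhpassing.dat. The sketch also assumes decimal points in the input, so the sed comma substitution is not needed.

```python
from collections import Counter


def weight_by_process(lines, evis_min=None, evis_max=None):
    """Sum event weights per (idrup, process, weight) group.

    Keeps only lines with 18 fields and evis_min <= Evis < evis_max,
    mirroring the $18>=140. and $18<140. conditions in the awk commands.
    """
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) != 18:
            continue
        evis = float(fields[17])
        if evis_min is not None and evis < evis_min:
            continue
        if evis_max is not None and evis >= evis_max:
            continue
        counts[(fields[2], fields[6], float(fields[14]))] += 1
    table = {key: n * key[2] for key, n in counts.items()}
    return table, sum(table.values())


def make_line(idrup, process, weight, evis):
    """Build a placeholder 18-field record (hypothetical layout filler)."""
    fields = ["0"] * 18
    fields[2], fields[6] = str(idrup), process
    fields[14], fields[17] = str(weight), str(evis)
    return " ".join(fields)


# Placeholder rows: two 4f_ww events below 140 GeV, one 2f_z_ event above.
lines = [
    make_line(106577, "4f_ww", 12.5, 120.0),
    make_line(106577, "4f_ww", 12.5, 131.0),
    make_line(106608, "2f_z_", 12.5, 150.0),
]
table, total = weight_by_process(lines)
_, total_low = weight_by_process(lines, evis_max=140.0)
_, total_high = weight_by_process(lines, evis_min=140.0)
print(total, total_low, total_high)
```

Because every event is either below 140 GeV or at or above it, the two category sums add up to the inclusive sum, as they do in the listings above (575 + 187,5 = 762,5).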
--- Classifier    ( #signal, #backgr.)   Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig     EffBkg
--- --------------------------------------------------------------------------------------------------
--- Cuts:         (309.89453, 1128865)       -0.0050            0         0         0        0          0
--- Likelihood:   (309.89453, 1128865)        1.0000     0.378385  299.2573  625191.7   0.9657     0.5538
--- Fisher:       (309.89453, 1128865)        0.0088      1.63217  150.0501  8301.587   0.4842   0.007354
--- BDTG:         (309.89453, 1128865)       -0.9950      1.40141  134.4158  9065.234   0.4337    0.00803
--- BDT:          (309.89453, 1128865)        0.0177      3.63521   38.5486  73.90136   0.1244  6.547e-05
--- --------------------------------------------------------------------------------------------------
after raising reco higgs mass cut from 50 GeV to 100 GeV:
--- Classifier    ( #signal, #backgr.)   Optimal-cut  S/sqrt(S+B)      NSig      NBkg   EffSig     EffBkg
--- --------------------------------------------------------------------------------------------------
--- Cuts:         (209.30833, 51800)         -0.0050            0         0         0        0          0
--- Likelihood:   (209.30833, 51800)          0.9105      1.08931  191.9554  30860.64   0.9171     0.5958
--- Fisher:       (209.30833, 51800)          0.0176      1.78558  116.8125  4162.945   0.5581    0.08037
--- BDTG:         (209.30833, 51800)         -0.5098      1.00598  204.0195  40926.24   0.9747     0.7901
--- BDT:          (209.30833, 51800)          0.0361      2.92564   72.9263  548.4119   0.3484    0.01059
--- --------------------------------------------------------------------------------------------------
Evis<140 GeV and only idrup = 106577 for the background:
Potential tools for reducing the 4f_WW background:
- higher cut on number of charged tracks (unlikely to help S)
- extra cuts but keep any event that has high B-likeness
- redoing n-tuples adding variables for jet combos consistent with W decays (IN PROGRESS)
Note concerning ZZ background: wider cut on acoplanarity should help