Update on the HZZ 250 GeV +80/-30 250/fb Analysis for 17-19 September 2013:
...
...
Analysis Procedure:
- divide into 4 jet / 6 jet topologies
- this is ZH with H->ZZ* , Z-> nn,ll,qq
- apply preselection depending on topology
- train/apply TMVAs
- validate with cut table
- check remaining backgrounds
Distributions before preselection with only a cut on the reconstructed Higgs mass:
...
...
Preselection:
Simplified and making only clean cuts ...
Code Block |
---|
Evis<140.0: (4 Jet category)
#y34>0.0
#hmass>95. && hmass<140.
#PTvisJETS>25.0 && PTvisJETS<70.0
#nTrks>5.
#ej1<120.
Evis>140: (6 jet category)
#y34>0.0
#hmass>90. && hmass<160.
#ej1<120.
#jetthrust<0.98
#nTrks>5
|
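The preselection above can be sketched as a small Python function. This is only an illustration, not the analysis code: the event is assumed to be a plain dict keyed by the variable names used in the cut list, and events with Evis exactly equal to 140 are assumed to go to the 6 jet category.

```python
def passes_preselection(ev):
    """Return (category, passed) for one event.

    `ev` is a hypothetical dict with keys Evis, y34, hmass, PTvisJETS,
    nTrks, ej1 and jetthrust (energies and masses in GeV).
    """
    # Cuts shared by both categories in the list above.
    common = ev["y34"] > 0.0 and ev["nTrks"] > 5 and ev["ej1"] < 120.0

    if ev["Evis"] < 140.0:
        # 4 jet category: tighter Higgs mass window plus a PTvisJETS window.
        passed = (common
                  and 95.0 < ev["hmass"] < 140.0
                  and 25.0 < ev["PTvisJETS"] < 70.0)
        return "4jet", passed

    # 6 jet category (assumption: Evis == 140 lands here).
    passed = (common
              and 90.0 < ev["hmass"] < 160.0
              and ev["jetthrust"] < 0.98)
    return "6jet", passed
```

For example, `passes_preselection({"Evis": 120., "y34": 0.01, "hmass": 125., "PTvisJETS": 40., "nTrks": 20, "ej1": 60., "jetthrust": 0.9})` gives the 4 jet category with the event passing.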
Distributions after preselection:
(Note: the lepton pair mass plot has the entries at less than 10 GeV randomly set to 10 and 130 GeV so that the signal peak can be seen.)
The TMVA variables:
- Evis JETS
- Mvis JETS
- PTvis JETS
- B Likeness
- C likeness
- Number of HE Electrons
- Higgs Mass
- Z energy
- Zstr energy
- Z cos theta
- Zstr cos theta
- Z mass
- Zstr mass
- azimuthal separation of Z jets
- thrust
- Number Charged Tracks
- Number of electrons
- Number of muons
----NEW VARIABLES:
- y34
- y56
- lepton pair (PDG ID1 = -ID2) mass closest to Mz
- jet pair mass closest to mW
- (jet pair1 mass - mW) + (jet pair2 mass - mW)
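A minimal sketch of how the two "closest to" variables in the list above could be computed. The four-vector layout `(E, px, py, pz)`, the function names and the nominal mass values are assumptions made for illustration; the real analysis code may differ.

```python
import math
from itertools import combinations

M_Z = 91.19  # GeV, nominal Z mass (assumed reference value)
M_W = 80.38  # GeV, nominal W mass (assumed reference value)


def inv_mass(p1, p2):
    """Invariant mass of two four-vectors given as (E, px, py, pz)."""
    e, px, py, pz = (a + b for a, b in zip(p1, p2))
    m2 = e * e - px * px - py * py - pz * pz
    return math.sqrt(max(m2, 0.0))  # guard against small negative rounding


def lepton_pair_mass_closest_to_mz(leptons):
    """leptons: list of (pdg_id, four_vector).

    Only pairs with PDG ID1 = -ID2 are considered; returns None if
    there is no such pair.
    """
    best = None
    for (id1, p1), (id2, p2) in combinations(leptons, 2):
        if id1 != -id2:
            continue
        m = inv_mass(p1, p2)
        if best is None or abs(m - M_Z) < abs(best - M_Z):
            best = m
    return best


def jet_pair_mass_closest_to_mw(jets):
    """jets: list of four-vectors; returns None for fewer than two jets."""
    masses = [inv_mass(a, b) for a, b in combinations(jets, 2)]
    return min(masses, key=lambda m: abs(m - M_W)) if masses else None
```

Checking every opposite-sign, same-flavour pair rather than only the most energetic leptons avoids picking up the wrong pair when several leptons are present in the event.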
Some signals are just not reasonable to try to select:
1) decays like l+l-H where H->ffnunu
Even in cases where the ff are leptons the initial l+l- are more energetic
and those were being selected.
eeH, H -> cc ee
nu nu H, H -> nu nu nu nu
| HZZ generated sample stats |
| events | weight |
I108056.001.stdhep | 10000 | 20000 | 1.29E-004 | e1e1h_zz.Gwhizard- |
I108055.001.stdhep | 10000 | 20000 | 4.36E-003 | e1e1h_zz.Gwhizard- |
I108054.001.stdhep | 10000 | 20000 | 4.11E-004 | e1e1h_zz.Gwhizard- |
I108053.001.stdhep | 10000 | 20000 | 2.70E-005 | e1e1h_zz.Gwhizard- |
I108058.001.stdhep | 10000 | 20000 | 4.29E-003 | e2e2h_zz.Gwhizard- |
I108057.001.stdhep | 10000 | 20000 | 4.00E-004 | e2e2h_zz.Gwhizard- |
I108060.001.stdhep | 10000 | 20000 | 4.28E-003 | e3e3h_zz.Gwhizard- |
I108059.001.stdhep | 10000 | 20000 | 4.00E-004 | e3e3h_zz.Gwhizard- |
| Sum = | 1.43E-002 | 10.40% |
I108062.001.stdhep | 10000 | 20000 | 2.54E-002 | nnh_zz.Gwhizard-1_ |
I108061.001.stdhep | 10000 | 20000 | 3.01E-003 | nnh_zz.Gwhizard-1_ |
| Sum = | 2.84E-002 | 20.68% |
I108064.001.stdhep | 10000 | 20000 | 8.67E-002 | qqh_zz.Gwhizard-1_ |
I108063.001.stdhep | 10000 | 20000 | 8.08E-003 | qqh_zz.Gwhizard-1_ |
| Sum = | 9.47E-002 | 68.93% |
| All = | 1.37E-001 |
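The per-channel sums and percentages in the sample table can be cross-checked with a few lines of Python. The weights below are copied from the table as printed (three significant figures), so the results agree with the quoted sums only to within rounding; the grouping label `llh` for the e1e1h, e2e2h and e3e3h samples is a name introduced here for illustration.

```python
# Event weights as printed in the table, grouped by production channel.
weights = {
    "llh": [1.29e-4, 4.36e-3, 4.11e-4, 2.70e-5,   # e1e1h_zz
            4.29e-3, 4.00e-4,                      # e2e2h_zz
            4.28e-3, 4.00e-4],                     # e3e3h_zz
    "nnh": [2.54e-2, 3.01e-3],
    "qqh": [8.67e-2, 8.08e-3],
}


def channel_fractions(weights):
    """Return (per-channel sums, total, per-channel percentage of total)."""
    sums = {ch: sum(w) for ch, w in weights.items()}
    total = sum(sums.values())
    percent = {ch: 100.0 * s / total for ch, s in sums.items()}
    return sums, total, percent


sums, total, percent = channel_fractions(weights)
for ch in weights:
    print(f"{ch}: sum = {sums[ch]:.3e}  ({percent[ch]:.2f}%)")
print(f"all: {total:.3e}")
```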
Mass of lepton pair with mass closest to mZ:
The following also shows the jet pair mass for those that have mass closest to mW:
Note: compared to last week's plots and tables, there was also a simple programming error, which I've already fixed.
...
Performance of different MVA options:
BEFORE
Code Block |
---|
[neal@localhost weights]$ grep -A 8 -i optimal- ../hzzv14-presel-v4-new-common0410results.txt
--- Classifier ( #signal, #backgr.) Optimal-cut S/sqrt(S+B) NSig NBkg EffSig EffBkg
--- ---------------------------------------------------------------------------------------------
--- Cuts: ( 664.0744,1312202.5) -0.0050 0 0 0 0 0
--- Likelihood: ( 664.0744,1312202.5) 1.0000 0.728991 644.0848 779980.1 0.9699 0.5944
--- Fisher: ( 664.0744,1312202.5) 0.0120 1.70335 258.2099 22721.13 0.3888 0.01732
--- BDTG: ( 664.0744,1312202.5) -0.9823 1.17754 538.5353 208621.3 0.811 0.159
--- BDT: ( 664.0744,1312202.5) -0.0462 3.42709 100.6751 762.2891 0.1516 0.0005809
--- ---------------------------------------------------------------------------------------------
|
NOW:
Code Block |
---|
if ((fp = fopen("HZZ-TMVA-vars-ffh_zz_all_SM_background_+80e-_-30e+-v6-mini.txt","r"))==NULL) exit(0);
[neal@localhost test]$ source storeresultsHZZv4.sh hzz-withlep-withmW-n2-Mhiggspresel-plusptntrkp8g16y34nvtx
--- ==================================================================================================
--- Classifier ( #signal, #backgr.) Optimal-cut S/sqrt(S+B) NSig NBkg EffSig EffBkg
--- --------------------------------------------------------------------------------------------------
--- Cuts: (1035.8644, 1196975) -0.0050 0 0 0 0 0
--- Likelihood: (1035.8644, 1196975) 1.0000 1.66886 315.8824 35510.98 0.3049 0.02967
--- Fisher: (1035.8644, 1196975) 0.0047 2.38122 586.8862 60157.76 0.5666 0.05026
--- BDTG: (1035.8644, 1196975) -0.9795 2.625 293.8675 12238.81 0.2837 0.01022
--- BDT: (1035.8644, 1196975) -0.0427 4.9889 146.0711 711.2005 0.141 0.0005942
--- --------------------------------------------------------------------------------------------------
|
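As a cross-check of the S/sqrt(S+B) column in the classifier tables above, the figure of merit can be recomputed from the NSig and NBkg columns. The sketch below also computes sqrt(S+B)/S, which appears to be numerically what the Delta(sig*BR) column of the cut tables on this page lists (for the "all" row, sqrt(intot)/isigall gives 6.5486); that reading is inferred from the numbers, not from the code that produced them. Function names are illustrative.

```python
import math


def significance(n_sig, n_bkg):
    """Statistical significance S / sqrt(S + B)."""
    return n_sig / math.sqrt(n_sig + n_bkg)


def relative_uncertainty(n_sig, n_bkg):
    """Relative statistical uncertainty sqrt(S + B) / S, the inverse of the above."""
    return math.sqrt(n_sig + n_bkg) / n_sig


# BDT rows of the two tables: (NSig, NBkg) as printed.
print(significance(100.6751, 762.2891))  # BEFORE table quotes 3.42709
print(significance(146.0711, 711.2005))  # NOW table quotes 4.9889

# "all" row of the cut table: S = isigall, S + B = intot.
isigall, intot = 1345.984602, 77692005.984714
print(relative_uncertainty(isigall, intot - isigall))
```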
I spent a lot of time trying different preselection cuts and then settled on making only cuts that remove almost no signal, letting the MVAs do the rest.
Some signals are just not reasonable to try to select:
1) decays like l+l-H where H->ffnunu
Even in cases where the ff are leptons, the initial l+l- are more energetic
and those were being selected.
Changed to checking all opposite-sign lepton pairs for consistency with coming
from a Z.
2) There was also a simple programming error, which I've already fixed.
Cut table for BDT: (OLD)
Code Block |
---|
OLD: isigall= 1345.984602 isigpresel= 1460.101057 igood= 114.116455 ibad= 762.500000 intot= 77692005.984714 ipre= 876.616455
cuts__ (Cut Name ): all others 2f 4f 6f aa 1f 3f 5f q e+mu tau nu Signif.
cut #0 (all ): 7.769e+07 625000 14507062 1019938 0 54073784 0 7464875 0 948 91 45 262 0.15 Delta(sig*BR) = 6.548594 +/- 0.046879
cut #1 (20.<PTvisJETS ): 1.283e+07 37500 5525512 746188 0 2968250 0 3546875 0 806 69 44 259 0.33 Delta(sig*BR) = 3.042092 +/- 0.024172
cut #2 (EvisJETS<220. ): 4.637e+06 25000 3287550 514400 0 130196 0 678875 0 608 42 41 219 0.42 Delta(sig*BR) = 2.365748 +/- 0.020248
cut #3 (50.<hmass<140. ): 3.306e+06 0 2542562 378675 0 60426 0 323875 0 536 38 35 206 0.45 Delta(sig*BR) = 2.233245 +/- 0.019503
cut #4 (||dph|-3.14159|>0.15 ): 2.772e+06 0 2083912 364150 0 57097 0 266000 0 514 36 33 202 0.47 Delta(sig*BR) = 2.120439 +/- 0.018965
cut #5 (nTrks>10 ): 1.426e+06 0 1136375 169088 0 30283 0 89875 0 466 10 19 177 0.56 Delta(sig*BR) = 1.777889 +/- 0.017994
cut #6 (jetthrust<0.99 ): 1.325e+06 0 1044700 165588 0 30033 0 84125 0 463 10 19 173 0.58 Delta(sig*BR) = 1.730704 +/- 0.017728
cut #7 (ej1>40. ): 1.313e+06 0 1035112 164625 0 29215 0 83250 0 462 10 19 173 0.58 Delta(sig*BR) = 1.725415 +/- 0.017704
cut #8 (MVA ): 8.766e+02 0 138 625 0 0 0 0 0 51 0 1 62 3.85 Delta(sig*BR) = 0.259452 +/- 0.015456
|
BDT plots:
...
Cut table for BDT: (NEW)
Code Block |
---|
cuts__ (Cut Name ): all others 2f 4f 6f aa 1f 3f 5f q e+mu tau nu Signif.
cut #0 (all ): 7.769e+07 625000 14507062 1019938 0 54073784 0 7464875 0 948 91 45 262 0.15 Delta(sig*BR) = 6.548594 +/- 0.046879
cut #1 (y34>0.y34>0. ): 7.705e+07 625000 14449800 1018312 0 53568784 0 7387500 0 947 91 45 262 0.15 Delta(sig*BR) = 6.523157 +/- 0.046724
cut #2 (95.<hmass<140. 95.<hmass<140. ): 1.257e+07 150000 10264438 825238 0 387992 0 938000 0 901 86 43 213 0.35 Delta(sig*BR) = 2.854636 +/- 0.021075
cut #3 (25.<PTvisJETS<70 PTvisJETS no cut ): 5.637e+06 25000 4545812 744212 0 8750 0 312125 0 866 84 37 180 0.49 Delta(sig*BR) = 2.033046 +/- 0.015436
cut #4 (nTrks>5 nTrks>5 ): 3.826e+06 0 2957175 679838 0 8750 0 179500 0 866 81 36 176 0.59 Delta(sig*BR) = 1.688536 +/- 0.012688
cut #5 (jetthrust no cut jetthrust < 0.98 ): 3.826e+06 0 2957175 679838 0 8750 0 179500 0 866 81 36 176 0.59 Delta(sig*BR) = 1.688536 +/- 0.012688
cut #6 (ej1<120. ej1<120. ): 2.144e+06 0 1480162 539312 0 8750 0 114500 0 844 78 35 176 0.77 Delta(sig*BR) = 1.292336 +/- 0.009952
cut #7 (MVA ): 9.252e+02 0 50 712 0 0 0 0 0 79 27 3 53 5.35 Delta(sig*BR) = 0.186991 +/- 0.010382
|
Remaining backgrounds: (OLD)
Code Block |
---|
$ sed 's/\./,/g' zzhpassing.dat | awk '{if (NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 762,5
12,5 1 106562 4f_sz 12,500000
12,5 1 106566 4f_sw 12,500000
12,5 1 106572 4f_sz 12,500000
12,5 1 106575 4f_zz 12,500000
12,5 1 106607 2f_z_ 12,500000
25 2 106551 4f_ww 12,500000
25 2 106553 4f_zz 12,500000
25 2 106571 4f_sz 12,500000
25 2 106573 4f_zz 12,500000
37,5 3 106574 4f_zz 12,500000
37,5 3 106578 4f_ww 12,500000
125 10 106608 2f_z_ 12,500000
162,5 13 106576 4f_zz 12,500000
237,5 19 106577 4f_ww 12,500000
|
Background contributions for Evis>140:
Code Block |
---|
[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '{if ($18>=140. && NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 187,5
12,5 1 106551 4f_ww 12,500000
12,5 1 106577 4f_ww 12,500000
25 2 106553 4f_zz 12,500000
25 2 106573 4f_zz 12,500000
25 2 106576 4f_zz 12,500000
37,5 3 106574 4f_zz 12,500000
50 4 106608 2f_z_ 12,500000
|
Background contributions for Evis<140: (OLD)
Code Block |
---|
[neal@localhost weights]$ sed 's/\./,/g' zzhpassing.dat | awk '{if ($18<140. && NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 575
12,5 1 106551 4f_ww 12,500000
12,5 1 106562 4f_sz 12,500000
12,5 1 106566 4f_sw 12,500000
12,5 1 106572 4f_sz 12,500000
12,5 1 106575 4f_zz 12,500000
12,5 1 106607 2f_z_ 12,500000
25 2 106571 4f_sz 12,500000
37,5 3 106578 4f_ww 12,500000
75 6 106608 2f_z_ 12,500000
137,5 11 106576 4f_zz 12,500000
225 18 106577 4f_ww 12,500000 <***************
|
Remaining backgrounds: (NEW)
Code Block |
---|
$ sed 's/\./,/g' zzhpassing.dat | awk '{if (NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n
Sum of weights = 762,5
12,5 1 106561 4f_sz 12,500000
12,5 1 106564 4f_sw 12,500000
12,5 1 106573 4f_zz 12,500000
12,5 1 106580 4f_zz 12,500000
12,5 1 106607 2f_z_ 12,500000
25 2 106551 4f_ww 12,500000
25 2 106562 4f_sz 12,500000
37,5 3 106575 4f_zz 12,500000
37,5 3 106608 2f_z_ 12,500000
50 4 106578 4f_ww 12,500000
62,5 5 106572 4f_sz 12,500000
100 8 106574 4f_zz 12,500000
125 10 106577 4f_ww 12,500000
237,5 19 106576 4f_zz 12,500000 <----
|
Evis<140 GeV and only idrup = 106577 for the background: (OLD)
Code Block |
---|
--- Classifier ( #signal, #backgr.) Optimal-cut S/sqrt(S+B) NSig NBkg EffSig EffBkg
--- --------------------------------------------------------------------------------------------------
--- Cuts: (309.89453, 1128865) -0.0050 0 0 0 0 0
--- Likelihood: (309.89453, 1128865) 1.0000 0.378385 299.2573 625191.7 0.9657 0.5538
--- Fisher: (309.89453, 1128865) 0.0088 1.63217 150.0501 8301.587 0.4842 0.007354
--- BDTG: (309.89453, 1128865) -0.9950 1.40141 134.4158 9065.234 0.4337 0.00803
--- BDT: (309.89453, 1128865) 0.0177 3.63521 38.5486 73.90136 0.1244 6.547e-05
--- --------------------------------------------------------------------------------------------------
after raising reco higgs mass cut from 50 GeV to 100 GeV:
--- Classifier ( #signal, #backgr.) Optimal-cut S/sqrt(S+B) NSig NBkg EffSig EffBkg
--- --------------------------------------------------------------------------------------------------
--- Cuts: (209.30833, 51800) -0.0050 0 0 0 0 0
--- Likelihood: (209.30833, 51800) 0.9105 1.08931 191.9554 30860.64 0.9171 0.5958
--- Fisher: (209.30833, 51800) 0.0176 1.78558 116.8125 4162.945 0.5581 0.08037
--- BDTG: (209.30833, 51800) -0.5098 1.00598 204.0195 40926.24 0.9747 0.7901
--- BDT: (209.30833, 51800) 0.0361 2.92564 72.9263 548.4119 0.3484 0.01059
--- --------------------------------------------------------------------------------------------------
|
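The `sed | awk | sort | uniq -c` pipelines used for the remaining-background listings above can be sketched in Python as follows. The field meanings are inferred from the printed output and the awk conditions, and are assumptions: field 3 is taken to be the process ID (idrup), field 7 the process name, field 15 the event weight and field 18 Evis. The function name and arguments are hypothetical.

```python
from collections import Counter


def summarize_passing(lines, evis_min=None, evis_max=None):
    """Count passing background events per (idrup, process, weight).

    Only lines with exactly 18 whitespace-separated fields are used,
    mirroring the NF==18 condition. Returns (rows, total), where rows is
    a list of (count*weight, count, idrup, process) sorted by summed weight.
    """
    counts = Counter()
    for line in lines:
        f = line.split()
        if len(f) != 18:
            continue
        evis = float(f[17])
        if evis_min is not None and evis < evis_min:
            continue  # keep only Evis >= evis_min
        if evis_max is not None and evis >= evis_max:
            continue  # keep only Evis < evis_max
        counts[(f[2], f[6], float(f[14]))] += 1

    rows = sorted((n * w, n, idrup, proc)
                  for (idrup, proc, w), n in counts.items())
    total = sum(r[0] for r in rows)
    return rows, total
```

With a file such as `zzhpassing.dat`, something like `rows, total = summarize_passing(open("zzhpassing.dat"), evis_max=140.0)` would correspond to the Evis<140 listing, and `evis_min=140.0` to the Evis>140 one.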
Plans:
- just noticed that the preselection nTrks cut is on the PFO count and not on the number of charged tracks shown in the distribution, so I will switch to using the charged-track count in the preselection
- may still be able to do better with the leptonic Z decay selection
- high evis events with well reconstructed leptonic Z decays and a clear 6 jet topology (using y56 cut)
- write report
...