Update on the HZZ 250 GeV +80/-30 250/fb Analysis for 17 September 2013:
Analysis:
- divide into 4 jet / 6 jet topologies
- apply preselection depending on topology
- train/apply TMVA's
- validate with cut table
- check remaining backgrounds
Distributions before preselection with only a cut on the reconstructed Higgs mass:
Preselection:
Evis<140.0: (4 Jet category) #y34>0.0#hmass>95. && hmass<140. #PTvisJETS>25.0 && PTvisJETS<70.0 #nTrks>5. #ej1<120. Evis>140: (6 jet category) #y34>0.0 #hmass>90. && hmass<160. #ej1<120. #jetthrust<0.98 #nTrks>5
Distributions after preselection:
aaa
(Note: the lepton pair mass plot has the entries at less than 10 GeV randomly set to 10 and 130 GeV so that the signal peak can be seen.)
The TMVA variables:
- Evis JETS
- Mvis JETS
- PTvis JETS
- B Likeness
- C likeness
- Number of HE Electrons
- Higgs Mass
- Z energy
- Zstr energy
- Z cos theta
- Zstr cos theta
- Z mass
- Zstr mass
- azimuthal separation of Z jets
- thrust
- Number Charged Tracks
- Number of electrons
- Number of muons
- ----NEW VARIABLES:
- y34
- y56
- lepton pair (PDG ID1 = -ID2) mass closest to Mz
- jet pair mass closest to mW
jet pair1 - mW
+
jet pair2 - mW
Performance of different MVA options:
BEFORE
[neal@localhost weights]$ grep -A 8 -i optimal- ../hzzv14-presel-v4-new-common0410results.txt --- Classifier ( #signal, #backgr.) Optimal-cut S/sqrt(S+B) NSig NBkg EffSig EffBkg -- --------------------------------------------------------------------------------------------- --- Cuts: ( 664.0744,1312202.5) -0.0050 0 0 0 0 0 --- Likelihood: ( 664.0744,1312202.5) 1.0000 0.728991 644.0848 779980.1 0.9699 0.5944 --- Fisher: ( 664.0744,1312202.5) 0.0120 1.70335 258.2099 22721.13 0.3888 0.01732 --- BDTG: ( 664.0744,1312202.5) -0.9823 1.17754 538.5353 208621.3 0.811 0.159 --- BDT: ( 664.0744,1312202.5) -0.0462 3.42709 100.6751 762.2891 0.1516 0.0005809 -- ---------------------------------------------------------------------------------------------
NOW:
if ((fp = fopen("HZZ-TMVA-vars-ffh_zz_all_SM_background_+80e-_-30e+-v6-mini.txt","r"))==NULL) exit(0); [neal@localhost test]$ source storeresultsHZZv4.sh hzz-withlep-withmW-n2-Mhiggspresel-plusptntrkp8g16y34nvtx --- ================================================================================================== --- Classifier ( #signal, #backgr.) Optimal-cut S/sqrt(S+B) NSig NBkg EffSig EffBkg --- -------------------------------------------------------------------------------------------------- --- Cuts: (1035.8644, 1196975) -0.0050 0 0 0 0 0 --- Likelihood: (1035.8644, 1196975) 1.0000 1.66886 315.8824 35510.98 0.3049 0.02967 --- Fisher: (1035.8644, 1196975) 0.0047 2.38122 586.8862 60157.76 0.5666 0.05026 --- BDTG: (1035.8644, 1196975) -0.9795 2.625 293.8675 12238.81 0.2837 0.01022 --- BDT: (1035.8644, 1196975) -0.0427 4.9889 146.0711 711.2005 0.141 0.0005942 --- --------------------------------------------------------------------------------------------------
Spent a lot of time playing with different preselection cuts and then settled for only making cuts where
I was cutting almost no signal away and then letting the MVA's do the rest.
Some signals are just not reasonable to try to select:
1) decays like l+l-H where H->ffnunu
Even in cases where the ff are leptons the initial l+l- are more energetic
and those were being selected.
Changed to checking all opposite signed lepton pairs for consistency with coming
from a Z.
|
|
HZZ generated sample stats |
|
|
|
|
|
|
|
|
|
events |
weight |
|
I108056.001.stdhep |
10000 |
20000 |
1.29E-004 |
e1e1h_zz.Gwhizard- |
I108055.001.stdhep |
10000 |
20000 |
4.36E-003 |
e1e1h_zz.Gwhizard- |
I108054.001.stdhep |
10000 |
20000 |
4.11E-004 |
e1e1h_zz.Gwhizard- |
I108053.001.stdhep |
10000 |
20000 |
2.70E-005 |
e1e1h_zz.Gwhizard- |
I108058.001.stdhep |
10000 |
20000 |
4.29E-003 |
e2e2h_zz.Gwhizard- |
I108057.001.stdhep |
10000 |
20000 |
4.00E-004 |
e2e2h_zz.Gwhizard- |
I108060.001.stdhep |
10000 |
20000 |
4.28E-003 |
e3e3h_zz.Gwhizard- |
I108059.001.stdhep |
10000 |
20000 |
4.00E-004 |
e3e3h_zz.Gwhizard- |
|
|
Sum = |
1.43E-002 |
10.40% |
I108062.001.stdhep |
10000 |
20000 |
2.54E-002 |
nnh_zz.Gwhizard-1_ |
I108061.001.stdhep |
10000 |
20000 |
3.01E-003 |
nnh_zz.Gwhizard-1_ |
|
|
Sum = |
2.84E-002 |
20.68% |
I108064.001.stdhep |
10000 |
20000 |
8.67E-002 |
qqh_zz.Gwhizard-1_ |
I108063.001.stdhep |
10000 |
20000 |
8.08E-003 |
qqh_zz.Gwhizard-1_ |
|
|
Sum = |
9.47E-002 |
68.93% |
|
|
|
|
|
|
|
|
|
|
|
|
All = |
1.37E-001 |
|
2) There was also a simple programming error which I've already fixed.
Cut table for BDT: (NEW)NEW:
cuts__ (Cut Name ): all others 2f 4f 6f aa 1f 3f 5f q e+mu tau nu Signif. cut #0 (all ): 7.769e+07 625000 14507062 1019938 0 54073784 0 7464875 0 948 91 45 262 0.15 Delta(sig*BR) = 6.548594 +/- 0.046879 cut #1 (y34>0.y34>0. ): 7.705e+07 625000 14449800 1018312 0 53568784 0 7387500 0 947 91 45 262 0.15 Delta(sig*BR) = 6.523157 +/- 0.046724 cut #2 (95.<hmass<140. 95.<hmass<140. ): 1.257e+07 150000 10264438 825238 0 387992 0 938000 0 901 86 43 213 0.35 Delta(sig*BR) = 2.854636 +/- 0.021075 cut #3 (25.<PTvisJETS<70 PTvisJETS no cut ): 5.637e+06 25000 4545812 744212 0 8750 0 312125 0 866 84 37 180 0.49 Delta(sig*BR) = 2.033046 +/- 0.015436 cut #4 (nTrks>5 nTrks>5 ): 3.826e+06 0 2957175 679838 0 8750 0 179500 0 866 81 36 176 0.59 Delta(sig*BR) = 1.688536 +/- 0.012688 cut #5 (jetthrust no cut jetthrust < 0.98 ): 3.826e+06 0 2957175 679838 0 8750 0 179500 0 866 81 36 176 0.59 Delta(sig*BR) = 1.688536 +/- 0.012688 cut #6 (ej1<120. ej1<120. ): 2.144e+06 0 1480162 539312 0 8750 0 114500 0 844 78 35 176 0.77 Delta(sig*BR) = 1.292336 +/- 0.009952 cut #7 (MVA ): 9.252e+02 0 50 712 0 0 0 0 0 79 27 3 53 5.35 Delta(sig*BR) = 0.186991 +/- 0.010382
Remaining backgrounds: (NEW)
$ sed 's/\./,/g' zzhpassing.dat | awk '{if (NF==18) print $3,$7,$15}' | sort -n | uniq -c | awk '{if (NF==4) aa+=($1*$4);print $1*$4"\t"$0} END{print "Sum of weights = "aa}' | sort -k 1,1 -n Sum of weights = 762,5 12,5 1 106561 4f_sz 12,500000 12,5 1 106564 4f_sw 12,500000 12,5 1 106573 4f_zz 12,500000 12,5 1 106580 4f_zz 12,500000 12,5 1 106607 2f_z_ 12,500000 25 2 106551 4f_ww 12,500000 25 2 106562 4f_sz 12,500000 37,5 3 106575 4f_zz 12,500000 37,5 3 106608 2f_z_ 12,500000 50 4 106578 4f_ww 12,500000 62,5 5 106572 4f_sz 12,500000 100 8 106574 4f_zz 12,500000 125 10 106577 4f_ww 12,500000 237,5 19 106576 4f_zz 12,500000