Statistical Analysis Methods In High-Energy Physics

1y ago
2 Views
1 Downloads
6.14 MB
78 Pages
Last View : 1m ago
Last Download : 3m ago
Upload by : Isobel Thacker
Transcription

Statistical analysis methods inHigh-Energy PhysicsPart IIINicolas Berger (LAPP Annecy)

OutlineProfilingLook-Elsewhere EffectBayesian methodsStatistical modeling in practiceBuilding binned likelihoodsChoosing PDFs in unbinned likelihoodsImplementing systematicsBLUE

Frequentist ConstraintsPrototype: NP measured in a separate auxiliary experimente.g. luminosity measurement Build the combined likelihood of the main auxiliary measurementsL(μ , θ ; data) L main (μ ,θ ; main data) Laux ( θ ; aux. data )Independentmeasurements:Þ just a productobsGaussian form often used by default: Laux ( θ ; aux. data) G (θ ; θ ,σ syst )In the combined likelihood, systematic NPs are constrained now same as other NPs: all uncertainties statistical in nature Often no clear setup for auxiliary measurementse.g. theory uncertainties on missing HO terms from scale variations Implemented in the same way nevertheless (“pseudo-measurement”)3

Likelihood, the full version (binned case)Expected( k ) k 1. . ncatk)ii 1. n(dataL(μ , {θ j } j 1. . n ; {n }NP, {θobsjj 1. . n NP}bin yield) n cat k 1nsystP [ ni ;μ ϵ i , k ( ⃗θ ) N S , i , k ( ⃗θ ) B i , k ( ⃗θ) ]Bin Yields orPOIObservableexperimentsNPsSystematicsSig/Bkg Shapes,valuesPseudo- j 1obsG(θ j ;θ j ;1)efficienciesAuxiliaryDataMCData number of categories!4

Wilks’ TheoremThe likelihood usually has NPs: Systematics Parameters fitted in data What values to use when defining the hypotheses ? H(S 0, θ ?)Answer: let the data choose Þ use the best-fit values (Profiling)Þ Profile Likelihood Ratio (PLR) L(μ μ 0, θ μ )t μ 2 log L( μ , θ)00 θ μ best-fit value for μ μ 0 (conditional MLE)0θ overall best-fit value (unconditional MLE)Wilks’ Theorem: PLR also follows a χ2 ! Profiling “builds in” the effect of the NPsf ( t μ μ μ 0 ) f χ ( nÞ Can treat the PLR as a function of the POI only02dof 1)( tμ )0also with NPs present5

Gaussian ProfilingMeasure N(S,θ) S θ : Main measurement n G(S θ, σn) constraint (aux. meas.) : θobs G(θ, σθ)Then:MLEs:PLR:n ( S θ)λ ( S , θ) σn(obsS n θθ θ obs2) (θL ( S , θ) G ( n ; S θ , σ n) G ( θ obs ; θ , σ θ )obs θσθConditional MLE:)For S Ŝ, matches2MLE as it should θ(S) θ obs σ 2θ( S S)22σ n σ θStatistical L ( S S0, θ S )t S 2 log L( S , θ)SystematicUncertainty0Uncertainty0 λ ( S0, θ ( S0 )) λ ( S , θ) 2( S0 S)22σS 2σn σθ2σn σθStat uncertainty (on n) and syst (on θ) add in quadrature as expected6

Effect of ProfilingSystematics still affect the result even after profiling their NPs!e.g. again counting experiment: N(S,θ) S θ, measure n, constraint on θ 0.L ( S0 )t S 2 log L( S)1. No NP: N(S) S Ŝ fit: adjust S to N(Ŝ) Ŝ n S S0 fit: S S0 fixed Þ N(S0) S0, cannot adjustÞ tension between N(S0) S0 and n Þ large tS0 Þ strong exclusion of H(S0)0 L ( S S0 , θ S )t S 2 log L( S , θ)02. With NP: N(μ,θ) S θ Ŝ̂ fit: adjust N(Ŝ, θ̂) N(Ŝ, θ̂ 0) n using S only (avoid penalty on θ) S S0 fit: S S0 fixed, but θ̂(S0) can still pull N(S0,θ̂(S0)) towards nÞ smaller tS0 Þ reduced exclusion of H(S0)0N(S0)tS0, no systematicsN(S0, θ̂S0)tS0, with systematicsN(Ŝ)N(Ŝ, θ̂ 0)nNMore freedom Weaker exclusion7

Uncertainty decompositionAll systematics NPs fixed : statistical uncertainty onlyexp. syst. NPs fixed : stat theory uncertaintySubtraction in quadrature1σ intervalsσ syst σ 2total σ 2statσ theo σ 2total σ 2theoμ 0.99 0.12 (stat) 0.06 (syst) 0.06 ( theo)8

Gaussian ProfilingGaussian measurement with 1 POI μ and 1 NP θ:σ 2μC γ σμ σθ[T 1 μ μ 1 μ μ L(μ , θ ; μ , θ) exp C 2 θ θθ θ“data”[ ( ) ( )] λ(μ, θ) defines an ellipse: F μ μ (μ μ ) 2 2 Fμ θ (μ μ )(θ θ) F θ θ ( θ θ) 2λ (μ ,θ ; μ , θ)Uncertainty on μ: From C, with θincluded:γ σμ σ θσ 2θ]F C 1FFμ θ μμFμ θ F θ θ[θσμσθσμ( μ , θ )μ9]

Gaussian Profilingσ 2μC γ σμ σθ[ F μ μ (μ μ ) 2 2 Fμ θ (μ μ )( θ θ) Fθ θ ( θ θ) 2λ (μ ,θ ; μ , θ)F [Fμ μ Fμ θFμ θ F θθγ σμ σ θσ 2θ]]Profiled θ (minimize λ at fixed μ) : 1θ(μ ) θ F θ θ Fθ μ (μ μ )Profile likelihood ratio: μ μ 2 12 ( Fμ μ Fμ θ F 1 λ (μ , θ(μ) ; μ , θ)F(μ μ) C(μ μ) θθθμ )μμσμ(Proof of Wilks’ theorem. 1F μ μ C μ μ !!Uncertainty on μ: From C:From PLR:)2θ ( μ )θσμσμ( μ , θ )Profiled θ crosses ellipse atvertical tangents bydefinition (L is lower at otherpoints on the tangent)μ10

Gaussian Profiling F μ μ (μ μ ) 2 2 Fμ θ (μ μ )( θ θ) Fθ θ ( θ θ) 2λ (μ ,θ ; μ , θ)F C For fixed θ θ̂, λ(μ) defines an interval:2 F μ μ (μ μ ) λ (μ ,θ θ ; μ , θ)From C: From PLR: From λ(μ):σμσμσμσ μ 1 γ2) 11 γ 22[γσμ σ θ1σ 2θ]θUncertainty on μ: (μ μ 11σ 2μγσμ σθσμ21 γ 1 γ2( μ , θ )μ11

Gaussian Profiling F μ μ (μ μ ) 2 2 Fμ θ (μ μ )( θ θ) Fθ θ ( θ θ) 2λ (μ ,θ ; μ , θ)F C For fixed θ θ̂, λ(μ) defines an interval:2 F μ μ (μ μ ) λ (μ ,θ θ ; μ , θ)From C: From PLR: From λ(μ):σμσμσμσ μ 1 γ2) 11 γ 22[γσμ σ θ1σ 2θ]θUncertainty on μ: (μ μ 11σ 2μγσμ σθTotal uncertainty21 γ σμ σμ 1 γ2( μ , θ )Stat uncertainty ( 1 γ σ )2μSyst uncertainty2 ( γ σμ )2μ12

Profiling Example: ttH bbAnalysis uses low-S/B categories to constrain backgrounds. Reduction in large uncertainties on tt bkg Propagates to the high-S/B categories through thestatistical modelingÞ Care needed in the propagation (e.g. differentkinematic regimes)ATLAS-CONF-2016-080Fit13

Pull/Impact plotsATLAS-CONF-2016-058Systematics are described by NPsincluded in the fit. Nominally: NP central value 0 : corresponds tothe pre-fit expectation (usually MC) NP uncertainty 1 : since NPsnormalized to the value of the syst. :N N 0 (1 σ syst θ) , θ G ( 0 , 1)Fit results provide information onimpact of the systematic on the result: If central value ¹ 0: some datafeature absorbed by nonzero valueÞ Need investigation if large pull If uncertainty 1 : systematic isconstrained by the dataÞ Needs checking if this legitimateor a modeling issue Impact on result of 1σ shift of NP14

Pull/Impact plots13 TeV single-tATLAS-CONF-2016-058XS (arXiv:1612.07231)Systematics are described by NPsincluded in the fit. Nominally: NP central value 0 : corresponds tothe pre-fit expectation (usually MC) NP uncertainty 1 : since NPsnormalized to the value of the syst. :N N 0 (1 σ syst θ) , θ G ( 0 , 1)Fit results provide information onimpact of the systematic on the result: If central value ¹ 0: some datafeature absorbed by nonzero valueÞ Need investigation if large pull If uncertainty 1 : systematic isconstrained by the dataÞ Needs checking if this legitimateor a modeling issue Impact on result of 1σ shift of NP15

Profiling TakeawaysSystematic NP with an external constraint (auxiliary measurement). No special treatment, treated like any other NP: statistical and systematicuncertainties represented in the same way.When testing a hypothesis, use the best-fit valuesof the nuisance parameters: Profile Likelihood Ratio. L(μ μ 0, θμ ) L( μ , θ)0Wilks’ Theorem: the PLR has the same asymptotic properties as the LR withoutsystematics: can profile out NPs and just deal with POIs.Profiling systematics includes their effect into the total uncertainty. Gaussian:σ total σ 2stat σ 2systGuaranteed to work only as long as everything is Gaussian, but typicallyrobust against non-Gaussian behavior.Profiling can have unintended effects – need to carefully check behavior16

Beyond Asymptotics: ToysCMS-PAS-HIG-11-022Asymptotics usually work well, but break down insome cases – e.g. small event counts.Solution: generate pseudo data (toys) using the PDF,under the tested hypothesis Also randomize the observableobsG(θ; θ , σ syst )(θ ) of each auxiliary experiment:obs Samples the true distribution of the PLR Integrate above observed PLR to get the p-value Precision limited by number of generated toys,Small p-values (5σ : p 10-7!) Þ large toy samplesRepeat Ntoys timesp(data x)q0PDFPseudo data17

Toys: ExamplearXiv:1708.00212ATLAS X Zγ Search: covers 200 GeV mX 2.5 TeV for mX 1.6 TeV, low event counts Þ derive results from toysAsymptotic results (in gray) give optimistic result compared to toys (in blue)18

Comparison with LEP/TeVatron definitionsLikelihood ratios are not a new idea: LEP: Simple LR with NPs from MC – Compare μ 0 and μ 1Tevatron: PLR with profiled NPsL(μ 0, θ)q LEP 2 logL(μ 1, θ) L(μ 0, θ 0 )q Tevatron 2 log L(μ 1, θ 1 )Both compare to μ 1 instead of best-fit μ̂LEP/TevatronLHCH0H0μ 1H1H1 Asymptotically: LEP/Tevaton: q linear in μ Þ Gaussian LHC: q quadratic in μ Þ χ2Andrey Korytov, EPS 2011μ 0 Still use TeVatron-style for discrete cases19

Spin/Parity MeasurementsPhys. Rev. D 92 (2015) 01200420

Summary of Statistical Results ComputationMethods provide: Optimal use of information from the data under general hypotheses Arbitrarily complex/realistic models (up to computing constraints.) No Gaussian assumptions in the measurementsStill often assume Gaussian behavior of PLR – but weaker assumption andcan be lifted with toysSystematics treated as auxiliary measurements – modeling can be tailoredas needed Single PLR-based framework for all usual classes of measurementsDiscovery testingUpper limits on signal yieldsParameter estimation21

OutlineProfilingLook-Elsewhere EffectBayesian methodsStatistical modeling in practiceBuilding binned likelihoodsChoosing PDFs in unbinned likelihoodsImplementing systematicsBLUE

Look-Elsewhere Effect23

Look-Elsewhere effectSometimes, unknown parameters in signal modele.g. p-values as a function of mXÞ Effectively performing multiple, simultaneoussearches If e.g. small resolution and largescan range, many independent experiments More likely to find an excessanywhere in the range, ratherthan in a predefined location Look-elsewhere effect (LEE)Testing the same H0, but againstdifferent alternatives different p-values24

Global SignificanceProbability for a fluctuation anywhere in the range Global p-value.at a given location Local p-valueGlobalp-valueNp global 1 (1 p local ) N p localTrials factorLocalp-value pglobal plocal Þ Zglobal Zlocal – global fluctuation more likely less significantTrials factor : naively # of independent intervals:However this is usually wrong – more on this later?scan rangeN trials N indep peak widthFor searches over a parameter range, pglobal is the relevant p-value Depends on the scanned parameter rangese.g. X γγ : 200 mX 2000 GeV, 0 ΓX 10% mX. However what comes out of the usualasymptotic formulas is plocal.How to compute pglobal ? Toys (brute force) or asymptotic formulas.25

Global Significance from ToysLocal 3.9σPrinciple: repeat the analysis in toy data: generate pseudo-dataset perform the search, scanning over parametersas in the data report the largest significance found repeat many times The frequency at which a given Z0 is found is the global p-valuee.g. X γγ Search: Zlocal 3.9σ ( plocal 5 10-5),scanning 200 mX 2000 GeV and 0 ΓX 10% mX In toys, find such an excess 2% of the time pglobal 2 10-2, Zglobal 2.1σ Less exciting. Exact treatment CPU-intensive especially for large Z (need O(100)/pglobal toys)26

Global Significance from AsymptoticsPrinciple: approximate the global p-value in the asymptotic limit reference paper: Gross & Vitells, EPJ.C70:525-530,2010Nindep Asymptotic trials factor (1 POI): Trials factor is not just Nindep,also depends on Zlocal !scan rangepeak widthN trials 1 π N indep Z local2 Why ? slice scan range into Nindep regionsof size peak width search for a peak in each region Indeed gives Ntrials Nindep.However this misses peaks sitting onedges between regions true Ntrials is Nindep!27

Global Significance from AsymptoticsPrinciple: approximate the global p-value in the asymptotic limit reference paper: Gross & Vitells, EPJ.C70:525-530,2010Nindep Asymptotic trials factor (1 POI): Trials factor is not just Nindep,also depends on Zlocal !scan rangepeak widthN trials 1 π N indep Z local2 Why ? slice scan range into Nindep regionsof size peak width search for a peak in each region Indeed gives Ntrials Nindep.However this misses peaks sitting onedges between regions true Ntrials is Nindep!28

Illustrative ExampleTest on a simple example: generate toys with flat background (100 events/bin) count events in a fixed-size sliding window, look for excessesTwo configurations:1. Look only in 10 slices of the full spectrum2. Look in any window of same size as above, anywhere in the spectrumPredefinedSlicesExample toyLargest excess in predefined slicesLargest excess anywhere29

Illustrative Example (2)Very different results if the excess is near a boundary :1. Look only in 10 slices of the full spectrum2. Look in any window of same size as above, anywhere in the spectrum30

Illustrative Example (3)locZepindeverywhere:Ntrials 1 Zlocalpglobal(Zlocal)alSearch 2 πNNormalizedZlocal distributionSearch in predefinedbins: Ntrials 10oNEELSearch everywhereSearch in predefined binsSearching everywhere gives theextra Zlocal dependence31

ZGlobal Asymptotics ExtrapolationN trials 1 π N indep Z local2Asymptotic trials factor (1 POI): How to get Nindep ? Usually work with a slightly different formula:2N trials 1 1p local⟨ N up ( Z test ) ⟩ e2Zlocal Ztest2Number of excesses with Z Ztest calibrate for small Ztest, apply result to higher Zlocal.Can choose arbitrarily small Ztest many excesses can measure Nup in data (1 “toy”)Can also measure Nup in multiple toysif large stat uncertainty fromtoo few excessesNup 20ZtestZlocal32

In 2DO. Vitells and E. Gross, Astropart. Phys. 35 (2011) 230Generalization to 2D scans: considersections at a fixed Ztest, compute itsEuler characteristic φ, and use Generalizes 1Dbump counting1–4φ 21–1 0 -35Now need to determine2 constants N1 and N2,from Euler φ measurementsat 2 different Ztest values.33

OutlineProfilingLook-Elsewhere EffectBayesian methodsStatistical modeling in practiceBuilding binned likelihoodsChoosing PDFs in unbinned likelihoodsImplementing systematicsBLUE

Bayesian Methods35

Frequentist vs. BayesianAll methods described so far are frequentist Probabilities (p-values) refer to outcomesif the experiment were repeated identicallymany times Parameters value are fixed but unknown Probabilities apply to measurements: “mH 125.09 0.24 GeV” :Experiment 6Experiment 5Experiment 4Experiment 3Experiment 2Experiment 1μ*–σμ*μ* σ i.e. [125.09 – 0.24 ; 125.09 0.24 ] GeV has p 68% to contain the true mH. if we repeated the experiment many times, we would get differentintervals, 68% of which would contain the true mH. “5σ Higgs discovery” if there is really no Higgs, such fluctuations observed in 3.10 -7 of experimentsNot exactly the crucial question – what we would really like to know isWhat is the probability that the excess we see is a fluctuation we want P(no Higgs data) – but all we have is P(data no Higgs)36

Frequentist vs. BayesianCan use Bayes’ theorem to address this:P ( data μ )P (μ data) P (μ )P ( data)same as in the frequentistformalism ( likelihood)Prior Probabilityirrelevant normalization factorCan compute P(μ data), if we provide P(μ) Implicitly, we have now made μ into a random variable– Is mH, or the presence of H(125), randomly chosen ?– In fact, different definition of p: degree of belief, not from frequencies.– P(μ) Prior degree of belief – critical ingredient in the computationCompared to frequentist PLR: answers the “right” question answer depends on the prior“Bayesians address the questionseveryone is interested in by usingassumptions that no one believes.Frequentist use impeccable logic todeal with an issue that is of nointerest to anyone.” - Louis Lyons37

Bayesian methodsProbability distribution ( likelihood) : same form as frequentist case, butP(θ) constraints now priors for the systematics NPs, P(θ)not auxiliary measurements P(θmes; θ) Simply integrate them out, no need for profiling: P (μ) P (μ ,θ) d θ Use probability distribution P(μ) directly for limits, credibility intervalsBe.g. define 68% CL (“Credibility Level”) interval [A, B] by: P (μ ) d μ 68 %A No simple way to test for discovery Integration over NPs can be CPU-intensivePriors : most analyses still using flat priors in the analysis variable(s)Þ Parameterization-dependent: if flat in σ B , then not flat in κ Can use the Jeffreys’ or reference priors, but difficult in practiceFrequentist-Bayesian Hybrid methods (“Cousins-Highland”) Integrate out NPs as in Bayesian measurements Once only POIs left, Use P(data μ) in a frequentist way “Bayesian NPs, frequentist POIs” Some use in Run 1, now phased out in favor of frequentist PLR.38

Bayesian methods and CLs: CLs computationGaussian counting with systematic on background: n S B σsystθL( n ; S , θ) G ( n ; S B σ syst θ , σ stat ) G( θ obs 0 ;θ ,1)MLE: S n B Conditional MLE: θ(μ) σ syst22σ stat σ systPLR : λ (μ ) ( n S B)( S B n22σ stat σ syst)2Gaussian from previous studies, CLs limit isCL s :SCL sup[ n B Φ 1(1 0.05 Φ( n B22σ stat σ syst) )] σ2stat2 σ syst39

Bayesian methods and CLs: Bayesian caseGaussian counting with systematic on background: n S B σsystθP ( n S ,θ ) G ( n ; S B σ syst θ , σ stat ) G ( θ 0, 1)Bayesian: G(θ) is actually a prior on θ perform integral (marginalization)same effect as profiling!P ( n S) G ( S ; n B , σ 2stat σ 2syst )Need P(S n) a prior for S – take flat PDF over S 0 Truncate Gaussian at S 0: P ( S n) P ( n S) P ( S)22n B[ ( )])] [ ( )]P ( S n) G ( S ; n B , σ stat σ syst ) ΦBayesian Limit: P ( S n) dS 5 % SupBayesSup[ ( n B ΦS up ( n B)[ ( 1 Φ 1σ2stat σ1 0.05 Φ( 12systσ 2stat σ 2systn BΦσ 2stat σ 2systn Bσ2stat σ 12syst) )]22 σ stat σ systsame result as CLs!40

Example: W’ lν Search arXiv:1706.04786POI: W’ σ B use flat prior over [0, [.NPs: syst on signal ε (6 NPs), bkg (6), lumi (1) integrate over Gaussian priors41

Why 5σ ?One-sided discovery:5σ p0 3 10-7 1 chance in 3.5M Overly conservative ?Local 3.9σ, p0 5E-5 Do we even know the sampling distributions so far out ?Global 2.1σ, p0 2E-2Reasons for sticking with 5σ (from Louis Lyons): LEE : searches typically cover multipleindependent regions Global p-value is the relevant oneNtrials 1000 : local 5σ O(10-4) more reasonableMismodeled systematics: factor 2 error insyst-dominated analysis factor 2 error on Z History: 3σ and 4σ excesses do occur regularly,for the reasons above “Subconscious Bayes Factor” : p-value should beat least as small as the subjective p(S): P( fluct) P ( fluct B) P ( B)P( fluct S) P( S) P ( fluct B) P ( B)Extraordinary claims require extraodinary evidence Stay with 5σ.42

OutlineProfilingLook-Elsewhere EffectBayesian methodsStatistical modeling in practiceBuilding binned likelihoodsChoosing PDFs in unbinned likelihoodsImplementing systematicsBLUE

Statistical Modeling: in Practice44

Bulding statistical modelsSo far focus has been on concepts, but building a statistical model alsorequires numerical inputs: Data PDFs for all model components Constraint PDFs for all sources systematics Impact of each systematic uncertainty on all relevant model parameters Statistical methods are only as accurate (and/or optimal) as the descriptionprovided by the model!Technically, MC simulation provides most of these inputs. However 2problematic issues: Potential MC/data differences Limited MC statisticsWhich need to be addressed with (yet more) systematics.45

Statistical Modeling:I. Component PDFs46

PDFs : Binned likelihoodEur. Phys. J. C (2012) 72: 2241Binned case: PDF usually just a normalized histogram, from MC sample or Data control region (CR) Statistical uncertainties on the prediction: Data CR: counts as statistical uncertainty MC sample: uncertainty can be reduced without collecting more data(just need more CPU!) Counted as systematicJHEP 12 (2017) 024Independent counts in each bin a separate MC statistics NP in each bin Poisson constraints Pois(NiMC; Nitrue)Total uncertainty σ2data stats σ 2MC stats . need enough MC to avoid spoiling the sensitivity!47

MC Statistics Requirementse.g. Discovery: Total uncertainty: σ S 2 need σ MC stats σ data statsBMC BdataBy how much ? σ2data stats σ 2MC stats .σMC stats/σdata statsσdata MC stats/σdata stats111.4140.51.12250.21.02BMC/Bdata(α)(1/ α)In the presence of a signal (e.g. limit-setting,Nsig measurement), relevant uncertainty is (S B). S/B also matters:σS S [ (1 α-1) )Eur. Phys. J. C (2012) 72: 2241S B data11 B B MC 1 S/ Blow S/B : same problem as for discoveryhigh S/B : no issue, dominated by uncertaintyon signal itself.48

PDF shapes: Unbinned likelihoodSmooth backgrounds : Describe distribution using appropriate function Unbinned likelihood. Describes sideband signal region in one fit.Phys. Rev. Lett. 118 (2017), 191801Phys. Lett. B241 (1990) 278-282Crystal BallFunctionGaussiansARGUS function M2M21 exp[ a 1 2 ]E2E()ExpBDT 0.9Functions helpsmooth MC statsfluctuationsS. Oggero Ph. D. Thesis49

PDF Shapes: Unbinned likelihoodWidely used in HEP for smooth backgrounds ( no resonances or threshold)H γγ MeasurementsX jj SearchPhys.Lett. B754 (2016) 302-322exp(-a.m b.m2)(Gaussian )Ma 1 E(bME)( )c50

Signal Bias in Unbinned likelihoodsFunction usually ad-hoc (no closed form expression for (theory detectoreffects), or usually even theory by itself ) may not accurately describe the data Introduce free parameters, fit in sidebands Jan 2012 Higgs search paper(4.9 fb-1 of 2011 data) Biases may still remain due toexponentialfunctional form itselfProblematic especially for low S/B small mismodelings of B can be largecompared to S. χ2 test in sideband may not help: evena large bias on the scale of S ( B) mayremain within stat errors in the sideband!2.5σSituation doesn’t improve with more luminosity: Reduced statistical uncertainties in sideband, but Also reduced σS, in the same proportion51

Signal Bias in Unbinned likelihoodsFunction usually ad-hoc (no closed form expression for (theory detectoreffects), or usually even theory by itself ) may not accurately describe the data Introduce free parameters, fit in sidebands Jan 2012 Higgs search paper(4.9 fb-1 of 2011 data) Biases may still remain due topolynomialfunctional form itselfProblematic especially for low S/B small mismodelings of B can be largecompared to S.3.0σ χ2 test in sideband may not help: evena large bias on the scale of S ( B) mayremain within stat errors in the sideband!Situation doesn’t improve with more luminosity: Reduced statistical uncertainties in sideband, but Also reduced σS, in the same proportion52

Signal Bias in Unbinned likelihoodsIf data cannot fix B shape, use MC Measure signal bias NSS on “credible”shapes taken from MC (Spurious signal) take the maximum bias as systematicWorks well if the true distribution is somewherein the space of MC distributions scanned Also Impose:NSS 20% σstat (small contribution to σtotal)ORNSS 10% Sexp (small bias on measured S)Second criterion more stringent at higher S/ B.If criteria are not met, move to more complexfunctions ( more free parameters)53

Signal Bias in Unbinned likelihoodsProblem: for small MC stats, measured bias dominated by fluctuations again, need high MC stats (BMC 25 Bdata) when S/B is low.σMC stats/σdata statsσdata MC stats/σdata stats1100%1.41450%1.122520%1.02BMC/Bdata(α)(1/ α)[ (1 α-1) )NSS 20% σstat Can compromise on criterion level(50% instead of 20% ?) As before, better situation at at high S/BPhys. Rev. Lett. 118, 182001 (2017)54

Usual FunctionsComm. Soc. Math. Kharkov 13, 1-2, 1912.Polynomials: various basis choices (Chebyshev, Bernstein, )Bernstein basis:k kn kB ( x) x (1 x)for 0 x 1k,n( n) Positive coefficients positive polynomialeverywhere, useful to avoid numerical issuesin -2 log(PDF) computationExponential family : exp(polynomial)Power laws : xα, xα(1-x)β, JINST 10 (2015) no.04, P04015GaussiansCrystal Ball Functions Sums of the above Convolutions (resolution Breit-Wigner, .)55

Discrete ProfilingIdea: treat the type of function andnumber of parameters as discreteNPs, profiled in data Let data choose the best shape Similar principle as other NPs,except for discrete nature Need a penalty on Npars to avoidalways choosing functions with high NparsJINST 10 (2015) no.04, P04015Take lower envelope of allfunctions when profiling Used in the CMS H γγ analysis,works well in this context.Caveats: for N categories and M functional forms, MNpossibilities to check in principle – difficult in practice Need a well-chosen pool of sensible functions forthe method to work Large MC samples for selection and checks56

Gaussian Processes: 1-slide SummaryImage Credits:K. Cranmer57

Gaussian Processes: Longer 1-slide SummaryDescribe background distribution through the correlations between valuesat different points.2(x x) More flexible than a functional formK ( x1 , x2 ) exp 1 2 22L Correlation function (Kernel) can be []– Defined using a length scale, to ignore narrow peaks– Obtained from first principles (e.g. from known JES/PDF effects)arXiv:1709.05681 More flexible than functional form, degrees of freedom less ad-hoc Still need large MC samples to check for signal bias Mainly for Gaussian processes, not well-adapted to Poisson regime58

Statistical Modeling:II. Systematics59

Systematics NPsEach systematics NP represent an independent source of uncertainty Usually constrained by a single 1-D PDF (Gaussian, etc.)Sometimes multiple parameters conjointly constrained by an n-dim. PDF. multiple measurements constraining multiple NPsAssume n-dim Gaussian PDF: then can diagonalize the covariance matrix Cand re-express the uncertainties in basis of eigenvector NPs n 1-dim PDFsCan also diagonalize to prune irrelevant uncertainties: keep NPs with largeeigenvalues, combine in quadrature the othersPhys.Rev. D96 (2017) no.7, 07200280 NPs19 NPs vs 803 NPs vs. 8060

Systematics : Impact on ModelThe effect of each NP θi should be propagatedto all the relevant model parameters Xj. Propagation through MC:1. Apply 1σ systematic variations in MC, obtain shifted values Xj Xj0 (1 Δij). Possibly smooth out MC stats effects2. Implement systematic in model, e.g. replaceor morph shapes:θ -1θ 0Constrained by unit Gaussian0X j X j ( 1 Δ ij θ i )θ 1 can affect event yields, shapes, etc.Assumes Gaussian uncertainties and linear impact on model parameters61

Systematics : ConstraintsIdeally, constraint likelihood of auxiliary measurement e.g. Poisson for constraint from counting in a low-stat CR.Sometimes no clear auxiliary measurement Semi-arbitrary “pseudo-measurement” motivated by Central Limit Theorem: Gaussian for additive corrections Log-normal for multiplicative correctionsConstrained by unit GaussianGaussian:0 represent impact as X j X j ( 1 Δ ij θ i ) or similar morphing for distributionsCan include asymmetric variations Δ , Δ-:Xj X0j Δ θ θi 01 ij iΔ ij θ i θ i 0( {})However discontinuity in derivative at 0, so use smooth interpolation instead,e.g. implementation in RooStats::HistFactory::FlexibleInterpVar.62

Systematics : Log-normal ConstraintLog-normal: x log-normal if log(x) is normal always 0, useful to avoid numerical issues PDF:11 log ( x) X 0P ( s ; X 0, κ) exp κ2x κ 2 π((2))However usually simpler to implement as :0X j X j exp( κ ij θ i )where θi is constrained by a unit Gaussian as usual Correct form for a multiplicative uncertainty:nRMS( log ( k ))n 1log ( X 0 k 1) ( X 0 k2 ) .( X 0 k n ) log( X 0 ki ) G ( log X 0 , κ)n i 1 nnSimilarly to Gaussian represent X X0 eκθ G(log X0, κ) if θ G(0,1)Which κ to use ? κ RMS(X) only at first order. For larger uncertainties,e.g. Match 1σ variations: Xj(θ 1) Xj κ log ( X j / X 0j )Implemented in RooStats::HistFactory::FlexibleInterpVar.63

Systematics : Theory ConstraintsMissing high-order terms in perturbative calculations: evaluate from scalevariations – but no underlying random process. Possible constraint shapes: Gaussians (ATLAS/CMS Higgs analyses, see Yellow Report 4, I.4.1.d) Usually several independent “sources” ofuncertainty(QCD/EW/resummation) overall uncertainty may be rather Gaussian Numerically well-behaved Uncertainties add in quadrature as usual Flat constraints : “100% confidence” intervals no preference for any value in the range Need regularization to avoid numericalissues uncertainties add linearly For Higgs cross-sections, rather similar results for both cases64

Constraints : Two-point systematicsSometimes differences between 2 discrete cases e.g. Pythia vs. HerwigSolutions: Results for one case only Full results for both cases Single result with an uncertainty that covers the difference Two-point uncertaintyUsually implemented as 1D linear interpolations between the two cases However cannot guarantee this covers the space ofpossible configurations This is not even a pseudo-measurement.Ideally, need to define proper uncertainties withina single model, which would cover the other cases e.g. showering uncertainties within Pythia,covering Herwig Usually a difficult taskW. Verkerke, SOS 201465

Too simple modeling can have unintended effects e.g. single Jet E scale parameter:Þ Low-E jets calibrate high-E jets – intended ?JESProfiling IssuesθJESPre-fitPost-fitJet ETwo-point uncertainties: Interpolation may not cover full configurationspace, can lead to too-strong constraintsPre -fit constraintPost -fit constraintW. Verkerke, SOS 2014NP central values and uncertainties in pull/impact plotsprovide i

3 Frequentist Constraints Prototype: NP measured in a separate auxiliary experiment e.g. luminosity measurement Build the combined likelihood of the main auxiliary measurements Gaussian form often used by default: In the combined likelihood, systematic NPs are constrained now same as other NPs: all uncertainties statistical in nature Often no clear setup for auxiliary measurements

Related Documents:

Statistical Methods in Particle Physics WS 2017/18 K. Reygers 1. Basic Concepts Useful Reading Material G. Cowan, Statistical Data Analysis L. Lista, Statistical Methods for Data Analysis in Particle Physics Behnke, Kroeninger, Schott, Schoerner-Sadenius: Data Analysis in High Energy Physics: A Practical Guide to Statistical Methods

STATISTICAL METHODS 1 STATISTICAL METHODS Arnaud Delorme, Swartz Center for Computational Neuroscience, INC, University of San Diego California, CA92093-0961, La Jolla, USA. Email: arno@salk.edu. Keywords: statistical methods, inference, models, clinical, software, bootstrap, resampling, PCA, ICA Abstract: Statistics represents that body of methods by which characteristics of a population are .

Statistical methods are profoundly and widely used in biology and medicine. In biology, there are research areas dedicated to the application of statistical methods in biology; it comprises biometrics, biostatistics; in medical science statistical methods are used for the analysis of experimental data and clinical observations,

advanced statistical methods. The paper presents a few particular applications of some statistical software for the Taguchi methods as a quality enhancement insisting on the quality loss functions, the design of experiments and the new developments of statistical process control. Key words: Taguchi methods, software applications 1. Introduction

Module 5: Statistical Analysis. Statistical Analysis To answer more complex questions using your data, or in statistical terms, to test your hypothesis, you need to use more advanced statistical tests. This module revi

Statistical Analysis [Methods] All statistical analyses described in the methods section should have a corresponding set of results (and vice versa) Explain the purpose of your analysis If the analysis is using complex statistical methods, then a citation to appropriate papers should be given pay special attention to how these are described

In addition to the many applications of statistical graphics, there is also a large and rapidly growing research literature on statistical methods that use graphics. Recent years have seen statistical graphics discussed in complete books (for example, Chambers et al. 1983; Cleveland 1985,1991) and in collections of papers (Tukey 1988; Cleveland

STAT 2331, Intro to Statistical Methods, covers the basics of statistical analysis techniques and adequately prepares students for the quantitative components of various degree plans. In this course students learn about common techniques of basic statistical inference, with a focus on applications in business and the social sciences.