Hypothesis Testing With The Bootstrap

2y ago
38 Views
2 Downloads
492.17 KB
17 Pages
Last View : 1m ago
Last Download : 3m ago
Upload by : Mia Martinelli
Transcription

Hypothesis Testing with theBootstrapNoa HaasStatistics M.Sc. Seminar, Spring 2017Bootstrap and Resampling Methods

Bootstrap Hypothesis TestingA bootstrap hypothesis test starts with a teststatistic - 𝑑(𝒙) (not necessary an estimate of aparameter).We seek an achieved significance level𝐴𝑆𝐿 π‘ƒπ‘Ÿπ‘œπ‘π»0 𝑑 𝒙 𝑑(𝒙)Where the random variable 𝒙 has a distributionspecified by the null hypothesis 𝐻0 - denote as 𝐹0 .Bootstrap hypothesis testing uses a β€œplug-in” styleto estimate 𝐹0 .

The Two-Sample ProblemWe observe two independent random samples:𝐹 𝐺 𝒛 𝑧1 , 𝑧2 , , 𝑧𝑛 οΏ½οΏ½ π‘œπ‘“π’š 𝑦1 , 𝑦2 , , π‘¦π‘šAnd we wish to test the null hypothesis of no differencebetween F and G,𝐻0 : 𝐹 𝐺

Bootstrap Hypothesis Testing 𝐹 𝐺 Denote the combined sample by 𝒙, and its empiricaldistribution by 𝐹0 . Under 𝐻0 , 𝐹0 provides a non parametric estimate forthe common population that gave rise to both 𝒛 and π’š.1. Draw 𝑩 samples of size 𝑛 π‘š with replacementfrom 𝒙. Call the first n observations 𝒛 and theremaining π‘š – π’š 2. Evaluate 𝑑( ) on each sample - 𝑑(𝒙 𝒃 )3. Approximate π΄π‘†πΏπ‘π‘œπ‘œπ‘‘ byπ΄π‘†πΏπ‘π‘œπ‘œπ‘‘ # 𝑑 𝒙 𝒃 𝑑 𝒙 /𝐡* In the case that large values of 𝑑(𝒙 𝒃 ) are evidence against 𝐻0

Bootstrap Hypothesis Testing 𝐹 𝐺on the Mouse DataA histogram of bootstrapreplications of𝑑 𝒙 𝑧 𝑦for testing 𝐻0 : 𝐹 𝐺 on themouse data. The proportion ofvalues greater than 30.63 is .121.Calculating𝑧 𝑦𝑑 𝒙 𝜎 1 𝑛 1 π‘š(approximate pivotal) for thesame replications producedπ΄π‘†πΏπ‘π‘œπ‘œπ‘‘ .128

Testing Equality of Means Instead of testing 𝐻0 : 𝐹 𝐺, we wish to test H0 : πœ‡π‘§ πœ‡π‘¦ ,without assuming equal variances. We need estimates of 𝐹and 𝐺 that use only the assumption of common mean1. Define points 𝑧𝑖 𝑧𝑖 𝑧 π‘₯, 𝑖 1, , 𝑛, and 𝑦𝑖 𝑦𝑖 𝑦 π‘₯ , 𝑖 1, , π‘š. The empirical distributions of 𝒛 and π’šshares a common mean.2. Draw 𝑩 bootstrap samples with replacement 𝒛 , π’š from𝑧1 , 𝑧2 , , 𝑧𝑛 and 𝑦1 , 𝑦2 , , π‘¦π‘š respectivly3. Evaluate 𝑑( ) on each sample 𝑦 𝑧𝑑 𝒙 𝒃 πœŽπ‘§ 1 𝑛 πœŽπ‘¦ 1 π‘š4. Approximate π΄π‘†πΏπ‘π‘œπ‘œπ‘‘ byπ΄π‘†πΏπ‘π‘œπ‘œπ‘‘ # 𝑑 𝒙 𝒃 𝑑 𝒙 /𝐡

Permutation Test VS BootstrapHypothesis Testing Accuracy: In the two-sample problem, π΄π‘†πΏπ‘π‘’π‘Ÿπ‘š isthe exact probability of obtaining a test statisticas extreme as the one observed. In contrast, thebootstrap explicitly samples from estimatedprobability mechanism. π΄π‘†πΏπ‘π‘œπ‘œπ‘‘ has nointerpretation as an exact probability. Flexibility: When special symmetry isn’t required,the bootstrap testing can be applied much moregenerally than the permutation test. (Like in thetwo sample problem – permutation test is limitedto 𝐻0 : 𝐹 𝐺, or in the one-sample problem)

The One-Sample ProblemWe observe a random sample:𝐹 𝒛 𝑧1 , 𝑧2 , , 𝑧𝑛And we wish to test whether the mean of thepopulation equals to some predetermine value πœ‡0 𝐻0 : πœ‡π‘§ πœ‡0

Bootstrap Hypothesis Testing πœ‡π‘§ πœ‡0What is the appropriate way to estimate the nulldistribution?The empirical distribution 𝐹 is not anappropriate estimation, because it does notobey 𝐻0 .As before, we can use the empirical distributionof the points:𝑧𝑖 𝑧𝑖 𝑧 πœ‡0 , 𝑖 1, , 𝑛Which has a mean of πœ‡0 .

Bootstrap Hypothesis Testing πœ‡π‘§ πœ‡0The test will be based on the approximate𝑧 πœ‡0distribution of the test statistic 𝑑 𝒛 𝜎/ 𝑛We sample 𝑩 times 𝑧1 , , 𝑧𝑛 with replacementfrom 𝑧1 , , 𝑧𝑛 , and for each sample compute𝑧 πœ‡0 𝑑 𝒛 𝜎/ 𝑛And the estimated ASL is given byπ΄π‘†πΏπ‘π‘œπ‘œπ‘‘ # 𝑑 𝒛 𝒃 𝑑 𝒛 /𝐡* In the case that large values of 𝑑 𝒛 𝒃 are evidence against 𝐻0

Testing πœ‡π‘§ πœ‡0 on the Mouse DataTaking πœ‡0 129, the observed value of the test statistic is86.9 129𝑑 𝒛 1.6766.8/ 7(When estimating 𝜎 with the unbiased estimator for standarddeviation). For 94 of 1000 bootstrap samples, 𝑑 𝒛 wassmaller than -1.67, and thereforπ΄π‘†πΏπ‘π‘œπ‘œπ‘‘ .094For reference, the student’s t-test result for the same nullhypothesis on that data gives us42.1𝐴𝑆𝐿 π‘ƒπ‘Ÿπ‘œπ‘ 𝑑6 0.0766.8/ 7

Testing Multimodality of a PopulationA mode is defined to be a local maximum or β€œbump” of thepopulation densityThe data: π‘₯1 , , π‘₯485 Mexican stamps’ thickness from 1872.The number of modes is suggestive of the number of distincttype of paper used in the printing.

Testing Multimodality of a PopulationSince the histogram is not smooth, it is difficult to tell from it whether there aremore than one mode.A Gaussian kernel density with window size β„Ž estimate can be used in order toobtain a smoother estimate:1𝑓 𝑑; β„Ž π‘›β„Žπ‘›π‘– 1𝑑 π‘₯π‘–πœ™β„ŽAs 𝒉 increases, the number of modes in the density estimate is non-increasing

Testing Multimodality of a PopulationThe null hypothesis:𝐻0 : π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘šπ‘œπ‘‘π‘’π‘  1Versus π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘šπ‘œπ‘‘π‘’π‘  1. Since the number of modes decreases asβ„Ž increases, there is a smallest value of β„Ž such that 𝑓 𝑑; β„Ž has one mode.Call it β„Ž1 . In our case, β„Ž1 .0068.

Testing Multimodality of a PopulationIt seems reasonable to use 𝑓(𝑑; β„Ž1 ) as the estimated nulldistribution for our test of 𝐻0 . It is the density estimate thatuses least amount of smoothing among all estimated withone mode (conservative).A small adjustment to 𝑓 is needed because the formula artificiallyincreases the variance of the estimate with β„Ž12 . Let 𝑔 ; β„Ž1 be the rescaleestimate, that imposes variance equal to the sample variance.A natural choice for a test statistic is β„Ž1 - a large value of β„Ž1 isevidence against 𝐻0 .Putting all of this together, the achieved significance level isπ΄π‘†πΏπ‘π‘œπ‘œπ‘‘ π‘ƒπ‘Ÿπ‘œπ‘π‘” ;β„Ž1 β„Ž1 β„Ž1Where each bootstrap sample 𝒙 is drawn from 𝑔 ; β„Ž1

Testing Multimodality of a PopulationThe sampling from 𝑔 ; β„Ž1 is given by:1 β„Ž12 𝜎 2 2π‘₯𝑖 π‘₯ 1 𝑦𝑖 π‘₯ β„Ž1 πœ–π‘– ; 𝑖 1, , 𝑛Where 𝑦1 , , 𝑦𝑛 are sampled with replacement from π‘₯1 , , π‘₯𝑛 , andπœ–π‘– are standard normal random variables. (called smoothedbootstrap)In the stamps data, out of 500 bootstrap samples, none hadβ„Ž1 .0068, so π΄π‘†πΏπ‘π‘œπ‘œπ‘‘ 0.The results can be interpreted in sequential manner, moving on tohigher values of the least amount of modes. (Silverman 1981)When testing the same for 𝐻0 : π‘›π‘’π‘šπ‘π‘’π‘Ÿ π‘œπ‘“ π‘šπ‘œπ‘‘π‘’π‘  2, 146samples out of 500 had β„Ž2 .0033, which translates to π΄π‘†πΏπ‘π‘œπ‘œπ‘‘ 0.292.In our case, the inference process will end here.

SummaryA bootstrap hypothesis test is carried out using the followings:a) A test statistic 𝑑(𝒙)b) An approximate null distribution 𝐹0 for the data under 𝐻0Given these, we generate 𝐡 bootstrap values of 𝑑(𝒙 ) under 𝐹0 andestimate the achieved significance level byπ΄π‘†πΏπ‘π‘œπ‘œπ‘‘ # 𝑑 𝒙 𝒃 𝑑(𝒙) /𝐡The choice of test statistic 𝑑 𝒙 and the estimate of the null distributionwill determine the power of the test. In the stamp example, if the actualpopulation density is bimodal, but the Gaussian kernel density does notapproximate it accurately, then the suggested test will not have highpower.Bootstrap tests are useful when the alternative hypothesis is not wellspecified. In cases where there is parametric alternative hypothesis,likelihood or Bayesian methods might be preferable.

Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods . Bootstrap Hypothesis Testing A bootstrap hypothesis test starts with a test statistic - P( ) (not necessary an estimat

Related Documents:

May 02, 2018 · D. Program Evaluation ͟The organization has provided a description of the framework for how each program will be evaluated. The framework should include all the elements below: ͟The evaluation methods are cost-effective for the organization ͟Quantitative and qualitative data is being collected (at Basics tier, data collection must have begun)

Silat is a combative art of self-defense and survival rooted from Matay archipelago. It was traced at thΓ© early of Langkasuka Kingdom (2nd century CE) till thΓ© reign of Melaka (Malaysia) Sultanate era (13th century). Silat has now evolved to become part of social culture and tradition with thΓ© appearance of a fine physical and spiritual .

On an exceptional basis, Member States may request UNESCO to provide thé candidates with access to thé platform so they can complète thé form by themselves. Thèse requests must be addressed to esd rize unesco. or by 15 A ril 2021 UNESCO will provide thé nomineewith accessto thé platform via their émail address.

ΜΆThe leading indicator of employee engagement is based on the quality of the relationship between employee and supervisor Empower your managers! ΜΆHelp them understand the impact on the organization ΜΆShare important changes, plan options, tasks, and deadlines ΜΆProvide key messages and talking points ΜΆPrepare them to answer employee questions

Dr. Sunita Bharatwal** Dr. Pawan Garga*** Abstract Customer satisfaction is derived from thè functionalities and values, a product or Service can provide. The current study aims to segregate thè dimensions of ordine Service quality and gather insights on its impact on web shopping. The trends of purchases have

ChΓ­nh VΔƒn.- CΓ²n Δ‘α»©c ThαΊΏ tΓ΄n thΓ¬ tuệ giΓ‘c cα»±c kα»³ trong sαΊ‘ch 8: hiện hΓ nh bαΊ₯t nhα»‹ 9, Δ‘αΊ‘t Δ‘αΊΏn vΓ΄ tΖ°α»›ng 10, Δ‘α»©ng vΓ o chα»— Δ‘α»©ng của cΓ‘c Δ‘α»©c ThαΊΏ tΓ΄n 11, thể hiện tΓ­nh bΓ¬nh Δ‘αΊ³ng của cΓ‘c NgΓ i, Δ‘αΊΏn chα»— khΓ΄ng cΓ²n chΖ°α»›ng ngαΊ‘i 12, giΓ‘o phΓ‘p khΓ΄ng thể khuynh Δ‘αΊ£o, tΓ’m thα»©c khΓ΄ng bα»‹ cαΊ£n trở, cΓ‘i được

know how to create bootstrap weights in Stata and R know how to choose parameters of the bootstrap. Survey bootstrap Stas Kolenikov Bootstrap for i.i.d. data Variance estimation for complex surveys Survey bootstraps Software im-plementation Conclusions References Outline

Anatomy of a journal 1. Introduction This short activity will walk you through the different elements which form a Journal. Learning outcomes By the end of the activity you will be able to: Understand what an academic journal is Identify a journal article inside a journal Understand what a peer reviewed journal is 2. What is a journal? Firstly, let's look at a description of a .