Hypothesis Testing With The Bootstrap

2y ago

38 Views

2 Downloads

492.17 KB

17 Pages

Last View : 1m ago

Last Download : 3m ago

Upload by : Mia Martinelli

Report this link

Download PDF

Transcription

Hypothesis Testing with theBootstrapNoa HaasStatistics M.Sc. Seminar, Spring 2017Bootstrap and Resampling Methods

Bootstrap Hypothesis TestingA bootstrap hypothesis test starts with a teststatistic - 𝑡(𝒙) (not necessary an estimate of aparameter).We seek an achieved significance level𝐴𝑆𝐿 𝑃𝑟𝑜𝑏𝐻0 𝑡 𝒙 𝑡(𝒙)Where the random variable 𝒙 has a distributionspecified by the null hypothesis 𝐻0 - denote as 𝐹0 .Bootstrap hypothesis testing uses a “plug-in” styleto estimate 𝐹0 .

The Two-Sample ProblemWe observe two independent random samples:𝐹 𝐺 𝒛 𝑧1 , 𝑧2 , , 𝑧𝑛 �� 𝑜𝑓𝒚 𝑦1 , 𝑦2 , , 𝑦𝑚And we wish to test the null hypothesis of no differencebetween F and G,𝐻0 : 𝐹 𝐺

Bootstrap Hypothesis Testing 𝐹 𝐺 Denote the combined sample by 𝒙, and its empiricaldistribution by 𝐹0 . Under 𝐻0 , 𝐹0 provides a non parametric estimate forthe common population that gave rise to both 𝒛 and 𝒚.1. Draw 𝑩 samples of size 𝑛 𝑚 with replacementfrom 𝒙. Call the first n observations 𝒛 and theremaining 𝑚 – 𝒚 2. Evaluate 𝑡( ) on each sample - 𝑡(𝒙 𝒃 )3. Approximate 𝐴𝑆𝐿𝑏𝑜𝑜𝑡 by𝐴𝑆𝐿𝑏𝑜𝑜𝑡 # 𝑡 𝒙 𝒃 𝑡 𝒙 /𝐵* In the case that large values of 𝑡(𝒙 𝒃 ) are evidence against 𝐻0

Bootstrap Hypothesis Testing 𝐹 𝐺on the Mouse DataA histogram of bootstrapreplications of𝑡 𝒙 𝑧 𝑦for testing 𝐻0 : 𝐹 𝐺 on themouse data. The proportion ofvalues greater than 30.63 is .121.Calculating𝑧 𝑦𝑡 𝒙 𝜎 1 𝑛 1 𝑚(approximate pivotal) for thesame replications produced𝐴𝑆𝐿𝑏𝑜𝑜𝑡 .128

Testing Equality of Means Instead of testing 𝐻0 : 𝐹 𝐺, we wish to test H0 : 𝜇𝑧 𝜇𝑦 ,without assuming equal variances. We need estimates of 𝐹and 𝐺 that use only the assumption of common mean1. Define points 𝑧𝑖 𝑧𝑖 𝑧 𝑥, 𝑖 1, , 𝑛, and 𝑦𝑖 𝑦𝑖 𝑦 𝑥 , 𝑖 1, , 𝑚. The empirical distributions of 𝒛 and 𝒚shares a common mean.2. Draw 𝑩 bootstrap samples with replacement 𝒛 , 𝒚 from𝑧1 , 𝑧2 , , 𝑧𝑛 and 𝑦1 , 𝑦2 , , 𝑦𝑚 respectivly3. Evaluate 𝑡( ) on each sample 𝑦 𝑧𝑡 𝒙 𝒃 𝜎𝑧 1 𝑛 𝜎𝑦 1 𝑚4. Approximate 𝐴𝑆𝐿𝑏𝑜𝑜𝑡 by𝐴𝑆𝐿𝑏𝑜𝑜𝑡 # 𝑡 𝒙 𝒃 𝑡 𝒙 /𝐵

Permutation Test VS BootstrapHypothesis Testing Accuracy: In the two-sample problem, 𝐴𝑆𝐿𝑝𝑒𝑟𝑚 isthe exact probability of obtaining a test statisticas extreme as the one observed. In contrast, thebootstrap explicitly samples from estimatedprobability mechanism. 𝐴𝑆𝐿𝑏𝑜𝑜𝑡 has nointerpretation as an exact probability. Flexibility: When special symmetry isn’t required,the bootstrap testing can be applied much moregenerally than the permutation test. (Like in thetwo sample problem – permutation test is limitedto 𝐻0 : 𝐹 𝐺, or in the one-sample problem)

The One-Sample ProblemWe observe a random sample:𝐹 𝒛 𝑧1 , 𝑧2 , , 𝑧𝑛And we wish to test whether the mean of thepopulation equals to some predetermine value 𝜇0 𝐻0 : 𝜇𝑧 𝜇0

Bootstrap Hypothesis Testing 𝜇𝑧 𝜇0What is the appropriate way to estimate the nulldistribution?The empirical distribution 𝐹 is not anappropriate estimation, because it does notobey 𝐻0 .As before, we can use the empirical distributionof the points:𝑧𝑖 𝑧𝑖 𝑧 𝜇0 , 𝑖 1, , 𝑛Which has a mean of 𝜇0 .

Bootstrap Hypothesis Testing 𝜇𝑧 𝜇0The test will be based on the approximate𝑧 𝜇0distribution of the test statistic 𝑡 𝒛 𝜎/ 𝑛We sample 𝑩 times 𝑧1 , , 𝑧𝑛 with replacementfrom 𝑧1 , , 𝑧𝑛 , and for each sample compute𝑧 𝜇0 𝑡 𝒛 𝜎/ 𝑛And the estimated ASL is given by𝐴𝑆𝐿𝑏𝑜𝑜𝑡 # 𝑡 𝒛 𝒃 𝑡 𝒛 /𝐵* In the case that large values of 𝑡 𝒛 𝒃 are evidence against 𝐻0

Testing 𝜇𝑧 𝜇0 on the Mouse DataTaking 𝜇0 129, the observed value of the test statistic is86.9 129𝑡 𝒛 1.6766.8/ 7(When estimating 𝜎 with the unbiased estimator for standarddeviation). For 94 of 1000 bootstrap samples, 𝑡 𝒛 wassmaller than -1.67, and therefor𝐴𝑆𝐿𝑏𝑜𝑜𝑡 .094For reference, the student’s t-test result for the same nullhypothesis on that data gives us42.1𝐴𝑆𝐿 𝑃𝑟𝑜𝑏 𝑡6 0.0766.8/ 7

Testing Multimodality of a PopulationA mode is defined to be a local maximum or “bump” of thepopulation densityThe data: 𝑥1 , , 𝑥485 Mexican stamps’ thickness from 1872.The number of modes is suggestive of the number of distincttype of paper used in the printing.

Testing Multimodality of a PopulationSince the histogram is not smooth, it is difficult to tell from it whether there aremore than one mode.A Gaussian kernel density with window size ℎ estimate can be used in order toobtain a smoother estimate:1𝑓 𝑡; ℎ 𝑛ℎ𝑛𝑖 1𝑡 𝑥𝑖𝜙ℎAs 𝒉 increases, the number of modes in the density estimate is non-increasing

Testing Multimodality of a PopulationThe null hypothesis:𝐻0 : 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑚𝑜𝑑𝑒𝑠 1Versus 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑚𝑜𝑑𝑒𝑠 1. Since the number of modes decreases asℎ increases, there is a smallest value of ℎ such that 𝑓 𝑡; ℎ has one mode.Call it ℎ1 . In our case, ℎ1 .0068.

Testing Multimodality of a PopulationIt seems reasonable to use 𝑓(𝑡; ℎ1 ) as the estimated nulldistribution for our test of 𝐻0 . It is the density estimate thatuses least amount of smoothing among all estimated withone mode (conservative).A small adjustment to 𝑓 is needed because the formula artificiallyincreases the variance of the estimate with ℎ12 . Let 𝑔 ; ℎ1 be the rescaleestimate, that imposes variance equal to the sample variance.A natural choice for a test statistic is ℎ1 - a large value of ℎ1 isevidence against 𝐻0 .Putting all of this together, the achieved significance level is𝐴𝑆𝐿𝑏𝑜𝑜𝑡 𝑃𝑟𝑜𝑏𝑔 ;ℎ1 ℎ1 ℎ1Where each bootstrap sample 𝒙 is drawn from 𝑔 ; ℎ1

Testing Multimodality of a PopulationThe sampling from 𝑔 ; ℎ1 is given by:1 ℎ12 𝜎 2 2𝑥𝑖 𝑥 1 𝑦𝑖 𝑥 ℎ1 𝜖𝑖 ; 𝑖 1, , 𝑛Where 𝑦1 , , 𝑦𝑛 are sampled with replacement from 𝑥1 , , 𝑥𝑛 , and𝜖𝑖 are standard normal random variables. (called smoothedbootstrap)In the stamps data, out of 500 bootstrap samples, none hadℎ1 .0068, so 𝐴𝑆𝐿𝑏𝑜𝑜𝑡 0.The results can be interpreted in sequential manner, moving on tohigher values of the least amount of modes. (Silverman 1981)When testing the same for 𝐻0 : 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑚𝑜𝑑𝑒𝑠 2, 146samples out of 500 had ℎ2 .0033, which translates to 𝐴𝑆𝐿𝑏𝑜𝑜𝑡 0.292.In our case, the inference process will end here.

SummaryA bootstrap hypothesis test is carried out using the followings:a) A test statistic 𝑡(𝒙)b) An approximate null distribution 𝐹0 for the data under 𝐻0Given these, we generate 𝐵 bootstrap values of 𝑡(𝒙 ) under 𝐹0 andestimate the achieved significance level by𝐴𝑆𝐿𝑏𝑜𝑜𝑡 # 𝑡 𝒙 𝒃 𝑡(𝒙) /𝐵The choice of test statistic 𝑡 𝒙 and the estimate of the null distributionwill determine the power of the test. In the stamp example, if the actualpopulation density is bimodal, but the Gaussian kernel density does notapproximate it accurately, then the suggested test will not have highpower.Bootstrap tests are useful when the alternative hypothesis is not wellspecified. In cases where there is parametric alternative hypothesis,likelihood or Bayesian methods might be preferable.

Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods . Bootstrap Hypothesis Testing A bootstrap hypothesis test starts with a test statistic - P( ) (not necessary an estimat

Related Documents:

Nonprofit Self-Assessment Checklist

May 02, 2018 · D. Program Evaluation ͟The organization has provided a description of the framework for how each program will be evaluated. The framework should include all the elements below: ͟The evaluation methods are cost-effective for the organization ͟Quantitative and qualitative data is being collected (at Basics tier, data collection must have begun)

1.4K Views

2y ago

Name of thé élément in thé language and script of thé ... - UNESCO

Silat is a combative art of self-defense and survival rooted from Matay archipelago. It was traced at thé early of Langkasuka Kingdom (2nd century CE) till thé reign of Melaka (Malaysia) Sultanate era (13th century). Silat has now evolved to become part of social culture and tradition with thé appearance of a fine physical and spiritual .

113 Views

9m ago

[Kl - Mauritius

On an exceptional basis, Member States may request UNESCO to provide thé candidates with access to thé platform so they can complète thé form by themselves. Thèse requests must be addressed to esd rize unesco. or by 15 A ril 2021 UNESCO will provide thé nomineewith accessto thé platform via their émail address.

466 Views

1y ago

Employee Benefits Event - Schneider Downs Tax Services

̶The leading indicator of employee engagement is based on the quality of the relationship between employee and supervisor Empower your managers! ̶Help them understand the impact on the organization ̶Share important changes, plan options, tasks, and deadlines ̶Provide key messages and talking points ̶Prepare them to answer employee questions

326 Views

1y ago

Study Investigating thè Effect of E- Service Quality on Customer's ...

Dr. Sunita Bharatwal** Dr. Pawan Garga*** Abstract Customer satisfaction is derived from thè functionalities and values, a product or Service can provide. The current study aims to segregate thè dimensions of ordine Service quality and gather insights on its impact on web shopping. The trends of purchases have

122 Views

9m ago

Kinh Giải Thâm Mật HT. Thích Trí Quang dịch giải

Chính Văn.- Còn đức Thế tôn thì tuệ giác cực kỳ trong sạch 8: hiện hành bất nhị 9, đạt đến vô tướng 10, đứng vào chỗ đứng của các đức Thế tôn 11, thể hiện tính bình đẳng của các Ngài, đến chỗ không còn chướng ngại 12, giáo pháp không thể khuynh đảo, tâm thức không bị cản trở, cái được

1.6K Views

3y ago

Bootstrap for complex survey data

know how to create bootstrap weights in Stata and R know how to choose parameters of the bootstrap. Survey bootstrap Stas Kolenikov Bootstrap for i.i.d. data Variance estimation for complex surveys Survey bootstraps Software im-plementation Conclusions References Outline

23 Views

2y ago

Anatomy of a journal - Open University

Anatomy of a journal 1. Introduction This short activity will walk you through the different elements which form a Journal. Learning outcomes By the end of the activity you will be able to: Understand what an academic journal is Identify a journal article inside a journal Understand what a peer reviewed journal is 2. What is a journal? Firstly, let's look at a description of a .

99 Views

3y ago

Recent Views

MOOSIC PRE ORDER OFFER 2018

9781860960147 Jazz Piano Grade 5: The CD 22.92 17.24 18.76 19.83 9781860960154 Jazz Piano from Scratch 55.00 41.36 45.02 47.58 9781860960161 Jazz Piano Aural Tests, Grades 1-3 18.15 13.65 14.86 15.70 9781860960505 Jazz Piano Aural Tests, Grades 4-5 15.29 11.50 12.52 13.23 Easier Piano Pieces (ABRSM)

3y ago

95 Views

Bethel A.M.E. Annual Women's Day Celebration

Annual Women's Day Celebration Theme: Steadfast and Faithful Women 1993 Bethel African Methodi st Epi scopal Church Champaign, Illinois The Ministry Thi.! Rev. Sleven A. Jackson, Pastor The Rev. O.G. Monroe. Assoc, Minister The Rl. Rev. James Haskell Mayo l1 ishop, f7011rt h Episcop;l) District The Rev. Lewis E. Grady. Jr. Prc. i ding Elder . Cover design taken from: Book of Black Heroes .

3y ago

97 Views

Automotive - Siemens Digital Industries Software

of this system requires a new level of close integration between mechanical, electrical and thermal domains. It becomes necessary to have true multi-domain data exchange between engineering software tools to inform the system design from an early concept stage. At the most progressive automotive OEMs, thermal, electrical

3y ago

51 Views

PRESENTER BIOGRAPHIES

PRESENTER BIOGRAPHIES. MDPH Commissioner Remarks: Cheryl Bartlett, RN Commissioner . MA Department of Public Health . Cheryl Bartlett was named Commissioner of the Massachusetts Department of Public Health in June 2013. As Commissioner Ms. Bartlett chairs the newly appointed Prevention and Wellness Advisory Board, which oversees a 60 million Prevention Trust Fund – the first of its kind in .

3y ago

116 Views

2019 SPLUNK INC. Splunk Certification Certification Exam .

Sample Questions Test Blueprint Splunk Core Certified Consultant Test Blueprint Splunk Certification Exams Table of Contents Please note: Sample questions (where available) are provided to give candidates a general idea of the formatting and type of questions for each of the exams listed above. The test blueprints provide much

3y ago

73 Views

Programme Specification BSc Chemistry (2020-21 )

The BSc Chemistry degree aims to enhance your enthusiasm for chemistry and to provide an intellectually stimulating learning environment. You will gain extensive in-depth knowledge and understanding of chemistry and related subjects, as well as a comprehensive training in practical chemistry and an appreciation of the importance of the discipline in different contexts. The programme will .

3y ago

51 Views

Chimney - Robot Virtual Games

Chimney Junior Each Total Correct balls on the Chimney Each ball will give you points if it is equal to the color indicated by the cube. 40 80 Incorrect balls on the Chimney Each ball will take you points if it is not equal to the color indicated by the cube. -5 -10 Park the robot Robot stops on Finish Area and simulation stops.

3y ago

42 Views

Timeline of the Cold War - truman.library

Timeline of the Cold War 1945 Defeat of Germany and Japan February 4-11: Yalta Conference meeting of FDR, Churchill, Stalin - the 'Big Three' Soviet Union has control of Eastern Europe. The Cold War Begins May 8: VE Day - Victory in Europe. Germany surrenders to the Red Army in Berlin July: Potsdam Conference - Germany was officially partitioned into four zones of occupation. August 6: The .

3y ago

253 Views

skinnytaste Cookbook Index

Naked Persian Turkey Burgers The Skinnytaste Cookbook Perfect Poultry 156 6 6 6 Orecchiette with Sausage, Baby Kale, and Bell Pepper The Skinnytaste Cookbook Perfect Poultry 181 11 11 4. RECIPE COOKBOOK CHAPTER PG SP Roasted Poblanos Rellenos with Chicken The Skinnytaste Cookbook Perfect Poultry 173 7 10 5

3y ago

71 Views

3-in-1 Cooking System - NinjaKitchen

5 ˆˇ 6 Getting to Know the Ninja 3-in-1 Cooking System Control Panel Function Dial Turn the dial to select Stovetop, Slow Cook or Oven mode. Stovetop - Use the Ninja 3-in-1 Cooking System as you would a stovetop.

3y ago

39 Views

BIOLOGY - Michigan

Credit for high school Earth Science, Biology, Physics, and Chemistry will be defined as meeting both essential and core subject area content expectations. Assessment Prerequisite Knowledge and Skills Basic Science Knowledge Orientation Towards Learning Reading, Writing, Communication Basic Mathematics Conventions, Probability, Statistics .

3y ago

27 Views

Investigatingrespiration*in*ectotherms(crickets)*

ets)*

Males"of" ud" chirpingsoundbyrubbingtheir forewingstogether;theydothisto p .

3y ago

34 Views

The Criminal Justice Response to Child Abuse: Lessons .

Rates of Criminal Justice Action on Investigated Cases Study Sample N Rate Tjaden & Thoennes, 1992 CPS 833 4% prosecuted Finkelhor, 1983 State clearing - house data 6096 24% criminal justice action taken Stroud, Martens & Barker, 2000 &KLOGUHQ¶V Advocacy Center 1043 56% referred to p rosecutors

3y ago

45 Views

Curriculum Adaptations for Exceptional Students

Adapting curriculum and instruction . The Center for School and Community Integration, Institute for the Study of Developmental Disabilities. Why do we want to use curriculum adaptations? Looking at learning in new and different ways. Get creative! EM 1.1.8 – Student understands concepts of

3y ago

29 Views

Brunei Darussalam In Brief - information.gov.bn

‘Brunei Darussalam In Brief’ is a publication where it discusses briefly on the socio-economic welfare of Brunei Darussalam in general. No part of this publication may be reproduced, stored in a retrieval system or transmitted in any form by any means without prior written permission from

3y ago

65 Views

Hypothesis Testing With The Bootstrap

It looks like you're using an ad-blocker