Reading 20: Comparison of Frequentist and Bayesian Inference

Comparison of frequentist and Bayesian inference
Class 20, 18.05
Jeremy Orloff and Jonathan Bloom

1 Learning Goals

1. Be able to explain the difference between the p-value and a posterior probability to a doctor.

2 Introduction

We have now learned about two schools of statistical inference: Bayesian and frequentist. Both approaches allow one to evaluate evidence about competing hypotheses. In these notes we will review and compare the two approaches, starting from Bayes' formula.

3 Bayes' formula as touchstone

In our first unit (probability) we learned Bayes' formula, a perfectly abstract statement about conditional probabilities of events:

    P(A|B) = P(B|A) P(A) / P(B).

We began our second unit (Bayesian inference) by reinterpreting the events in Bayes' formula:

    P(H|D) = P(D|H) P(H) / P(D).

Now H is a hypothesis and D is data which may give evidence for or against H. Each term in Bayes' formula has a name and a role.

• The prior P(H) is the probability that H is true before the data is considered.
• The posterior P(H|D) is the probability that H is true after the data is considered.
• The likelihood P(D|H) is the evidence about H provided by the data D.
• P(D) is the total probability of the data taking into account all possible hypotheses.

If the prior and likelihood are known for all hypotheses, then Bayes' formula computes the posterior exactly. Such was the case when we rolled a die randomly selected from a cup whose contents you knew. We call this the deductive logic of probability theory, and it gives a direct way to compare hypotheses, draw conclusions, and make decisions.

In most experiments, the prior probabilities on hypotheses are not known. In this case, our recourse is the art of statistical inference: we either make up a prior (Bayesian) or do our best using only the likelihood (frequentist).
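To see this deductive use of Bayes' formula as a computation, here is a small R sketch of the die-drawn-from-a-cup setup. The particular contents of the cup and the observed roll are illustrative assumptions, not taken from these notes.

# Illustrative cup: one 4-, 6-, 8-, 12- and 20-sided die, each equally likely to be drawn.
sides <- c(4, 6, 8, 12, 20)
prior <- rep(1 / length(sides), length(sides))
# Data D: the drawn die is rolled once and shows a 5.
# Likelihood P(D|H): 1/sides if that die can show a 5, otherwise 0.
likelihood <- ifelse(sides >= 5, 1 / sides, 0)
# Bayes' formula: posterior = likelihood * prior / P(D), where P(D) sums over all hypotheses.
posterior <- likelihood * prior / sum(likelihood * prior)
round(data.frame(sides, prior, likelihood, posterior), 4)

The 4-sided die gets posterior probability 0, and among the remaining dice the posterior favors those with fewer sides, since rolling a 5 is more probable under them.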

The Bayesian school models uncertainty by a probability distribution over hypotheses. One's ability to make inferences depends on one's degree of confidence in the chosen prior, and the robustness of the findings to alternate prior distributions may be relevant and important.

The frequentist school only uses conditional distributions of data given specific hypotheses. The presumption is that some hypothesis (parameter specifying the conditional distribution of the data) is true and that the observed data is sampled from that distribution. In particular, the frequentist approach does not depend on a subjective prior that may vary from one investigator to another.

These two schools may be further contrasted as follows:

Bayesian inference
• uses probabilities for both hypotheses and data.
• depends on the prior and likelihood of observed data.
• requires one to know or construct a 'subjective prior'.
• dominated statistical practice before the 20th century.
• may be computationally intensive due to integration over many parameters.

Frequentist inference (NHST)
• never uses or gives the probability of a hypothesis (no prior or posterior).
• depends on the likelihood P(D|H) for both observed and unobserved data.
• does not require a prior.
• dominated statistical practice during the 20th century.
• tends to be less computationally intensive.

Frequentist measures like p-values and confidence intervals continue to dominate research, especially in the life sciences. However, in the current era of powerful computers and big data, Bayesian methods have undergone an enormous renaissance in fields like machine learning and genetics. There are now a number of large, ongoing clinical trials using Bayesian protocols, something that would have been hard to imagine a generation ago. While professional divisions remain, the consensus forming among top statisticians is that the most effective approaches to complex problems often draw on the best insights from both schools working in concert.

4 Critiques and defenses

4.1 Critique of Bayesian inference

1. The main critique of Bayesian inference is that a subjective prior is, well, subjective. There is no single method for choosing a prior, so different people will produce different priors and may therefore arrive at different posteriors and conclusions.

2. Furthermore, there are philosophical objections to assigning probabilities to hypotheses, as hypotheses do not constitute outcomes of repeatable experiments in which one can measure long-term frequency. Rather, a hypothesis is either true or false, regardless of whether one knows which is the case. A coin is either fair or unfair; treatment 1 is either better or worse than treatment 2; the sun will or will not come up tomorrow.

4.2 Defense of Bayesian inference

1. The probability of hypotheses is exactly what we need to make decisions. When the doctor tells me a screening test came back positive, I want to know the probability that this means I'm sick. That is, I want to know the probability of the hypothesis "I'm sick".

2. Using Bayes' theorem is logically rigorous. Once we have a prior, all our calculations have the certainty of deductive logic.

3. By trying different priors we can see how sensitive our results are to the choice of prior.

4. It is easy to communicate a result framed in terms of probabilities of hypotheses.

5. Even though the prior may be subjective, one can specify the assumptions used to arrive at it, which allows other people to challenge it or try other priors.

6. The evidence derived from the data is independent of notions about 'data more extreme' that depend on the exact experimental setup (see the "Stopping rules" section below).

7. Data can be used as it comes in. There is no requirement that every contingency be planned for ahead of time.

4.3 Critique of frequentist inference

1. It is ad hoc and does not carry the force of deductive logic. Notions like 'data more extreme' are not well defined. The p-value depends on the exact experimental setup (see the "Stopping rules" section below).

2. Experiments must be fully specified ahead of time. This can lead to paradoxical-seeming results. See the 'voltmeter story' in:
http://en.wikipedia.org/wiki/Likelihood_principle

3. The p-value and significance level are notoriously prone to misinterpretation. Careful statisticians know that a significance level of 0.05 means the probability of a type I error is 5%. That is, if the null hypothesis is true, then 5% of the time it will be rejected due to randomness. Many (most) other people erroneously think a p-value of 0.05 means that the probability of the null hypothesis is 5%.

Strictly speaking, you could argue that this is not a critique of frequentist inference but, rather, a critique of popular ignorance. Still, the subtlety of the ideas certainly contributes to the problem. (See "Mind your p's" below.)

4.4 Defense of frequentist inference

1. It is objective: all statisticians will agree on the p-value. Any individual can then decide if the p-value warrants rejecting the null hypothesis.

2. Hypothesis testing using frequentist significance testing is applied in the statistical analysis of scientific investigations, evaluating the strength of evidence against a null hypothesis with data. The interpretation of the results is left to the user of the tests. Different users may apply different significance levels for determining statistical significance. Frequentist statistics does not pretend to provide a way to choose the significance level; rather, it explicitly describes the trade-off between type I and type II errors.

3. Frequentist experimental design demands a careful description of the experiment and methods of analysis before starting. This helps control for experimenter bias.

4. The frequentist approach has been used for over 100 years and we have seen tremendous scientific progress. Although the frequentist herself would not put a probability on the belief that frequentist methods are valuable, shouldn't this history give the Bayesian a strong prior belief in the utility of frequentist methods?

5 Mind your p's

We run a two-sample t-test for equal means, with α = 0.05, and obtain a p-value of 0.04. What are the odds that the two samples are drawn from distributions with the same mean?

(a) 19/1   (b) 1/19   (c) 1/20   (d) 1/24   (e) unknown

answer: (e) unknown. Frequentist methods only give probabilities of statistics conditioned on hypotheses. They do not give probabilities of hypotheses.

6 Stopping rules

When running a series of trials we need a rule on when to stop. Two common rules are:

1. Run exactly n trials and stop.
2. Run trials until you see a certain result and then stop.

In this example we'll consider two coin tossing experiments.

Experiment 1: Toss the coin exactly 6 times and report the number of heads.
Experiment 2: Toss the coin until the first tails and report the number of heads.

Jon is worried that his coin is biased towards heads, so before using it in class he tests it for fairness. He runs an experiment and reports to Jerry that his sequence of tosses was HHHHHT. But Jerry is only half-listening, and he forgets which experiment Jon ran to produce the data.

Frequentist approach.

Since he's forgotten which experiment Jon ran, Jerry the frequentist decides to compute the p-values for both experiments given Jon's data.

Let θ be the probability of heads. We have the null and one-sided alternative hypotheses

    H0: θ = 0.5,   HA: θ > 0.5.

Experiment 1: The null distribution is binomial(6, 0.5), so the one-sided p-value is the probability of 5 or 6 heads in 6 tosses. Using R we get

    p = 1 - pbinom(4, 6, 0.5) = 0.1094.

Experiment 2: The null distribution is geometric(0.5), so the one-sided p-value is the probability of 5 or more heads before the first tails. Using R we get

    p = 1 - pgeom(4, 0.5) = 0.0313.

Using the typical significance level of 0.05, the same data leads to opposite conclusions! We would reject H0 in experiment 2, but not in experiment 1.

The frequentist is fine with this. The set of possible outcomes is different for the different experiments, so the notion of extreme data, and therefore p-value, is different. For example, in experiment 1 we would consider THHHHH to be as extreme as HHHHHT. In experiment 2 we would never see THHHHH since the experiment would end after the first tails.

Bayesian approach.

Jerry the Bayesian knows it doesn't matter which of the two experiments Jon ran, since the binomial and geometric likelihood functions (columns) for the data HHHHHT are proportional. In either case, he must make up a prior, and he chooses Beta(3,3). This is a relatively flat prior concentrated over the interval 0.25 ≤ θ ≤ 0.75.

Since the beta and binomial (or geometric) distributions form a conjugate pair, the Bayesian update is simple. Data of 5 heads and 1 tails gives a posterior distribution Beta(8,4). Here is a graph of the prior and the posterior. The blue lines at the bottom are 50% and 90% probability intervals for the posterior.

[Figure: prior Beta(3,3) and posterior Beta(8,4) densities over θ. Caption: Prior and posterior distributions with 0.5 and 0.9 probability intervals.]

Here are the relevant computations in R:

Posterior 50% probability interval: qbeta(c(0.25, 0.75), 8, 4) = [0.58, 0.76]
Posterior 90% probability interval: qbeta(c(0.05, 0.95), 8, 4) = [0.44, 0.86]
P(θ > 0.5 | data) = 1 - pbeta(0.5, 8, 4) = 0.89

Starting from the prior Beta(3,3), the posterior probability that the coin is biased toward heads is 0.89.
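For completeness, here is a self-contained R version of both analyses, plus a quick check of how sensitive the Bayesian conclusion is to the prior (point 3 in the defense of Bayesian inference). The alternative Beta priors are illustrative choices, not from the notes.

# Frequentist: one-sided p-values for the data HHHHHT under the two stopping rules.
1 - pbinom(4, 6, 0.5)        # Experiment 1: P(5 or 6 heads in 6 tosses), about 0.1094
1 - pgeom(4, 0.5)            # Experiment 2: P(5 or more heads before the first tails), about 0.0313

# Bayesian: Beta(3,3) prior with data of 5 heads and 1 tails gives posterior Beta(8,4).
qbeta(c(0.25, 0.75), 8, 4)   # 50% posterior probability interval, about (0.58, 0.76)
qbeta(c(0.05, 0.95), 8, 4)   # 90% posterior probability interval, about (0.44, 0.86)
1 - pbeta(0.5, 8, 4)         # P(theta > 0.5 | data), about 0.89

# Prior sensitivity (illustrative): P(theta > 0.5 | data) under several Beta(a, a) priors.
for (a in c(1, 3, 10)) {
  cat("Beta(", a, ",", a, ") prior gives", round(1 - pbeta(0.5, a + 5, a + 1), 3), "\n")
}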

7 Making decisions

Quite often the goal of statistical inference is to help with making a decision, e.g. whether or not to undergo surgery, how much to invest in a stock, whether or not to go to graduate school, etc.

In statistical decision theory, consequences of taking actions are measured by a utility function. The utility function assigns a weight to each possible outcome; in the language of probability, it is simply a random variable.

For example, in my investments I could assign a utility of d to the outcome of a gain of d dollars per share of a stock (if d < 0 my utility is negative). On the other hand, if my tolerance for risk is low, I will assign a more negative utility to losses than to gains (say, 2d if d < 0 and d if d ≥ 0).

A decision rule combines the expected utility with evidence for each hypothesis given by the data (e.g., p-values or posterior distributions) into a formal statistical framework for making decisions.

In this setting, the frequentist will consider the expected utility given a hypothesis

    E(U | H),

where U is the random variable representing utility. There are frequentist methods for combining the expected utility with p-values of hypotheses to guide decisions.

The Bayesian can combine E(U | H) with the posterior (or prior if it's before data is collected) to create a Bayesian decision rule.

In either framework, two people considering the same investment may have different utility functions and make different decisions. For example, a riskier stock (with higher potential upside and downside) will be more appealing with respect to the first utility function above than with respect to the second (loss-averse) one.

A significant theoretical result is that for any decision rule there is a Bayesian decision rule which is, in a precise sense, at least as good a rule.
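To make the contrast between the two utility functions concrete, here is a hedged R sketch using an invented posterior distribution over the gain per share; the numbers are purely illustrative and not from the notes.

# Toy posterior beliefs about the gain d (dollars per share) from buying a risky stock.
d    <- c(-10, -2, 0, 2, 10)             # possible gains per share (illustrative)
prob <- c(0.10, 0.20, 0.30, 0.25, 0.15)  # posterior probabilities (illustrative)

# Utility 1 (risk-neutral): U(d) = d.
u1 <- d
# Utility 2 (loss-averse): U(d) = 2d for losses, d for gains.
u2 <- ifelse(d < 0, 2 * d, d)

# Expected utilities under the same posterior beliefs.
sum(prob * u1)   # risk-neutral investor
sum(prob * u2)   # loss-averse investor

With these made-up numbers the expected utility is positive under the first utility function and negative under the second, so the two investors would reach different decisions about the same stock.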

MIT OpenCourseWare
https://ocw.mit.edu

18.05 Introduction to Probability and Statistics
Spring 2014

For information about citing these materials or our Terms of Use, visit: https://ocw.mit.edu/terms.
