Introduction to Bayesian Data Analysis with R


Introduction to Bayesian Data Analysis with R
Rasmus Bååth, Lund University
@rabaath · rasmus.baath@gmail.com · http://www.sumsar.net

Source: I borrowed these three examples from a presentation, but which presentation I can't remember or find. If you know what presentation it could have been, please let me know and I will credit it here.

What do these have in common?
- Complex problems.
- Large inherent uncertainty that needs to be quantified.
- Requires efficient integration of many sources of information.
They all use Bayesian data analysis.

Bayesian data analysis is a great tool! ... and R is a great tool for doing Bayesian data analysis. But if you google "Bayesian" you get philosophy:
- Subjective vs Objective
- Frequentism vs Bayesianism
- p-values vs subjective probabilities

Bayesian data analysis: What? Why? How?

Overview of this tutorial
- What is Bayesian data analysis?
- Prediction contest
- Why use Bayesian data analysis?
- Exercises
- How to interpret and perform a Bayesian data analysis in R?
- More exercises

Why am I here?
- I use Bayesian methods in my research at Lund University, where I also run a network for people interested in Bayes.
- I'm working on an R package to make simple Bayesian analyses simple to run.
- I blog about Bayesian data analysis at www.sumsar.net.

What is Bayesian data analysis?
- It is when you use probability to represent uncertainty in all parts of a statistical model.
- A flexible extension of maximum likelihood.
- Potentially the most information-efficient method to fit a statistical model. (But potentially also the most computationally intensive method.)

Bayesian models as generative models: parameters (µ, ρ, θ, σ) → generative model → data (5, 2, 7, 8, 3, 9, 1, 2, …), if we know the parameters.

Bayesian models as generative models: data (5, 2, 7, 8, 3, 9, 1, 2, …) → generative model → parameters (µ, ρ, θ, σ), when we know the data.

How many fish are in the lake?
- An actual problem in abundance estimation.
- Used in, for example, wildlife management.
- Also other uses, for example, to estimate how many DKK 1,000 bills are in circulation.

How many fish are in the lake? The problem: we can't catch them all, but we can catch some of them.

Mark and re-capture:
1. Catch a couple of fish.
2. Mark them and throw them back.

Mark and re-capture:
1. Catch a couple of fish.
2. Mark them and throw them back.
3. At a later point, catch a couple of fish again.
4. Count how many are marked.
20 were marked, and five out of the 20 that were caught the second time were marked.

So, how many fish are in the lake? What is the probable number of fish in the lake? We have almost already described the solution! (If we know about Bayesian data analysis, that is.)

Parameters (µ, ρ, θ, σ) → generative model → data (5, 2, 7, 8, 3, 9, 1, 2, …).

Parameters: no. of fish → generative model: (1) mark 20 "fish", (2) sample 20 "fish", (3) count the no. of marked fish → data: 5 marked fish.

Parameters: uncertainty over the no. of fish → generative model: (1) mark 20 "fish", (2) sample 20 "fish", (3) count the no. of marked fish → data: 5 marked fish.

Prior: Uniform(0, 250) over the no. of fish → generative model: (1) mark 20 "fish", (2) sample 20 "fish", (3) count the no. of marked fish → data: 5 marked fish.

One simple way of fitting the model
1. Draw a large random sample from the "prior" probability distribution on the parameters. Here, for example, no. of fish: [63, 30, 167, 30, 164, 222, 225, 42, 122, …]
2. Plug each draw into the generative model, which generates a vector of "fake" data. For example: 63 fish → pick → 4 marked; 30 fish → pick → 13 marked; 167 fish → pick → 5 marked; 30 fish → pick → 15 marked.

One simple way of fitting the model
3. Keep only those parameter values that generated the data that was actually observed: 63 fish → 4 marked = 5? No. 30 fish → 13 marked = 5? No. 167 fish → 5 marked = 5? Yes. 30 fish → 15 marked = 5? No.

One simple way of fitting the model
3. Keep only those parameter values that generated the data that was actually observed (4 = 5? 13 = 5? 5 = 5? 15 = 5?).
4. The distribution of the retained parameters now represents the probability that the data was produced by a certain parameter value. For example: [167, 135, 148, 90, 162, 88, 98, 110, 176, …]
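As a rough R sketch of the above procedure (the variable names here are illustrative and not taken from the tutorial's script, which is linked further down):

R code:
n_draw <- 100000

# 1. Sample the number of fish from the Uniform(0, 250) prior
#    (at least the 20 marked fish must exist, so the grid starts at 20).
n_fish <- sample(20:250, n_draw, replace = TRUE)

# 2. Generative model: 20 fish are marked, 20 are later caught, and the
#    number of marked fish among them is counted.
pick_fish <- function(n_fish) {
  fish <- rep(c("marked", "unmarked"), c(20, n_fish - 20))
  sum(sample(fish, 20) == "marked")
}
n_marked <- sapply(n_fish, pick_fish)

# 3. Keep only the draws that reproduce the observed data (5 marked fish).
posterior <- n_fish[n_marked == 5]

# 4. The retained draws approximate the posterior over the number of fish.
hist(posterior)
quantile(posterior, c(0.25, 0.5, 0.75))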

Time for a demonstration. The script that was "live coded" can be found here: http://rpubs.com/rasmusab/live_coding_user_2015_bayes_tutorial

(Plot of the prior and posterior distributions, annotated with the maximum likelihood estimate, the posterior median, and a 50% credible interval.)

P(100 fish | 5 marked) ∝ P(100 fish) · P(5 marked | 100 fish)

P(100 fish | 5 marked) = P(100 fish) · P(5 marked | 100 fish) / Σ_n P(n fish) · P(5 marked | n fish)

In general, with parameters Θ, a generative model, and data D, this is Bayes' theorem:

P(Θ | D) = P(Θ) · P(D | Θ) / Σ P(Θ) · P(D | Θ)

What have we done?
- We have specified prior information: P(Θ)
- A generative model: P(D | Θ)
- And have calculated the probability of different parameter values: P(Θ | D)

What have we done?
- In this example we used a capture-recapture model with one parameter.
- But the general method works on any generative model and with any number of parameters.
- The specific computational method we used only works in rare cases.

What is not Bayesian data analysis?
- A category of models.
- Subjective.
- Not necessarily the most computationally efficient method of fitting a model.
- Anything new.

Bayes 1701–1761, Laplace 1749–1827: "inverse probability".

"Bayesian data analysis" is not the best of names. "Probabilistic modeling" would be better! (Bayes 1701–1761, Laplace 1749–1827: inverse probability; Fisher 1890–1962: "Bayesians!")

UseR! 2015 prediction competition: http://bit.ly/1LuF64m (20 minutes)

Why use Bayesian data analysis?
- You have great flexibility when building models, and can focus on that, rather than computational issues.

"Marked fish get shy! It is half as likely to catch a marked fish compared to a fish that has not been marked."

Parameters: no. of fish → generative model: (1) mark 20 "fish", (2) sample 20 "fish", (3) count the no. of marked fish → data: 5 marked fish.

Parameters: no. of fish → generative model: (1) mark 20 "fish", (2) sample 20 "fish", where there is a 50% chance to sample a marked fish compared to an unmarked fish, (3) count the no. of marked fish → data: 5 marked fish.
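One way the modified catching step could be simulated in R (a sketch under the assumption that each of the 20 catches is a sequential weighted draw without replacement; the names are illustrative):

R code:
n_fish <- 160                                     # an example parameter value
fish <- rep(c("marked", "unmarked"), c(20, n_fish - 20))
weights <- ifelse(fish == "marked", 0.5, 1.0)     # "shy" marked fish are half as likely to be caught
catch <- sample(fish, size = 20, prob = weights)  # weighted sampling without replacement
sum(catch == "marked")                            # simulated number of marked fish in the catch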

Demo. The script that was "live coded" can be found here: http://rpubs.com/rasmusab/live_coding_user_2015_bayes_tutorial

Why use Bayesian data analysis?
- You have great flexibility when building models, and can focus on that, rather than computational issues.
- You can include information sources in addition to the data, for example, expert opinion.

"There has always been plenty of fish in the lake. Around 200, I would say!"

Demo. The script that was "live coded" can be found here: http://rpubs.com/rasmusab/live_coding_user_2015_bayes_tutorial

"If you're not using an informative prior, you're leaving money on the table." - Robert Weiss, UCLA, Los Angeles

Why use Bayesian data analysis?
- You have great flexibility when building models, and can focus on that, rather than computational issues.
- You can include information sources in addition to the data, for example, expert opinion.
- The result of a Bayesian analysis retains the uncertainty of the estimated parameters, which is very useful in decision analysis.

draw_id   no_fish
1         90
2         162
3         202
4         98
5         176
...       ...

draw_id   no_fish   no_fish * 100
1         90        9000
2         162       16200
3         202       20200
4         98        9800
5         176       17600
...       ...       ...

"If there are fewer than 50 fish in the lake, they won't last the season. It will cost 10 000 kr to plant new fish into the lake!"

draw_id   no_fish   catch 80 fish x 100 kr
1         90        8000
2         162       8000
3         202       8000
4         98        8000
5         71        7100
...       ...       ...

(Table of posterior draws with columns: draw_id, no_fish, catch 80 fish x 100 kr, fish left, repopulation cost.)

profit <- min(no_fish, 80) * 100 - (no_fish - 80 < 50) * 10000

mean(profit)
[1] -1013

What's the optimal catch quota? Catch quota: 27 fish. Expected profit: 2409 kr.
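A rough sketch of how such a quota could be found in R, assuming no_fish is the vector of posterior samples from above (the 100 kr price, the 50-fish threshold, and the 10 000 kr repopulation cost follow the slides; everything else is illustrative):

R code:
quotas <- 1:100
expected_profit <- sapply(quotas, function(quota) {
  profit <- pmin(no_fish, quota) * 100 -   # income: 100 kr per caught fish
    (no_fish - quota < 50) * 10000         # repopulation cost if fewer than 50 fish remain
  mean(profit)                             # expected profit over the posterior samples
})
quotas[which.max(expected_profit)]         # the slides report an optimum around 27 fish
max(expected_profit)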

Why use Bayesian data analysis?
- You have great flexibility when building models, and can focus on that, rather than computational issues.
- You can include information sources in addition to the data, for example, expert opinion.
- The result of a Bayesian analysis retains the uncertainty of the estimated parameters, which is very useful in decision analysis.
- You probably are already.

t.test(y)

t.test(y1, y2)

lm(y ~ 1 + x)

glm(y ~ 1 + x, family = "poisson")
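These classical calls have close Bayesian counterparts. As a sketch (with toy data of my own), a Bayesian regression analogous to lm(y ~ 1 + x) can be fitted with MCMCregress from the MCMCpack package:

R code:
library(MCMCpack)

x <- 1:20
y <- 2 + 0.5 * x + rnorm(20)      # toy data

fit <- MCMCregress(y ~ 1 + x)     # posterior samples for the intercept, slope, and error variance
summary(fit)                      # posterior means, SDs, and credible quantiles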

Why not use Bayesian data analysis?
- Everything is just working fine as it is.
- I'm not that interested in uncertainty.
- It's too computationally demanding.

Exercise 1: Bayesian A/B testing for Swedish Fish Incorporated. http://bit.ly/1SSCAaj

How to interpret and perform a Bayesian data analysis in R?
- Interpreting the result of a Bayesian data analysis is usually straightforward.

With 95% probability the support of the voters lies within this band.

How to interpret and perform a Bayesian data analysis in R?
- Interpreting the result of a Bayesian data analysis is usually straightforward.
- But if you scratch the surface there is a lot of Bayesian jargon!

(Plot of the prior and posterior distributions, annotated with the maximum likelihood estimate, the posterior median, and a 50% credible interval.)

More Bayesian jargon: priors
- Objective priors
- Subjective priors
- Informative priors
- Improper priors
- Conjugate priors
(Diagram spanning from a completely data-driven model to expert opinion, with Bayesian models in between.)

More Bayesian Jargon: Distributions!

More Bayesian jargon: distributions! The usual suspects: the Normal. x ~ Normal(µ, σ); x <- rnorm(n_draw, mu, sd)

More Bayesian jargon: distributions! The usual suspects: the Binomial. x ~ Binomial(p, n); x <- rbinom(n_draw, size, prob)

More Bayesian jargon: distributions! The usual suspects: the Poisson. x ~ Poisson(λ); x <- rpois(n_draw, lambda)

More Bayesian jargon: distributions! Less common beasts: the Beta. x ~ Beta(α, β); x <- rbeta(n_draw, shape1, shape2)

More Bayesian jargon: distributions! Less common beasts: the Gamma. x ~ Gamma(k, θ); x <- rgamma(n_draw, shape, scale)

More Bayesian jargon: distributions! Less common beasts: the Hypergeometric (the fish-pick model), and Fisher's noncentral hypergeometric distribution (the fish-pick model with shy marked fish). When it comes to distributions, Wikipedia is your friend!

More Bayesian jargon: samples, samples, samples.
Prior samples: [63, 30, 167, 30, 164, 222, 225, 42, 122, …]
Posterior samples: [167, 135, 148, 90, 162, 88, 98, 110, 176, …]
Methods to generate posterior samples:
- Approximate Bayesian Computation (ABC)
- Markov chain Monte Carlo (MCMC): Metropolis-Hastings, Gibbs sampling, Hamiltonian Monte Carlo
- Other methods: conjugate models, Laplace approximation, etc. etc. etc.

Faster Bayesian computation
- We have been doing approximate Bayesian computation, which is the most general and slowest method for fitting a Bayesian model.
- Faster methods have in common that they require that the likelihood that the generative model will generate any given data can be calculated.

Faster Bayesian computation
- We have been doing approximate Bayesian computation, which is the most general and slowest method for fitting a Bayesian model.
- Faster methods have in common that they require that the likelihood that the generative model will generate any given data can be calculated: P(5 marked | n fish).

Faster Bayesian computation
- We have been doing approximate Bayesian computation, which is the most general and slowest method for fitting a Bayesian model.
- Faster methods have in common that:
  - They require that the likelihood that the generative model will generate any given data can be calculated.
  - They explore the parameter space in a smarter way.
  - What you get are samples as if you would have done the analysis using approximate Bayesian computation.
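For the fish example the likelihood can be written down exactly as a hypergeometric probability, so the posterior can be computed on a grid instead of by simulation. A sketch, using the same Uniform(0, 250) prior and observed data as before (names are illustrative):

R code:
n_fish <- 20:250                                # grid of possible numbers of fish
prior  <- rep(1, length(n_fish))                # uniform prior over the grid
# P(5 marked | n_fish): 5 marked among the 20 caught, with 20 marked fish in the lake
likelihood <- dhyper(5, m = 20, n = n_fish - 20, k = 20)
posterior  <- prior * likelihood / sum(prior * likelihood)
sum(n_fish * posterior)                         # posterior mean of the number of fish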

MCMC: the Metropolis-Hastings algorithm
- The "classic" MCMC algorithm. Performs a random walk in the parameter space, and will stay at a parameter value proportional to its posterior probability.
- A good R implementation can be found in the MCMCpack package as the function MCMCmetrop1R(fun, theta.init, ...).
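A minimal sketch of calling MCMCmetrop1R, on a toy model of my own rather than the fish example (by default the function expects the log posterior, and extra arguments are passed on to it):

R code:
library(MCMCpack)

y <- c(5, 2, 7, 8, 3, 9, 1, 2)                  # toy data

# Log posterior for the mean mu of a Normal(mu, 1) model with a Normal(0, 10) prior.
log_post <- function(mu, y) {
  sum(dnorm(y, mean = mu, sd = 1, log = TRUE)) + dnorm(mu, mean = 0, sd = 10, log = TRUE)
}

samples <- MCMCmetrop1R(log_post, theta.init = 0, y = y, mcmc = 10000)
summary(samples)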


MCMC: Gibbs sampling and JAGS
- Similar to Metropolis, but moves by one parameter at a time.
- Can be much more efficient, but usually requires custom-built sampling schemes.
- Unless you use JAGS!

JAGS: Just Another Gibbs Sampler
- A cross-platform implementation of the BUGS language, an R-like probabilistic programming language.
- It builds a custom Gibbs sampler for you.
- Created by Martyn Plummer, member of the R core group.
- Made to be called from R.

JAGS: Just Another Gibbs Sampler
- JAGS is tailor-made for building generative models. A minimal JAGS program:

JAGS code:
model {
  n <- 30
  p ~ dunif(0, 1)
  x ~ dbinom(p, n)
}

R code:
n <- 30
p <- runif(1, 0, 1)
x <- rbinom(1, n, p)

JAGS code:
model {
  n <- 30
  p ~ dunif(0, 1)
  x ~ dbinom(p, n)
}

R code:
n <- 30
p <- runif(1, 0, 1)
x <- rbinom(1, n, p)

jags.model(..., data = list())

JAGS code:
model {
  n <- 30
  p ~ dunif(0, 1)
  x ~ dbinom(p, n)
}

R code:
n <- 30
p <- runif(1, 0, 1)
x <- rbinom(1, n, p)   # the "ABC step"

jags.model(..., data = list(x = 10))
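A sketch of how the minimal program above might be run in full from R, assuming the rjags package is installed (coda.samples then extracts posterior draws of p):

R code:
library(rjags)

model_string <- "model {
  n <- 30
  p ~ dunif(0, 1)
  x ~ dbinom(p, n)
}"

model <- jags.model(textConnection(model_string), data = list(x = 10))
samples <- coda.samples(model, variable.names = "p", n.iter = 10000)
summary(samples)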

JAGS: Just Another Gibbs Sampler
- JAGS is declarative; the order of the statements does not matter:

model {
  n <- 30
  p ~ dunif(0, 1)
  x ~ dbinom(p, n)
}

model {
  x ~ dbinom(p, n)
  n <- 30
  p ~ dunif(0, 1)
}

JAGS: Just Another Gibbs Sampler
- JAGS is declarative, so a node cannot be redefined:

model {
  n <- 30
  n <- n + 1        # not allowed: n is already defined
  p ~ dunif(0, 1)
  x ~ dbinom(p, n)
  x ~ dbinom(p, n)  # not allowed: x is already defined
}

JAGS: Just Another Gibbs Sampler
- JAGS is (unfortunately not) vectorized.

R code:
> m <- 1:5
> x <- rpois(5, m)
> x
[1] 0 2 5 4 6

JAGS code:
model {
  x ~ dpois(m)
}

JAGS: Just Another Gibbs Sampler
- JAGS is (unfortunately not) vectorized.

R code:
> m <- 1:5
> x <- rpois(5, m)
> x
[1] 0 2 5 4 6

JAGS code:
model {
  for(i in 1:length(m)) {
    x ~ dpois(m[i])
  }
}

JAGS: Just Another Gibbs Sampler
- JAGS is (unfortunately not) vectorized.

R code:
> m <- 1:5
> x <- rpois(5, m)
> x
[1] 0 2 5 4 6

JAGS code:
model {
  for(i in 1:length(m)) {
    x[i] ~ dpois(m[i])
  }
}

Demo. The script that was "live coded" can be found here: http://rpubs.com/rasmusab/live_coding_user_2015_bayes_tutorial

Exercise 2: Bayesian computation with JAGS and farmer Jöns. http://bit.ly/1RGuK0X

Fitting Bayesian models in R: pre-specified models and a general Metropolis algorithm (MCMCpack), and Gibbs sampling (JAGS).

Fitting Bayesian models in R: Hamiltonian Monte Carlo (Stan), integrated nested Laplace approximation (R-inla), and extra everything!

Some things we have not covered:
- Priors
- Distributions
- Decision analysis and "post-processing" of posteriors
- Model selection and Bayes factors
- Philosophy
- Math

To summarize Bayesian data analysis
What?
- Bayesian data analysis is a flexible method to fit any type of statistical model.
- Maximum likelihood is a special case of Bayesian model fitting.
Why?
- Makes it possible to define highly custom models.
- Makes it possible to include information from many sources, for example, data and expert knowledge.
- Quantifies and retains the uncertainty in parameter estimates and predictions.
How?
- R! Using ABC, MCMCpack, JAGS, STAN, R-inla, etc.

Summer reading / listening

"The theory of probabilities is at bottom nothingbut common sense reduced to calculus; itenables us to appreciate with exactness thatwhich accurate minds feel with a sort of instinctfor which ofttimes they are unable to account."Pierre-Simon LaplaceEssai philosophique sur les Probabilités (1814)

Introduction to Bayesian Data Analysis with R
Rasmus Bååth, Lund University
@rabaath · rasmus.baath@gmail.com · http://www.sumsar.net
