Notes On Bayesian Computation And MCMC


Adam N. Smith
July 31, 2021

This note reviews some of the key results underlying MCMC theory, discusses the theoretical underpinnings of popular MCMC algorithms (the Metropolis-Hastings algorithm and Gibbs sampler), and presents a few applications in the context of economic choice models.

1 Preliminaries

Markov chain Monte Carlo (MCMC) methods are an indispensable tool in the Bayesian paradigm. In some sense, MCMC put Bayesian analysis "on the map" by making it feasible to generate posterior samples from a much wider class of Bayesian models. Non-conjugate priors and intractable normalizing constants pose challenges to analytical posterior sampling (or preclude it altogether), but they are no longer issues with MCMC. The broad idea of MCMC is to construct a Markov chain that converges to the required posterior distribution. The goal of this note is to define canonical MCMC algorithms and briefly explain why they work.

It is first worth mentioning the fascinating history of Bayesian computation and MCMC methods, which has been discussed at length in Robert and Casella (2011) and Martin et al. (2020). A short version is as follows. The underlying methods date back to Los Alamos, New Mexico and statistical physics applications during World War II. The earliest developments include the Monte Carlo method (Metropolis and Ulam, 1949) and the Metropolis algorithm (Metropolis et al., 1953), which was then extended by Hastings (1970). The ideas of Gibbs sampling also have roots in Hastings (1970), but were rigorously laid out in Geman and Geman (1984). A seminal paper by Gelfand and Smith (1990) popularized MCMC in the Bayesian community, which sparked an MCMC revolution in the '90s. Robert and Casella (2011) also cite a conference held at Ohio State University in February 1991 as a seed of this revolution.
The conference was organized by Alan Gelfand, Prem Goel, and Adrian Smith and included talks from nearly everyone who would now make the "Who's Who of MCMC" list. The conference program is included in the appendix of Robert and Casella (2011).

Definitions. A Markov chain is a sequence of random variables θ^(0), θ^(1), ... evolving over time, where this evolution adheres to a specific Markov structure. That is, the present state θ^(r) only depends on the past through the last state θ^(r-1) or, formally,

    P(θ^(r) | θ^(0), θ^(1), ..., θ^(r-1)) = P(θ^(r) | θ^(r-1)).

A Markov chain is governed by the transition kernel P(θ, A), which defines the probability of reaching a set A ⊆ Θ from state θ ∈ Θ. Its density p(θ, ·) is defined by P(θ, A) = ∫_A p(θ, ϑ) dϑ and is a valid pdf conditional on θ.

The broad goal of Markov chain theory is to identify conditions under which a Markov chain possesses a (unique) stationary distribution and also converges to this stationary distribution in the limit. Formally, the distribution π is a stationary distribution of the Markov transition kernel P if

    ∫_Θ P(θ, A) π(θ) dθ = ∫_A π(ϑ) dϑ    (1)

To understand why MCMC algorithms "work", we first need to understand some of the key properties of Markov chain theory which will help us establish the existence and uniqueness of, and convergence to, this stationary distribution.

1. (Irreducibility) A Markov chain is π-irreducible if for any initial state θ, P(θ, A) > 0 whenever π(A) > 0. That is, there is always some way to reach any state (with positive probability under π) from any other state.

2. (Recurrence) A stronger version of irreducibility is Harris recurrence. A Markov chain is Harris recurrent if, for all A with π(A) > 0 and all θ, the chain will reach A with probability 1. As Chan and Geyer (1994) put it, "Harris recurrence essentially says that there is no measure-theoretic pathology ... the main point of Harris recurrence is that asymptotics do not depend on the starting distribution."

3. (Reversibility) A Markov chain is reversible if the distribution of (θ^(r), θ^(r+1)) is the same as the distribution of (θ^(r+1), θ^(r)). This imposes exchangeability between θ^(r) and θ^(r+1).
4. (Detailed Balance) A Markov chain with transition kernel P satisfies detailed balance with respect to the distribution π if

    P(θ, ϑ) π(θ) = P(ϑ, θ) π(ϑ)   for all θ, ϑ ∈ Θ.    (2)

Importantly, detailed balance is a sufficient (but not necessary) condition for reversibility.

5. (Periodicity) A Markov chain is periodic if there are portions of the state space that it can only visit at certain regularly spaced times; otherwise, the chain is aperiodic. A sufficient (but not necessary) condition for an aperiodic chain is P(θ^(r) = θ^(r-1)) > 0.

6. (Ergodicity) A Markov chain is ergodic if its limiting distribution equals its stationary distribution. That is, if for all sets A,

    lim_{r→∞} ‖P^(r)(θ, A) - π(A)‖ = 0    (3)

where ‖·‖ is the total variation norm.

Key Results. We now state a few key results connecting the properties above to desired properties of Markov chains. A formal presentation of these results and discussion of their proofs can be found in Robert and Casella (2004).

R1 If the Markov transition kernel P is reversible w.r.t. π, then π is a stationary distribution of P.

R2 If the Markov transition kernel P is π-irreducible, then π is the unique stationary distribution of P.

R3 If the Markov transition kernel P is π-irreducible and aperiodic, then ‖P^(r)(θ, A) - π(A)‖ → 0 for almost every θ. If P is also Harris recurrent, then convergence holds for every θ.

A corollary to R3 is an ergodic theorem (i.e., a law of large numbers) which says that sample averages converge to the appropriate population integral:

    lim_{R→∞} (1/R) Σ_{r=1}^{R} g(θ^(r)) = E_π[g(θ)]    (4)

This last result constitutes the second "MC" of MCMC. After we construct the appropriate Markov chain, we can use Monte Carlo integration to compute posterior moments for any set of parameters (or functions of parameters).
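To make the stationarity condition (1), detailed balance (2), and the ergodic theorem (4) concrete, here is a small numerical illustration. This is a sketch with an arbitrary three-state target and symmetric proposal matrix (not an example from the note itself):

```python
import numpy as np

# Illustration of R1 and the ergodic theorem (4) on a finite state space.
# The 3-state kernel P below is arbitrary but built to satisfy detailed
# balance with respect to pi, so pi is a stationary distribution.

pi = np.array([0.2, 0.3, 0.5])          # target distribution
Q = np.array([[0.0, 0.5, 0.5],          # symmetric proposal matrix
              [0.5, 0.0, 0.5],
              [0.5, 0.5, 0.0]])

# Metropolis-style acceptance min(1, pi_j / pi_i) yields detailed balance
P = Q * np.minimum(1.0, pi[None, :] / pi[:, None])
np.fill_diagonal(P, 1.0 - P.sum(axis=1))        # rejected mass stays put

flow = P * pi[:, None]
assert np.allclose(flow, flow.T)                # detailed balance, eq. (2)
assert np.allclose(pi @ P, pi)                  # stationarity, eq. (1)

# Ergodic theorem (4): long-run visit frequencies converge to pi
rng = np.random.default_rng(0)
state, visits = 0, np.zeros(3)
for _ in range(100_000):
    state = rng.choice(3, p=P[state])
    visits[state] += 1
print(visits / visits.sum())                    # close to pi = [0.2, 0.3, 0.5]
```

Because the kernel satisfies detailed balance with respect to π, π is stationary (R1), and the long-run visit frequencies match π, which is the ergodic average in (4) with g taken to be an indicator function.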

2 MCMC Algorithms

We now turn to the construction of MCMC algorithms. Specifically, an MCMC algorithm for simulating a distribution π is any algorithm producing an ergodic Markov chain {θ^(r)} whose stationary distribution is π. In our context, we need π to be a posterior π(θ|x) so that {θ^(r)} can be treated as draws from π(θ|x). Our "recipe" to check the validity of an MCMC algorithm is to verify that the transition kernel: (i) satisfies detailed balance with respect to π; (ii) is π-irreducible; and (iii) is aperiodic. Together, (i) and (ii) show that the Markov chain has the posterior as its unique stationary distribution, and (ii) and (iii) show that the chain is ergodic and thus converges to the posterior.

2.1 Metropolis-Hastings Algorithm

One of the most general and widely used MCMC algorithms is the Metropolis-Hastings (MH) algorithm (Metropolis et al., 1953; Hastings, 1970). Given a target posterior π(θ|x) and a conditional proposal density q(ϑ|θ), the algorithm generates a chain {θ^(r)} through the following steps.

Algorithm 1: Metropolis-Hastings

Initialize θ^(0).
For r = 1, ..., R:
1. Let θ = θ^(r-1) and generate a candidate value ϑ ∼ q(·|θ)
2. Compute the MH acceptance probability

       α(θ, ϑ) = min{1, [π(ϑ|x) q(θ|ϑ)] / [π(θ|x) q(ϑ|θ)]}

3. Set θ^(r) = ϑ with probability α(θ, ϑ); otherwise set θ^(r) = θ

Proof. Our first task is to verify that the MH algorithm is valid (formal results can be found in Tierney, 1994). Let θ and ϑ be two states of the chain.

(i) (Detailed Balance) The MH transition kernel is defined as

    P(θ, ϑ) = α(θ, ϑ) q(ϑ|θ)

The first step is to show that P(θ, ϑ) satisfies the detailed balance condition:

    α(θ, ϑ) q(ϑ|θ) π(θ|x) = α(ϑ, θ) q(θ|ϑ) π(ϑ|x)

First assume π(ϑ|x) q(θ|ϑ) ≥ π(θ|x) q(ϑ|θ) so that α(θ, ϑ) = 1 and, by symmetry, α(ϑ, θ) = π(θ|x) q(ϑ|θ) / [π(ϑ|x) q(θ|ϑ)]. It follows that

    (LHS) = α(θ, ϑ) q(ϑ|θ) π(θ|x) = q(ϑ|θ) π(θ|x)
    (RHS) = α(ϑ, θ) q(θ|ϑ) π(ϑ|x) = {π(θ|x) q(ϑ|θ) / [π(ϑ|x) q(θ|ϑ)]} q(θ|ϑ) π(ϑ|x) = π(θ|x) q(ϑ|θ)

so the LHS and RHS are equal and detailed balance is satisfied. Note that, because of symmetry, the same is true in the case when π(ϑ|x) q(θ|ϑ) < π(θ|x) q(ϑ|θ). Therefore, the MH update is reversible.

(ii) (π-Irreducibility) A sufficient condition for irreducibility is that q(ϑ|θ) > 0 for all (ϑ, θ) in the support of the posterior π(θ|x). Moreover, by Corollary 2 in Tierney (1994), if a Metropolis-Hastings chain is π-irreducible then it is Harris recurrent.

(iii) (Aperiodicity) Since the MH acceptance probability guarantees that some candidate draws will be "rejected", P(θ^(r) = θ^(r-1)) > 0 and the chain is aperiodic.

By (ii) and (iii) above, the MH chain is ergodic, as desired.

Virtues. A few comments are in order about the virtues of the MH algorithm. First, the algorithm is both simple and remarkably general in the sense that it "works" under very mild assumptions on proposal densities and with no assumptions of likelihood structure or prior conjugacy. Moreover, the ratio of posteriors in the MH acceptance ratio implies that the normalizing constants cancel, and so the algorithm only requires evaluations of the likelihood and prior. This is one of the main reasons why the MH algorithm was a breakthrough for Bayesian computation.
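Since the algorithm only ever evaluates the target up to its normalizing constant, a minimal sketch of Algorithm 1 fits in a few lines. This assumes a one-dimensional standard-normal target with a symmetric random-walk proposal; the function names and step size are illustrative choices, not part of the note:

```python
import numpy as np

# A minimal sketch of Algorithm 1 (random-walk MH) for a one-dimensional
# target known only up to a normalizing constant -- here exp(-theta^2/2),
# i.e., a standard normal. Working on the log scale avoids underflow.

def log_target(theta):
    return -0.5 * theta**2      # log pi(theta|x) up to an additive constant

def metropolis_hastings(log_target, theta0, R, step, seed=0):
    rng = np.random.default_rng(seed)
    theta = theta0
    draws = np.empty(R)
    for r in range(R):
        cand = theta + step * rng.standard_normal()              # step 1
        log_alpha = min(0.0, log_target(cand) - log_target(theta))  # step 2
        if np.log(rng.uniform()) < log_alpha:                    # step 3
            theta = cand
        draws[r] = theta
    return draws

draws = metropolis_hastings(log_target, theta0=0.0, R=100_000, step=2.4)
print(draws.mean(), draws.var())   # close to the target's mean 0 and variance 1
```

In a posterior-sampling application, `log_target` would return the log likelihood plus the log prior; the normalizing constant of the posterior never appears.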

Proposal Densities. There are two main classes of proposal densities, each leading to a different "type" of MH algorithm. The first specifies proposals of the form

    q(ϑ|θ) = q(ϑ)    (5)

and so the proposed value is independent of the current state. MH algorithms with such proposals are referred to as independence MH algorithms. The second specifies proposals of the form

    q(ϑ|θ) = q(‖ϑ - θ‖)    (6)

and are thus symmetric. In this case, we can write ϑ = θ + ε where ε has a symmetric distribution. MH algorithms with this type of symmetric proposal density are referred to as random-walk MH algorithms. One feature of symmetric proposals is that the ratio of proposals cancels in the MH acceptance ratio, and so the only objects that must be evaluated are the likelihood and prior. In fact, the original Metropolis algorithm (Metropolis et al., 1953) assumed symmetric proposals, and so one innovation of Hastings (1970) was to provide an extension to asymmetric proposals.

2.2 Gibbs Sampler

The MH algorithm is very general in the sense that the only requirements are evaluations of the likelihood, prior, and proposal densities. The Gibbs sampler requires a stronger set of conditions to be met: namely, given a p-dimensional parameter vector θ = (θ_1, ..., θ_p) and target posterior π(θ|x), we must be able to sample from each parameter's full conditional distribution π(θ_j | θ_{-j}, x). Consequently, Gibbs samplers are most commonly used when some form of conditional conjugacy exists so that the full conditionals can be expressed analytically.

The idea of Gibbs sampling is in some ways both rooted in and justified by the Hammersley-Clifford Theorem, which proves that we can write a joint distribution p(θ_1, ..., θ_p) in terms of only the full conditional distributions p(θ_j | θ_{-j}). For example, consider the bivariate distribution p(θ_1, θ_2). It follows that

    p(θ_1, θ_2) = p(θ_2|θ_1) p(θ_1)
                = p(θ_2|θ_1) · 1 / [1/p(θ_1)]
                = p(θ_2|θ_1) / ∫ [p(θ_2)/p(θ_1)] dθ_2
                = p(θ_2|θ_1) / ∫ {[p(θ_1,θ_2)/p(θ_1)] / [p(θ_1,θ_2)/p(θ_2)]} dθ_2
                = p(θ_2|θ_1) / ∫ [p(θ_2|θ_1)/p(θ_1|θ_2)] dθ_2    (7)

and so the set of full conditional distributions, p(θ_1|θ_2) and p(θ_2|θ_1), summarize all information in the joint distribution.

Formally, given a target posterior π(θ|x), the Gibbs sampler generates a chain {θ^(r)} by iteratively sampling from each parameter's full conditional distribution.

Algorithm 2: Gibbs Sampler

Initialize θ^(0) = (θ_1^(0), ..., θ_p^(0)).
For r = 1, ..., R:
1. Sample θ_1^(r) ∼ π(θ_1 | θ_2^(r-1), ..., θ_p^(r-1), x)
2. Sample θ_2^(r) ∼ π(θ_2 | θ_1^(r), θ_3^(r-1), ..., θ_p^(r-1), x)
   ...
p. Sample θ_p^(r) ∼ π(θ_p | θ_1^(r), θ_2^(r), ..., θ_{p-1}^(r), x)

Proof. To demonstrate validity, we will show that the Gibbs sampler is a special case of the MH algorithm with acceptance probability equal to 1. Consider a move from θ = (θ_1, ..., θ_j, ..., θ_p) to ϑ = (θ_1, ..., ϑ_j, ..., θ_p). That is, we modify the jth component of the θ vector. The proposals under a Gibbs sampler take the form:

    q(ϑ|θ) = π(ϑ_j | θ_{-j}, x)
    q(θ|ϑ) = π(θ_j | θ_{-j}, x)

and so the MH acceptance ratio is equal to:

    [π(ϑ|x) q(θ|ϑ)] / [π(θ|x) q(ϑ|θ)]
      = [π(ϑ_j | θ_{-j}, x) π(θ_{-j} | x) · π(θ_j | θ_{-j}, x)] / [π(θ_j | θ_{-j}, x) π(θ_{-j} | x) · π(ϑ_j | θ_{-j}, x)]
      = 1.

Thus, a Gibbs update for θ_j is equivalent to an MH update where the acceptance probability is equal to 1.

3 Examples

In this section, we show how the MCMC algorithms discussed above can be applied to Bayesian choice models. We specifically consider a binary probit model (Gibbs sampler), a multinomial logit model (MH algorithm), and a hierarchical multinomial logit model (hybrid Gibbs or Metropolis-within-Gibbs sampler). These examples can also be found in Chapters 3.8, 3.11, and 5.4 of Rossi et al. (2005).

3.1 Binary probit model (Gibbs sampler)

The binary probit model is a latent variable model with a binary outcome y_i ∈ {0, 1}:

    z_i = x_i'β + ε_i,   ε_i ∼ N(0, 1)    (8)
    y_i = 1 if z_i > 0, and y_i = 0 otherwise    (9)

The first step of a Bayesian analysis is to write down the likelihood, prior, and posterior.

(i) Likelihood

    p(y | X, β) = ∏_{i=1}^{n} Φ(x_i'β)^{y_i} [1 - Φ(x_i'β)]^{1-y_i}

(ii) Prior

    β ∼ N(β̄, A⁻¹)

(iii) Posterior

    π(β | y, X) ∝ p(y | X, β) p(β)
               = [∏_{i=1}^{n} p(y_i | x_i, β)] p(β)
               = [∏_{i=1}^{n} Φ(x_i'β)^{y_i} [1 - Φ(x_i'β)]^{1-y_i}] exp{-(1/2)(β - β̄)'A(β - β̄)}

The computational challenge with this model is that the posterior does not have an analytic expression (conjugate priors do not exist). Direct analytic sampling from the posterior is therefore infeasible.

Let's see how we can set up a Gibbs sampler to solve this problem. The first "solution" is to treat the latent z_i as a model parameter, thus augmenting the parameter vector to be (β, z_1, ..., z_n). This is referred to as data augmentation and was a contemporaneous development to the Gibbs sampler (Tanner and Wong, 1987). We now have a set of two full conditional distributions: π(β | z_1, ..., z_n, y, X) and π(z_1, ..., z_n | β, y, X).

Note that conditional on the z vector, the full conditional for β becomes the posterior of a Bayesian linear regression model with normal conjugate priors (or more specifically, conditionally conjugate priors). Therefore, we have

    β | z, y, X ∼ N(β̃, (X'X + A)⁻¹)    (10)

where β̃ = (X'X + A)⁻¹(X'z + Aβ̄). So conditional on knowing the latent outcome variables, we have an analytic expression for the posterior from which we can easily generate samples. Turning to the second full conditional, z_i is a truncated normal random variable with truncation points governed by the choice outcome y_i. That is,

    z_i | β, y_i = 1, x_i ∼ N(x_i'β, 1) · I_[0,∞)(z_i)    (11)
    z_i | β, y_i = 0, x_i ∼ N(x_i'β, 1) · I_(-∞,0)(z_i)

and so z_i is truncated below at 0 if y_i = 1 and truncated above at 0 if y_i = 0. Sampling from this full conditional can be done using a function like rtnorm() in R, for example.

Together, (10) and (11) give us analytic expressions for the necessary full conditional distributions. We can then use a Gibbs sampler to sample from the posterior. (The function rbprobitGibbs() in the bayesm R package (Rossi, 2019) implements this Gibbs sampling algorithm for the binary probit model.) Specifically, initialize β^(0), z_1^(0), ..., z_n^(0) and then for r = 1, ..., R, do the following.

- Sample β^(r) | z_1^(r-1), ..., z_n^(r-1), y, X from (10)
- Sample z_1^(r) | β^(r), y, X from (11)
  ...
- Sample z_n^(r) | β^(r), y, X from (11)

Note that the terms z_{-i}^(r-1) are absent from the right-hand side of the z_i full conditional since the z_i's are conditionally independent given β.

3.2 Multinomial logit model (Metropolis-Hastings algorithm)

The multinomial logit model is another latent variable model, but now with a multinomial outcome y_i ∈ {1, ..., J} where

    u_ij = x_ij'β + ε_ij,   ε_ij ∼ TIEV (Type I extreme value)

and

    y_i = j if u_ij ≥ u_ik for all k ≠ j.

We again start by writing down the likelihood, prior, and posterior.

(i) Likelihood

    p(y | β, X) = ∏_{i=1}^{n} ∏_{j=1}^{J} p_ij^{1(y_i = j)},   p_ij = exp(x_ij'β) / Σ_{k=1}^{J} exp(x_ik'β)

(ii) Prior

    β ∼ N(β̄, A⁻¹)

(iii) Posterior

    π(β | y, X) ∝ p(y | X, β) p(β)
               = [∏_{i=1}^{n} p(y_i | x_i, β)] p(β)
               = [∏_{i=1}^{n} ∏_{j=1}^{J} (exp(x_ij'β) / Σ_{k=1}^{J} exp(x_ik'β))^{1(y_i = j)}] exp{-(1/2)(β - β̄)'A(β - β̄)}

Just like the case of the binary probit model, the computational challenge with the multinomial logit model is that the posterior does not have an analytic expression and so direct analytic sampling from the posterior above is infeasible. Moreover, we cannot find conjugate or even conditionally conjugate priors for β, so a Gibbs sampler is also infeasible.

We therefore turn to a random-walk MH algorithm. (The function rmnlIndepMetrop() in the bayesm R package (Rossi, 2019) implements an independence MH algorithm for the multinomial logit model.) Initialize β^(0) and then for r = 1, ..., R, do the following.

- Generate a candidate value β* ∼ N(β^(r-1), s²)
- Compute the MH acceptance probability

      α(β^(r-1), β*) = min{1, [p(y | X, β*) p(β*)] / [p(y | X, β^(r-1)) p(β^(r-1))]}

- Set β^(r) = β* with probability α(β^(r-1), β*) and set β^(r) = β^(r-1) otherwise

Note that the proposal density is parameterized by a variance term s² which acts as a "step size" of the MH proposal. For guidance on choosing s² in the context of logit models, see Section 3.11 of Rossi et al. (2005). Also note that the random-walk proposal is symmetric and so the MH acceptance ratio only involves evaluations of the likelihood and prior (and not the proposal).
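The random-walk MH steps above can be sketched on simulated data. This is an illustrative sketch, not the bayesm implementation: the simulated dimensions, prior settings, and step size s are all assumptions made here.

```python
import numpy as np

# A sketch of the random-walk MH sampler for a multinomial logit model on
# simulated data. Only the log likelihood and log prior are ever evaluated,
# since the symmetric proposal cancels in the acceptance ratio.

rng = np.random.default_rng(0)

n, J, k = 500, 3, 2                           # choices, alternatives, covariates
beta_true = np.array([1.0, -0.5])
X = rng.standard_normal((n, J, k))            # alternative-specific attributes
U = X @ beta_true + rng.gumbel(size=(n, J))   # utilities with TIEV errors
y = U.argmax(axis=1)                          # observed choices

beta_bar, A = np.zeros(k), 0.01 * np.eye(k)   # prior: beta ~ N(beta_bar, A^{-1})

def log_post(beta):
    v = X @ beta                                          # utilities x_ij' beta
    loglik = (v[np.arange(n), y] - np.log(np.exp(v).sum(axis=1))).sum()
    logprior = -0.5 * (beta - beta_bar) @ A @ (beta - beta_bar)
    return loglik + logprior

R, s = 5000, 0.1                              # number of draws and RW step size
beta, lp = np.zeros(k), log_post(np.zeros(k))
draws = np.empty((R, k))
for r in range(R):
    cand = beta + s * rng.standard_normal(k)  # symmetric N(beta, s^2 I) proposal
    lp_cand = log_post(cand)
    if np.log(rng.uniform()) < lp_cand - lp:  # accept with prob min(1, ratio)
        beta, lp = cand, lp_cand
    draws[r] = beta

print(draws[1000:].mean(axis=0))              # posterior mean, near beta_true
```

After discarding a burn-in, the posterior mean recovers the data-generating coefficients; tuning s trades off acceptance rate against step size, as discussed above.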

3.3 Hierarchical multinomial logit model (Metropolis-within-Gibbs)

A hierarchical multinomial logit model extends the logit model above to a panel data setting with customers i = 1, ..., n and purchase occasions t = 1, ..., T_i:

    u_ijt = x_ijt'β_i + ε_ijt,   ε_ijt ∼ TIEV

and

    y_it = j if u_ijt ≥ u_ikt for all k ≠ j.

We again start by writing down the likelihood, prior, and posterior.

(i) Likelihood

    p(y | X, β) = ∏_{i=1}^{n} ∏_{t=1}^{T_i} ∏_{j=1}^{J} p_ijt^{1(y_it = j)},   p_ijt = exp(x_ijt'β_i) / Σ_{k=1}^{J} exp(x_ikt'β_i)

(ii) Prior

    β_i | Δ, V_β ∼ N(Δ'z_i, V_β)

Here z_i is a d-vector of observed customer characteristics (like demographics) and Δ is a d × p matrix describing how average preferences change with observed characteristics. While the mean function captures "observed heterogeneity", the covariance matrix V_β captures "unobserved heterogeneity." That is, two customers are allowed to have different β_i vectors even if their z_i vectors are identical. Since we want to make inferences about (Δ, V_β), we need to specify a second-stage prior:

    vec(Δ) | V_β ∼ N(vec(Δ̄), V_β ⊗ A⁻¹)
    V_β ∼ IW(ν, V)

which is the conjugate prior for a multivariate regression model (Rossi et al., 2005).

(iii) Posterior

    π(β, Δ, V_β | y, X, z) ∝ p(y | X, β) p(β | Δ, V_β) p(Δ, V_β)
      = [∏_{i=1}^{n} (∏_{t=1}^{T_i} p(y_it | x_it, β_i)) p(β_i | Δ, V_β)] p(Δ | V_β) p(V_β)
      = [∏_{i=1}^{n} ∏_{t=1}^{T_i} ∏_{j=1}^{J} (exp(x_ijt'β_i) / Σ_{k=1}^{J} exp(x_ikt'β_i))^{1(y_it = j)} exp{-(1/2)(β_i - Δ'z_i)'V_β⁻¹(β_i - Δ'z_i)}]
        × |V_β|^{-d/2} exp{-(1/2) tr[(Δ - Δ̄)'A(Δ - Δ̄)V_β⁻¹]}
        × |V_β|^{-(ν+p+1)/2} exp{-(1/2) tr(V V_β⁻¹)}

Just like both models described above, the computational challenge with this model is that the posterior does not have an analytic expression and so direct analytic sampling from the posterior is infeasible. But the hierarchical nature of the model allows us to sample "blocks" of parameters at a time. In particular, we can design a Gibbs sampler that iteratively samples from the following full conditionals:

    β | Δ, V_β, y, X
    Δ, V_β | β, y, X

where the first block is the set of customer-level parameters β_1, ..., β_n and the second is the set of population-level parameters (Δ, V_β). Note that the sampling problem associated with the β vector is the same problem that appeared above for the multinomial logit model. The only difference is that here we have panel data, and so we must loop over customers and do a total of n MH updates. Then, conditional on the β's, there is an analytic expression for the full conditional for (Δ, V_β) and so sampling is exact.

We will use a Metropolis-within-Gibbs algorithm to sample from the posterior above. (The function rhierMnlRwMixture() in the bayesm R package (Rossi, 2019) implements this Metropolis-within-Gibbs algorithm for the hierarchical multinomial logit model; the function is more general in that it allows for mixture-of-normals heterogeneity.) This hybrid algorithm gets its name from the fact that we are iteratively sampling between full conditionals (i.e., Gibbs sampling), but we are swapping in an MH update for parameters with non-conjugate priors and complicated full conditionals (i.e., the β_i's). Specifically, initialize the vector of model parameters (β_1^(0), ..., β_n^(0), Δ^(0), V_β^(0)) and then for r = 1, ..., R, do the following.

- (MH update) For i = 1, ..., n:
  - Generate a candidate value β_i* ∼ N(β_i^(r-1), s² V_β^(r-1))
  - Compute the MH acceptance probability

        α(β_i^(r-1), β_i*) = min{1, [p(y_i | x_i, β_i*) p(β_i* | Δ^(r-1), V_β^(r-1))] / [p(y_i | x_i, β_i^(r-1)) p(β_i^(r-1) | Δ^(r-1), V_β^(r-1))]}

  - Set β_i^(r) = β_i* with probability α(β_i^(r-1), β_i*) and set β_i^(r) = β_i^(r-1) otherwise

- (Gibbs update) Sample (Δ^(r), V_β^(r)) from the posterior of a Bayesian multivariate regression model. Although the structure of this posterior closely matches that of a Bayesian multiple regression model, there is more matrix algebra given the multivariate nature of the response. See Section 2.12 of Rossi et al. (2005) for details.

References

Chan, K. S. and Geyer, C. J. (1994). Discussion: Markov chains for exploring posterior distributions. The Annals of Statistics, 22(4):1747–1758.

Gelfand, A. E. and Smith, A. F. M. (1990). Sampling-based approaches to calculating marginal densities. Journal of the American Statistical Association, 85(410):398–409.

Geman, S. and Geman, D. (1984). Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-6(6):721–741.

Hastings, W. K. (1970). Monte Carlo sampling methods using Markov chains and their applications. Biometrika, 57(1):97–109.

Martin, G. M., Frazier, D. T., and Robert, C. P. (2020). Computing Bayes: Bayesian computation from 1763 to the 21st century. Working Paper.

Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., Teller, A. H., and Teller, E. (1953). Equation of state calculations by fast computing machines. The Journal of Chemical Physics, 21(6):1087–1092.

Metropolis, N. and Ulam, S. (1949). The Monte Carlo method. Journal of the American Statistical Association, 44(247):335–341.

Robert, C. and Casella, G. (2011). A short history of Markov chain Monte Carlo: Subjective recollections from incomplete data. Statistical Science, 26(1):102–115.

Robert, C. P. and Casella, G. (2004). Monte Carlo Statistical Methods. Springer.

Rossi, P. E. (2019). bayesm: Bayesian Inference for Marketing/Micro-Econometrics. R package version 3.1-4.

Rossi, P. E., Allenby, G. M., and McCulloch, R. (2005). Bayesian Statistics and Marketing. John Wiley & Sons.

Tanner, M. A. and Wong, W. H. (1987). The calculation of posterior distributions by data augmentation. Journal of the American Statistical Association, 82(398):528–540.

Tierney, L. (1994). Markov chains for exploring posterior distributions. The Annals of Statistics, 22(4):1701–1728.
