The Multivariate Gaussian Distribution


Chuong B. Do
October 10, 2008

A vector-valued random variable $X = \begin{bmatrix} X_1 & \cdots & X_n \end{bmatrix}^T$ is said to have a multivariate normal (or Gaussian) distribution with mean $\mu \in \mathbb{R}^n$ and covariance matrix $\Sigma \in \mathbb{S}^n_{++}$ [1] if its probability density function [2] is given by

$$p(x; \mu, \Sigma) = \frac{1}{(2\pi)^{n/2} |\Sigma|^{1/2}} \exp\left(-\frac{1}{2}(x - \mu)^T \Sigma^{-1} (x - \mu)\right).$$

We write this as $X \sim \mathcal{N}(\mu, \Sigma)$. In these notes, we describe multivariate Gaussians and some of their basic properties.

1  Relationship to univariate Gaussians

Recall that the density function of a univariate normal (or Gaussian) distribution is given by

$$p(x; \mu, \sigma^2) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left(-\frac{1}{2\sigma^2}(x - \mu)^2\right).$$

Here, the argument of the exponential function, $-\frac{1}{2\sigma^2}(x - \mu)^2$, is a quadratic function of the variable $x$. Furthermore, the parabola points downwards, as the coefficient of the quadratic term is negative. The coefficient in front, $\frac{1}{\sqrt{2\pi}\,\sigma}$, is a constant that does not depend on $x$; hence, we can think of it as simply a "normalization factor" used to ensure that

$$\frac{1}{\sqrt{2\pi}\,\sigma} \int_{-\infty}^{\infty} \exp\left(-\frac{1}{2\sigma^2}(x - \mu)^2\right) dx = 1.$$

[1] Recall from the section notes on linear algebra that $\mathbb{S}^n_{++}$ is the space of symmetric positive definite $n \times n$ matrices, defined as $\mathbb{S}^n_{++} = \{A \in \mathbb{R}^{n \times n} : A = A^T \text{ and } x^T A x > 0 \text{ for all } x \in \mathbb{R}^n \text{ such that } x \neq 0\}$.
[2] In these notes, we use the notation $p(\cdot)$ to denote density functions, instead of $f_X(\cdot)$ (as in the section notes on probability theory).
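As a concrete check on the density formula, here is a minimal sketch (assuming NumPy and SciPy are available; the helper name `gaussian_density` is ours, introduced purely for illustration) that evaluates the multivariate density directly from the definition and cross-checks it against `scipy.stats.multivariate_normal`:

```python
# Minimal sketch: evaluate p(x; mu, Sigma) from the formula above and
# cross-check against SciPy's implementation. Assumes NumPy/SciPy.
import numpy as np
from scipy.stats import multivariate_normal

def gaussian_density(x, mu, sigma):
    """Density of N(mu, Sigma) at x, computed from the definition."""
    n = mu.shape[0]
    diff = x - mu
    norm_const = 1.0 / ((2 * np.pi) ** (n / 2) * np.sqrt(np.linalg.det(sigma)))
    # diff @ solve(sigma, diff) computes (x - mu)^T Sigma^{-1} (x - mu)
    return norm_const * np.exp(-0.5 * diff @ np.linalg.solve(sigma, diff))

mu = np.array([1.0, -1.0])
sigma = np.array([[2.0, 0.5], [0.5, 1.0]])  # symmetric positive definite
x = np.array([0.5, 0.0])

print(gaussian_density(x, mu, sigma))
print(multivariate_normal(mean=mu, cov=sigma).pdf(x))  # should agree
```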

[Figure 1: The figure on the left shows a univariate Gaussian density for a single variable $X$. The figure on the right shows a multivariate Gaussian density over two variables $X_1$ and $X_2$.]

In the case of the multivariate Gaussian density, the argument of the exponential function, $-\frac{1}{2}(x - \mu)^T \Sigma^{-1} (x - \mu)$, is a quadratic form in the vector variable $x$. Since $\Sigma$ is positive definite, and since the inverse of any positive definite matrix is also positive definite, then for any non-zero vector $z$, $z^T \Sigma^{-1} z > 0$. This implies that for any vector $x \neq \mu$,

$$(x - \mu)^T \Sigma^{-1} (x - \mu) > 0$$
$$-\frac{1}{2}(x - \mu)^T \Sigma^{-1} (x - \mu) < 0.$$

Like in the univariate case, you can think of the argument of the exponential function as being a downward opening quadratic bowl. The coefficient in front (i.e., $\frac{1}{(2\pi)^{n/2}|\Sigma|^{1/2}}$) has an even more complicated form than in the univariate case. However, it still does not depend on $x$, and hence it is again simply a normalization factor used to ensure that

$$\frac{1}{(2\pi)^{n/2}|\Sigma|^{1/2}} \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} \exp\left(-\frac{1}{2}(x - \mu)^T \Sigma^{-1} (x - \mu)\right) dx_1\, dx_2 \cdots dx_n = 1.$$

2  The covariance matrix

The concept of the covariance matrix is vital to understanding multivariate Gaussian distributions. Recall that for a pair of random variables $X$ and $Y$, their covariance is defined as

$$\text{Cov}[X, Y] = E[(X - E[X])(Y - E[Y])] = E[XY] - E[X]E[Y].$$

When working with multiple variables, the covariance matrix provides a succinct way to summarize the covariances of all pairs of variables. In particular, the covariance matrix, which we usually denote as $\Sigma$, is the $n \times n$ matrix whose $(i, j)$th entry is $\text{Cov}[X_i, X_j]$.
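For instance, a short sketch (NumPy assumed; the data here is arbitrary synthetic data) that assembles the covariance matrix entry by entry from this definition and confirms it matches NumPy's built-in estimator:

```python
# Sketch: build the covariance matrix from pairwise covariances and compare
# with np.cov. Assumes NumPy; the sample data is arbitrary.
import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((10000, 3)) @ rng.standard_normal((3, 3))  # correlated columns

mean = X.mean(axis=0)
n_vars = X.shape[1]
Sigma = np.empty((n_vars, n_vars))
for i in range(n_vars):
    for j in range(n_vars):
        # (i, j) entry: Cov[Xi, Xj] = E[(Xi - E[Xi])(Xj - E[Xj])]
        Sigma[i, j] = np.mean((X[:, i] - mean[i]) * (X[:, j] - mean[j]))

print(np.allclose(Sigma, np.cov(X, rowvar=False, bias=True)))  # True
```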

The following proposition (whose proof is provided in Appendix A.1) gives an alternative way to characterize the covariance matrix of a random vector $X$:

Proposition 1. For any random vector $X$ with mean $\mu$ and covariance matrix $\Sigma$,

$$\Sigma = E[(X - \mu)(X - \mu)^T] = E[XX^T] - \mu\mu^T. \tag{1}$$

In the definition of multivariate Gaussians, we required that the covariance matrix $\Sigma$ be symmetric positive definite (i.e., $\Sigma \in \mathbb{S}^n_{++}$). Why does this restriction exist? As seen in the following proposition, the covariance matrix of any random vector must always be symmetric positive semidefinite:

Proposition 2. Suppose that $\Sigma$ is the covariance matrix corresponding to some random vector $X$. Then $\Sigma$ is symmetric positive semidefinite.

Proof. The symmetry of $\Sigma$ follows immediately from its definition. Next, for any vector $z \in \mathbb{R}^n$, observe that

$$z^T \Sigma z = \sum_{i=1}^n \sum_{j=1}^n (\Sigma_{ij} z_i z_j) \tag{2}$$
$$= \sum_{i=1}^n \sum_{j=1}^n (\text{Cov}[X_i, X_j] \cdot z_i z_j)$$
$$= \sum_{i=1}^n \sum_{j=1}^n (E[(X_i - E[X_i])(X_j - E[X_j])] \cdot z_i z_j)$$
$$= E\left[\sum_{i=1}^n \sum_{j=1}^n (X_i - E[X_i])(X_j - E[X_j]) \cdot z_i z_j\right]. \tag{3}$$

Here, (2) follows from the formula for expanding a quadratic form (see section notes on linear algebra), and (3) follows by linearity of expectations (see probability notes). To complete the proof, observe that the quantity inside the brackets is of the form $\sum_i \sum_j x_i x_j z_i z_j = (x^T z)^2 \geq 0$ (see problem set #1). Therefore, the quantity inside the expectation is always nonnegative, and hence the expectation itself must be nonnegative. We conclude that $z^T \Sigma z \geq 0$.

From the above proposition it follows that $\Sigma$ must be symmetric positive semidefinite in order for it to be a valid covariance matrix. However, in order for $\Sigma^{-1}$ to exist (as required in the definition of the multivariate Gaussian density), $\Sigma$ must be invertible and hence full rank. Since any full rank symmetric positive semidefinite matrix is necessarily symmetric positive definite, it follows that $\Sigma$ must be symmetric positive definite.
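A quick numerical illustration of Proposition 2 (a sketch assuming NumPy, with arbitrary sample data): any sample covariance matrix has nonnegative eigenvalues, and $z^T \Sigma z \geq 0$ for arbitrary vectors $z$:

```python
# Sketch: check positive semidefiniteness of a covariance matrix, both via
# eigenvalues and via random quadratic forms. Assumes NumPy.
import numpy as np

rng = np.random.default_rng(1)
X = rng.standard_normal((5000, 4)) @ rng.standard_normal((4, 4))
Sigma = np.cov(X, rowvar=False)

# All eigenvalues nonnegative (up to floating-point rounding).
print(np.all(np.linalg.eigvalsh(Sigma) >= -1e-10))  # True

# z^T Sigma z >= 0 for arbitrary vectors z.
for _ in range(5):
    z = rng.standard_normal(4)
    assert z @ Sigma @ z >= 0
```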

3  The diagonal covariance matrix case

To get an intuition for what a multivariate Gaussian is, consider the simple case where $n = 2$, and where the covariance matrix $\Sigma$ is diagonal, i.e.,

$$x = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} \qquad \mu = \begin{bmatrix} \mu_1 \\ \mu_2 \end{bmatrix} \qquad \Sigma = \begin{bmatrix} \sigma_1^2 & 0 \\ 0 & \sigma_2^2 \end{bmatrix}$$

In this case, the multivariate Gaussian density has the form,

$$p(x; \mu, \Sigma) = \frac{1}{2\pi \left| \begin{matrix} \sigma_1^2 & 0 \\ 0 & \sigma_2^2 \end{matrix} \right|^{1/2}} \exp\left(-\frac{1}{2} \begin{bmatrix} x_1 - \mu_1 \\ x_2 - \mu_2 \end{bmatrix}^T \begin{bmatrix} \sigma_1^2 & 0 \\ 0 & \sigma_2^2 \end{bmatrix}^{-1} \begin{bmatrix} x_1 - \mu_1 \\ x_2 - \mu_2 \end{bmatrix}\right)$$
$$= \frac{1}{2\pi (\sigma_1^2 \cdot \sigma_2^2 - 0 \cdot 0)^{1/2}} \exp\left(-\frac{1}{2} \begin{bmatrix} x_1 - \mu_1 \\ x_2 - \mu_2 \end{bmatrix}^T \begin{bmatrix} \frac{1}{\sigma_1^2} & 0 \\ 0 & \frac{1}{\sigma_2^2} \end{bmatrix} \begin{bmatrix} x_1 - \mu_1 \\ x_2 - \mu_2 \end{bmatrix}\right),$$

where we have relied on the explicit formula for the determinant of a $2 \times 2$ matrix [3], and the fact that the inverse of a diagonal matrix is simply found by taking the reciprocal of each diagonal entry. Continuing,

$$p(x; \mu, \Sigma) = \frac{1}{2\pi \sigma_1 \sigma_2} \exp\left(-\frac{1}{2} \begin{bmatrix} x_1 - \mu_1 \\ x_2 - \mu_2 \end{bmatrix}^T \begin{bmatrix} \frac{1}{\sigma_1^2}(x_1 - \mu_1) \\ \frac{1}{\sigma_2^2}(x_2 - \mu_2) \end{bmatrix}\right)$$
$$= \frac{1}{2\pi \sigma_1 \sigma_2} \exp\left(-\frac{1}{2\sigma_1^2}(x_1 - \mu_1)^2 - \frac{1}{2\sigma_2^2}(x_2 - \mu_2)^2\right)$$
$$= \frac{1}{\sqrt{2\pi}\,\sigma_1} \exp\left(-\frac{1}{2\sigma_1^2}(x_1 - \mu_1)^2\right) \cdot \frac{1}{\sqrt{2\pi}\,\sigma_2} \exp\left(-\frac{1}{2\sigma_2^2}(x_2 - \mu_2)^2\right).$$

The last equation we recognize to simply be the product of two independent Gaussian densities, one with mean $\mu_1$ and variance $\sigma_1^2$, and the other with mean $\mu_2$ and variance $\sigma_2^2$.

More generally, one can show that an $n$-dimensional Gaussian with mean $\mu \in \mathbb{R}^n$ and diagonal covariance matrix $\Sigma = \text{diag}(\sigma_1^2, \sigma_2^2, \ldots, \sigma_n^2)$ is the same as a collection of $n$ independent Gaussian random variables with mean $\mu_i$ and variance $\sigma_i^2$, respectively.

[3] Namely, $\left| \begin{matrix} a & b \\ c & d \end{matrix} \right| = ad - bc$.
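This factorization is easy to verify numerically. A sketch (assuming NumPy and SciPy; the particular means and variances are arbitrary) checking that the joint density with diagonal $\Sigma$ equals the product of the two univariate densities:

```python
# Sketch: for diagonal Sigma, the bivariate Gaussian density factors into a
# product of univariate Gaussian densities. Assumes NumPy/SciPy.
import numpy as np
from scipy.stats import multivariate_normal, norm

mu = np.array([1.0, -2.0])
sig = np.array([0.5, 2.0])                       # standard deviations
joint = multivariate_normal(mean=mu, cov=np.diag(sig ** 2))

x = np.array([0.7, -1.1])
lhs = joint.pdf(x)
rhs = norm.pdf(x[0], loc=mu[0], scale=sig[0]) * norm.pdf(x[1], loc=mu[1], scale=sig[1])
print(np.isclose(lhs, rhs))  # True
```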

4  Isocontours

Another way to understand a multivariate Gaussian conceptually is to understand the shape of its isocontours. For a function $f : \mathbb{R}^2 \to \mathbb{R}$, an isocontour is a set of the form

$$\{x \in \mathbb{R}^2 : f(x) = c\}$$

for some $c \in \mathbb{R}$. [4]

4.1  Shape of isocontours

What do the isocontours of a multivariate Gaussian look like? As before, let's consider the case where $n = 2$, and $\Sigma$ is diagonal, i.e.,

$$x = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} \qquad \mu = \begin{bmatrix} \mu_1 \\ \mu_2 \end{bmatrix} \qquad \Sigma = \begin{bmatrix} \sigma_1^2 & 0 \\ 0 & \sigma_2^2 \end{bmatrix}$$

As we showed in the last section,

$$p(x; \mu, \Sigma) = \frac{1}{2\pi\sigma_1\sigma_2} \exp\left(-\frac{1}{2\sigma_1^2}(x_1 - \mu_1)^2 - \frac{1}{2\sigma_2^2}(x_2 - \mu_2)^2\right). \tag{4}$$

Now, let's consider the level set consisting of all points where $p(x; \mu, \Sigma) = c$ for some constant $c \in \mathbb{R}$. In particular, consider the set of all $x_1, x_2 \in \mathbb{R}$ such that

$$c = \frac{1}{2\pi\sigma_1\sigma_2} \exp\left(-\frac{1}{2\sigma_1^2}(x_1 - \mu_1)^2 - \frac{1}{2\sigma_2^2}(x_2 - \mu_2)^2\right)$$
$$2\pi c \sigma_1 \sigma_2 = \exp\left(-\frac{1}{2\sigma_1^2}(x_1 - \mu_1)^2 - \frac{1}{2\sigma_2^2}(x_2 - \mu_2)^2\right)$$
$$\log(2\pi c \sigma_1 \sigma_2) = -\frac{1}{2\sigma_1^2}(x_1 - \mu_1)^2 - \frac{1}{2\sigma_2^2}(x_2 - \mu_2)^2$$
$$\log\left(\frac{1}{2\pi c \sigma_1 \sigma_2}\right) = \frac{1}{2\sigma_1^2}(x_1 - \mu_1)^2 + \frac{1}{2\sigma_2^2}(x_2 - \mu_2)^2$$
$$1 = \frac{(x_1 - \mu_1)^2}{2\sigma_1^2 \log\left(\frac{1}{2\pi c \sigma_1 \sigma_2}\right)} + \frac{(x_2 - \mu_2)^2}{2\sigma_2^2 \log\left(\frac{1}{2\pi c \sigma_1 \sigma_2}\right)}.$$

Defining

$$r_1 = \sqrt{2\sigma_1^2 \log\left(\frac{1}{2\pi c \sigma_1 \sigma_2}\right)} \qquad r_2 = \sqrt{2\sigma_2^2 \log\left(\frac{1}{2\pi c \sigma_1 \sigma_2}\right)},$$

it follows that

$$1 = \left(\frac{x_1 - \mu_1}{r_1}\right)^2 + \left(\frac{x_2 - \mu_2}{r_2}\right)^2. \tag{5}$$

Equation (5) should be familiar to you from high school analytic geometry: it is the equation of an axis-aligned ellipse, with center $(\mu_1, \mu_2)$, where the $x_1$ axis has length $2r_1$ and the $x_2$ axis has length $2r_2$!

[4] Isocontours are often also known as level curves. More generally, a level set of a function $f : \mathbb{R}^n \to \mathbb{R}$ is a set of the form $\{x \in \mathbb{R}^n : f(x) = c\}$ for some $c \in \mathbb{R}$.
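To see Equation (5) in action, here is a short sketch (NumPy assumed; the means and standard deviations match Figure 2's left panel, the contour height is an arbitrary choice) that computes $r_1$ and $r_2$ for a chosen contour height $c$ and verifies the density is constant on the resulting ellipse:

```python
# Sketch: parametrize the ellipse of Equation (5) and confirm the density
# takes the same value c everywhere on it. Assumes NumPy.
import numpy as np

mu1, mu2, s1, s2 = 3.0, 2.0, 5.0, 3.0
peak = 1.0 / (2 * np.pi * s1 * s2)               # density at (mu1, mu2)
c = 0.3 * peak                                   # any height 0 < c < peak

r1 = np.sqrt(2 * s1**2 * np.log(1.0 / (2 * np.pi * c * s1 * s2)))
r2 = np.sqrt(2 * s2**2 * np.log(1.0 / (2 * np.pi * c * s1 * s2)))

t = np.linspace(0.0, 2 * np.pi, 100)             # points on the ellipse
x1 = mu1 + r1 * np.cos(t)
x2 = mu2 + r2 * np.sin(t)
p = peak * np.exp(-(x1 - mu1)**2 / (2 * s1**2) - (x2 - mu2)**2 / (2 * s2**2))
print(np.allclose(p, c))  # True: the ellipse is an isocontour at height c
```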

[Figure 2: The figure on the left shows a heatmap indicating values of the density function for an axis-aligned multivariate Gaussian with mean $\mu = \begin{bmatrix} 3 \\ 2 \end{bmatrix}$ and diagonal covariance matrix $\Sigma = \begin{bmatrix} 25 & 0 \\ 0 & 9 \end{bmatrix}$. Notice that the Gaussian is centered at $(3, 2)$, and that the isocontours are all elliptically shaped with major/minor axis lengths in a 5:3 ratio. The figure on the right shows a heatmap indicating values of the density function for a non-axis-aligned multivariate Gaussian with mean $\mu = \begin{bmatrix} 3 \\ 2 \end{bmatrix}$ and covariance matrix $\Sigma = \begin{bmatrix} 10 & 5 \\ 5 & 5 \end{bmatrix}$. Here, the ellipses are again centered at $(3, 2)$, but now the major and minor axes have been rotated via a linear transformation.]

4.2  Length of axes

To get a better understanding of how the shape of the level curves varies as a function of the variances of the multivariate Gaussian distribution, suppose that we are interested in the values of $r_1$ and $r_2$ at which $c$ is equal to a fraction $1/e$ of the peak height of the Gaussian density.

First, observe that the maximum of Equation (4) occurs where $x_1 = \mu_1$ and $x_2 = \mu_2$. Substituting these values into Equation (4), we see that the peak height of the Gaussian density is $\frac{1}{2\pi\sigma_1\sigma_2}$.

Second, we substitute $c = \frac{1}{e} \cdot \frac{1}{2\pi\sigma_1\sigma_2}$ into the equations for $r_1$ and $r_2$ to obtain

$$r_1 = \sqrt{2\sigma_1^2 \log\left(\frac{1}{2\pi\sigma_1\sigma_2 \cdot \frac{1}{e} \frac{1}{2\pi\sigma_1\sigma_2}}\right)} = \sigma_1\sqrt{2}$$
$$r_2 = \sqrt{2\sigma_2^2 \log\left(\frac{1}{2\pi\sigma_1\sigma_2 \cdot \frac{1}{e} \frac{1}{2\pi\sigma_1\sigma_2}}\right)} = \sigma_2\sqrt{2}.$$

From this, it follows that the axis length needed to reach a fraction $1/e$ of the peak height of the Gaussian density in the $i$th dimension grows in proportion to the standard deviation $\sigma_i$. Intuitively, this again makes sense: the smaller the variance of some random variable $x_i$, the more "tightly" peaked the Gaussian distribution in that dimension, and hence the smaller the radius $r_i$.
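The algebra above is easy to confirm numerically. A small sketch (NumPy assumed; the standard deviations are arbitrary): plugging $c = \frac{1}{e} \cdot \text{peak}$ into the formulas for $r_1$ and $r_2$ does indeed return $\sigma_i \sqrt{2}$:

```python
# Sketch: at contour height c = peak/e, the ellipse radii reduce to
# sigma_i * sqrt(2). Assumes NumPy.
import numpy as np

s1, s2 = 5.0, 3.0
peak = 1.0 / (2 * np.pi * s1 * s2)
c = peak / np.e

r1 = np.sqrt(2 * s1**2 * np.log(1.0 / (2 * np.pi * c * s1 * s2)))
r2 = np.sqrt(2 * s2**2 * np.log(1.0 / (2 * np.pi * c * s1 * s2)))
print(np.isclose(r1, s1 * np.sqrt(2)), np.isclose(r2, s2 * np.sqrt(2)))  # True True
```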

4.3  Non-diagonal case, higher dimensions

Clearly, the above derivations rely on the assumption that $\Sigma$ is a diagonal matrix. However, in the non-diagonal case, it turns out that the picture is not all that different. Instead of being axis-aligned ellipses, the isocontours turn out to be simply rotated ellipses. Furthermore, in the $n$-dimensional case, the level sets form geometrical structures known as ellipsoids in $\mathbb{R}^n$.

5  Linear transformation interpretation

In the last few sections, we focused primarily on providing an intuition for how multivariate Gaussians with diagonal covariance matrices behaved. In particular, we found that an $n$-dimensional multivariate Gaussian with diagonal covariance matrix could be viewed simply as a collection of $n$ independent Gaussian-distributed random variables with means and variances $\mu_i$ and $\sigma_i^2$, respectively. In this section, we dig a little deeper and provide a quantitative interpretation of multivariate Gaussians when the covariance matrix is not diagonal.

The key result of this section is the following theorem (see proof in Appendix A.2).

Theorem 1. Let $X \sim \mathcal{N}(\mu, \Sigma)$ for some $\mu \in \mathbb{R}^n$ and $\Sigma \in \mathbb{S}^n_{++}$. Then, there exists a matrix $B \in \mathbb{R}^{n \times n}$ such that if we define $Z = B^{-1}(X - \mu)$, then $Z \sim \mathcal{N}(0, I)$.

To understand the meaning of this theorem, note that if $Z \sim \mathcal{N}(0, I)$, then using the analysis from Section 3, $Z$ can be thought of as a collection of $n$ independent standard normal random variables (i.e., $Z_i \sim \mathcal{N}(0, 1)$). Furthermore, if $Z = B^{-1}(X - \mu)$, then $X = BZ + \mu$ follows from simple algebra.

Consequently, the theorem states that any random variable $X$ with a multivariate Gaussian distribution can be interpreted as the result of applying a linear transformation ($X = BZ + \mu$) to some collection of $n$ independent standard normal random variables ($Z$).
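This interpretation also suggests a way to sample from $\mathcal{N}(\mu, \Sigma)$. The sketch below (NumPy assumed; $\mu$ and $\Sigma$ are taken from Figure 2's right panel) uses a Cholesky factor for $B$; note the theorem only asserts that *some* invertible $B$ with $\Sigma = BB^T$ exists, and Cholesky is one convenient choice:

```python
# Sketch: generate X ~ N(mu, Sigma) by transforming independent standard
# normals, X = B Z + mu, where Sigma = B B^T. Assumes NumPy.
import numpy as np

rng = np.random.default_rng(2)
mu = np.array([3.0, 2.0])
Sigma = np.array([[10.0, 5.0], [5.0, 5.0]])

B = np.linalg.cholesky(Sigma)                   # one valid factor: Sigma = B @ B.T
Z = rng.standard_normal((100000, 2))            # rows are independent N(0, I) draws
X = Z @ B.T + mu                                # row-wise X = B Z + mu

print(X.mean(axis=0))                           # close to mu
print(np.cov(X, rowvar=False))                  # close to Sigma
```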

Appendix A.1

Proof. We prove the first of the two equalities in (1); the proof of the other equality is similar.

$$\Sigma = \begin{bmatrix} \text{Cov}[X_1, X_1] & \cdots & \text{Cov}[X_1, X_n] \\ \vdots & \ddots & \vdots \\ \text{Cov}[X_n, X_1] & \cdots & \text{Cov}[X_n, X_n] \end{bmatrix}$$
$$= \begin{bmatrix} E[(X_1 - \mu_1)^2] & \cdots & E[(X_1 - \mu_1)(X_n - \mu_n)] \\ \vdots & \ddots & \vdots \\ E[(X_n - \mu_n)(X_1 - \mu_1)] & \cdots & E[(X_n - \mu_n)^2] \end{bmatrix}$$
$$= E \begin{bmatrix} (X_1 - \mu_1)^2 & \cdots & (X_1 - \mu_1)(X_n - \mu_n) \\ \vdots & \ddots & \vdots \\ (X_n - \mu_n)(X_1 - \mu_1) & \cdots & (X_n - \mu_n)^2 \end{bmatrix} \tag{6}$$
$$= E \left[ \begin{bmatrix} X_1 - \mu_1 \\ \vdots \\ X_n - \mu_n \end{bmatrix} \begin{bmatrix} X_1 - \mu_1 & \cdots & X_n - \mu_n \end{bmatrix} \right] \tag{7}$$
$$= E\left[(X - \mu)(X - \mu)^T\right].$$

Here, (6) follows from the fact that the expectation of a matrix is simply the matrix found by taking the componentwise expectation of each entry. Also, (7) follows from the fact that for any vector $z \in \mathbb{R}^n$,

$$zz^T = \begin{bmatrix} z_1 \\ z_2 \\ \vdots \\ z_n \end{bmatrix} \begin{bmatrix} z_1 & z_2 & \cdots & z_n \end{bmatrix} = \begin{bmatrix} z_1 z_1 & z_1 z_2 & \cdots & z_1 z_n \\ z_2 z_1 & z_2 z_2 & \cdots & z_2 z_n \\ \vdots & \vdots & \ddots & \vdots \\ z_n z_1 & z_n z_2 & \cdots & z_n z_n \end{bmatrix}.$$
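The outer-product identity used in step (7) is simple to see numerically (a tiny sketch, NumPy assumed, with an arbitrary vector):

```python
# Sketch: z z^T is the matrix of all pairwise products z_i * z_j. Assumes NumPy.
import numpy as np

z = np.array([1.0, 2.0, 3.0])
print(np.outer(z, z))                                        # (i, j) entry: z_i * z_j
print(np.allclose(np.outer(z, z), z[:, None] @ z[None, :]))  # True
```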

Appendix A.2

We restate the theorem below:

Theorem 1. Let $X \sim \mathcal{N}(\mu, \Sigma)$ for some $\mu \in \mathbb{R}^n$ and $\Sigma \in \mathbb{S}^n_{++}$. Then, there exists a matrix $B \in \mathbb{R}^{n \times n}$ such that if we define $Z = B^{-1}(X - \mu)$, then $Z \sim \mathcal{N}(0, I)$.

The derivation of this theorem requires some advanced linear algebra and probability theory and can be skipped for the purposes of this class. Our argument will consist of two parts. First, we will show that the covariance matrix $\Sigma$ can be factorized as $\Sigma = BB^T$ for some invertible matrix $B$. Second, we will perform a "change-of-variable" from $X$ to a different vector-valued random variable $Z$ using the relation $Z = B^{-1}(X - \mu)$.

Step 1: Factorizing the covariance matrix. Recall the following two properties of symmetric matrices from the notes on linear algebra [5]:

1. Any real symmetric matrix $A \in \mathbb{R}^{n \times n}$ can always be represented as $A = U\Lambda U^T$, where $U$ is a full rank orthogonal matrix containing the eigenvectors of $A$ as its columns, and $\Lambda$ is a diagonal matrix containing $A$'s eigenvalues.

2. If $A$ is symmetric positive definite, all its eigenvalues are positive.

Since the covariance matrix $\Sigma$ is positive definite, using the first fact, we can write $\Sigma = U\Lambda U^T$ for some appropriately defined matrices $U$ and $\Lambda$. Using the second fact, we can define $\Lambda^{1/2} \in \mathbb{R}^{n \times n}$ to be the diagonal matrix whose entries are the square roots of the corresponding entries from $\Lambda$. Since $\Lambda = \Lambda^{1/2}(\Lambda^{1/2})^T$, we have

$$\Sigma = U\Lambda U^T = U\Lambda^{1/2}(\Lambda^{1/2})^T U^T = U\Lambda^{1/2}(U\Lambda^{1/2})^T = BB^T,$$

where $B = U\Lambda^{1/2}$. [6] In this case, $\Sigma^{-1} = B^{-T}B^{-1}$, so we can rewrite the standard formula for the density of a multivariate Gaussian as

$$p(x; \mu, \Sigma) = \frac{1}{(2\pi)^{n/2}|BB^T|^{1/2}} \exp\left(-\frac{1}{2}(x - \mu)^T B^{-T} B^{-1} (x - \mu)\right). \tag{8}$$

Step 2: Change of variables. Now, define the vector-valued random variable $Z = B^{-1}(X - \mu)$. A basic formula of probability theory, which we did not introduce in the section notes on probability theory, is the "change-of-variables" formula for relating vector-valued random variables:

Suppose that $X = \begin{bmatrix} X_1 & \cdots & X_n \end{bmatrix}^T \in \mathbb{R}^n$ is a vector-valued random variable with joint density function $f_X : \mathbb{R}^n \to \mathbb{R}$. If $Z = H(X) \in \mathbb{R}^n$ where $H$ is a bijective, differentiable function, then $Z$ has joint density $f_Z : \mathbb{R}^n \to \mathbb{R}$, where

$$f_Z(z) = f_X(x) \cdot \left| \det \begin{bmatrix} \frac{\partial x_1}{\partial z_1} & \cdots & \frac{\partial x_1}{\partial z_n} \\ \vdots & \ddots & \vdots \\ \frac{\partial x_n}{\partial z_1} & \cdots & \frac{\partial x_n}{\partial z_n} \end{bmatrix} \right|.$$

Using the change-of-variables formula, one can show (after some algebra, which we'll skip) that the vector variable $Z$ has the following joint density:

$$p_Z(z) = \frac{1}{(2\pi)^{n/2}} \exp\left(-\frac{1}{2} z^T z\right). \tag{9}$$

The claim follows immediately. $\square$

[5] See section on "Eigenvalues and Eigenvectors of Symmetric Matrices."
[6] To show that $B$ is invertible, it suffices to observe that $U$ is an invertible matrix, and right-multiplying $U$ by a diagonal matrix (with no zero diagonal entries) will rescale its columns but will not change its rank.
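A sketch (NumPy assumed; $\mu$ and $\Sigma$ are arbitrary choices reused from Figure 2) of Step 1's construction: build $B = U\Lambda^{1/2}$ from the eigendecomposition of $\Sigma$, confirm $\Sigma = BB^T$, and check empirically that $Z = B^{-1}(X - \mu)$ has roughly zero mean and identity covariance:

```python
# Sketch: factor Sigma = B B^T with B = U Lambda^{1/2}, then standardize
# samples via Z = B^{-1}(X - mu). Assumes NumPy.
import numpy as np

rng = np.random.default_rng(3)
mu = np.array([3.0, 2.0])
Sigma = np.array([[10.0, 5.0], [5.0, 5.0]])

lam, U = np.linalg.eigh(Sigma)                  # Sigma = U diag(lam) U^T
B = U @ np.diag(np.sqrt(lam))                   # B = U Lambda^{1/2}
print(np.allclose(B @ B.T, Sigma))              # True

X = rng.multivariate_normal(mu, Sigma, size=100000)
Z = (X - mu) @ np.linalg.inv(B).T               # row-wise Z = B^{-1}(X - mu)
print(Z.mean(axis=0))                           # close to 0
print(np.cov(Z, rowvar=False))                  # close to the identity
```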
