ECE531 Lecture 9: Information Inequality And The Cramer-Rao Lower Bound


D. Richard Brown III
Worcester Polytechnic Institute
26-March-2009

Introduction

In this lecture, we continue our study of estimators under the squared error cost function. Under squared error, estimator variance determines performance. We will develop a new procedure for finding MVU estimators:

- Compute a lower bound on the variance.
- Guess at a good unbiased estimator and compare its variance to the lower bound.
- If a given unbiased estimator achieves the lower bound, it must be the MVU estimator.

"Guessing and checking" might sometimes be easier than grinding through the RBLS theorem. A good lower bound also provides a benchmark by which we can compare the performance of different estimators.

We will develop a lower bound on estimator variance that can be applied to both biased and unbiased estimators. In the special case of unbiased estimators, this lower bound simplifies to the famous Cramer-Rao lower bound (CRLB).

Intuition: When Can We Expect Low Variance?

Suppose our parameter space is $\Lambda \subseteq \mathbb{R}$ and the scalar observation densities are $p_Y(y;\theta) \sim U(0,1)$. What can we say about the performance of a good estimator $\hat\theta(y)$ in this case?

Suppose now that the scalar observation densities are $p_Y(y;\theta) \sim U(\theta-\epsilon,\, \theta+\epsilon)$ for some small value of $\epsilon$. What can we say about the performance of a good estimator $\hat\theta(y)$ in this case?

Intuition: When Can We Expect Low Variance?

The minimum achievable variance of an estimator is somehow related to the sensitivity of the density $p_Y(y;\theta)$ to changes in the parameter $\theta$:

- If the density $p_Y(y;\theta)$ is insensitive to the parameter $\theta$, then we can't expect even the MVU estimator to do very well.
- If the density $p_Y(y;\theta)$ is sensitive to changes in the parameter $\theta$, then the achievable performance (minimum variance) should be better.

Our notion of sensitivity: hold $y$ fixed. How "steep" is $p_Y(y;\theta)$ as we vary the parameter $\theta$? This steepness should somehow be averaged over the observations.

Terminology: when we discuss $p_Y(y;\theta)$ with $y$ fixed and $\theta$ as a variable, we call this a "likelihood function". It is not a valid pdf in $\theta$.

Example: Rayleigh Family

$$p_Y(y;\theta) = \frac{y}{\sigma^2}\, e^{-y^2/(2\sigma^2)} \quad \text{with } \theta = \sigma$$

[Figure: surface and family plots of $p_Y(y;\theta)$ versus $y$ and $\theta = \sigma$.]

Calculus Review

Some useful results:

$$\frac{\partial}{\partial\theta}\ln f(\theta) = \frac{\frac{\partial}{\partial\theta} f(\theta)}{f(\theta)}$$

$$\frac{\partial^2}{\partial\theta^2}\ln f(\theta) = \frac{\partial}{\partial\theta}\left[\frac{\frac{\partial}{\partial\theta} f(\theta)}{f(\theta)}\right] = \frac{\frac{\partial^2}{\partial\theta^2} f(\theta)}{f(\theta)} - \left[\frac{\frac{\partial}{\partial\theta} f(\theta)}{f(\theta)}\right]^2$$

A Definition of "Sensitivity" (Scalar Parameter)

We require the likelihood function $p_Y(y;\theta)$ to be differentiable with respect to $\theta$ for each $y \in \mathcal{Y}$. Holding $y$ fixed, the relative steepness of the likelihood function $p_Y(y;\theta)$ (as a function of $\theta$) can be expressed as

$$\psi(y;\theta) := \frac{\frac{\partial}{\partial\theta} p_Y(y;\theta)}{p_Y(y;\theta)} = \frac{\partial}{\partial\theta}\ln p_Y(y;\theta)$$

Averaging the steepness: we compute the mean squared value of $\psi$ as

$$I(\theta) := E_\theta\left[\psi^2(Y;\theta)\right] = E_\theta\left[\left(\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right)^2\right] = \int_{\mathcal{Y}} \left(\frac{\frac{\partial}{\partial\theta} p_Y(y;\theta)}{p_Y(y;\theta)}\right)^2 p_Y(y;\theta)\, dy$$

Terminology: $I(\theta)$ is called the "Fisher information" that the random observation $Y$ can tell us, on average, about the parameter $\theta$. Fisher information $\neq$ mutual information (information theory).
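To make the definition concrete, here is a small numerical check (my own sketch, not from the original slides) that evaluates $I(\theta)$ for the Rayleigh family from the previous slide by direct quadrature; the analytic score $\frac{\partial}{\partial\sigma}\ln p_Y(y;\sigma) = y^2/\sigma^3 - 2/\sigma$ and the reference value $4/\sigma^2$ are computed by hand here, not taken from the lecture:

```python
import numpy as np

# Quadrature sketch (not from the slides): evaluate the Fisher information
# I(sigma) = integral of (d/dsigma ln p)^2 * p dy for the Rayleigh family
# p(y; sigma) = (y / sigma^2) * exp(-y^2 / (2 sigma^2)), with theta = sigma.
sigma = 0.7
y = np.linspace(1e-9, 12 * sigma, 200_000)
dy = y[1] - y[0]
p = (y / sigma**2) * np.exp(-y**2 / (2 * sigma**2))
score = y**2 / sigma**3 - 2 / sigma       # d/dsigma ln p(y; sigma), computed by hand
print(np.sum(score**2 * p) * dy)          # approx 8.16
print(4 / sigma**2)                       # analytic value: 4 / sigma^2
```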

Example: Single Sample of Unknown Parameter in Noise

Suppose we get one sample of an unknown parameter $\theta \in \mathbb{R}$ corrupted by zero-mean additive Gaussian noise, i.e., $Y = \theta + W$ where $W \sim \mathcal{N}(0,\sigma^2)$. The likelihood function is then

$$p_Y(y;\theta) = \frac{1}{\sqrt{2\pi\sigma^2}}\exp\left(-\frac{(y-\theta)^2}{2\sigma^2}\right)$$

The relative slope of $p_Y(y;\theta)$ with respect to $\theta$ can be easily computed:

$$\psi(y;\theta) := \frac{\frac{\partial}{\partial\theta} p_Y(y;\theta)}{p_Y(y;\theta)} = \frac{y-\theta}{\sigma^2}$$

The Fisher information is then

$$I(\theta) = \int_{-\infty}^{\infty} \left(\frac{y-\theta}{\sigma^2}\right)^2 \frac{1}{\sqrt{2\pi\sigma^2}}\exp\left(-\frac{(y-\theta)^2}{2\sigma^2}\right) dy = \frac{1}{\sigma^4}\int_{-\infty}^{\infty} t^2\, \frac{1}{\sqrt{2\pi\sigma^2}}\exp\left(-\frac{t^2}{2\sigma^2}\right) dt = \frac{1}{\sigma^2}$$
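A quick Monte Carlo sanity check of this calculation (an illustrative sketch, not part of the original slides; the values of $\theta$ and $\sigma$ are arbitrary):

```python
import numpy as np

# Monte Carlo check (not from the slides): for Y = theta + W, W ~ N(0, sigma^2),
# the mean squared score E[((Y - theta) / sigma^2)^2] should equal 1 / sigma^2.
rng = np.random.default_rng(0)
theta, sigma = 2.0, 0.5
y = theta + sigma * rng.standard_normal(1_000_000)
score = (y - theta) / sigma**2            # psi(y; theta), the relative slope
print(np.mean(score**2))                  # approx 4.0
print(1 / sigma**2)                       # exact: 4.0
```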

Theorem (The Information Inequality (scalar parameter))

Suppose that $\hat\theta(y)$ is an estimator of the parameter $\theta$ and that we have a family of densities $\{p_Y(y;\theta) : \theta \in \Lambda\}$. If the following conditions hold:

1. $\Lambda$ is an open interval.
2. $\mathcal{Y}_\theta := \{y \in \mathcal{Y} : p_Y(y;\theta) > 0\}$ is the same for all $\theta \in \Lambda$ (all densities in the family share common support in $\mathcal{Y}$).
3. $\frac{\partial}{\partial\theta} p_Y(y;\theta)$ exists and is finite for all $\theta \in \Lambda$ and all $y$ in the common support of $\{p_Y(y;\theta) : \theta \in \Lambda\}$.
4. $\frac{\partial}{\partial\theta}\int_{\mathcal{Y}} h(y)\, p_Y(y;\theta)\, dy$ exists and equals $\int_{\mathcal{Y}} h(y)\, \frac{\partial}{\partial\theta} p_Y(y;\theta)\, dy$ for all $\theta \in \Lambda$, for $h(y) = \hat\theta(y)$ and $h(y) \equiv 1$,

then

$$\mathrm{var}_\theta\left[\hat\theta(Y)\right] \geq \frac{\left[\frac{\partial}{\partial\theta} E_\theta\{\hat\theta(Y)\}\right]^2}{I(\theta)}$$

Example (continued)

Let's return to our example where we get a scalar observation of an unknown parameter $\theta \in \mathbb{R}$ in zero-mean Gaussian noise:

$$p_Y(y;\theta) = \frac{1}{\sqrt{2\pi\sigma^2}}\exp\left(-\frac{(y-\theta)^2}{2\sigma^2}\right)$$

We've already computed the Fisher information: $I(\theta) = \frac{1}{\sigma^2}$.

Suppose we restrict our attention to unbiased estimators. What can we say about $\frac{\partial}{\partial\theta} E_\theta\{\hat\theta(Y)\}$? Since all of the regularity conditions are satisfied (check this!), we can say $\mathrm{var}_\theta[\hat\theta(Y)] \geq \sigma^2$. The estimator $\hat\theta(y) = y$ is unbiased and it is easy to show that it achieves this minimum variance bound. Hence $\hat\theta(y) = y$ is MVU. No need to use RBLS.

The Information Inequality (Scalar Parameter)

A key claim in the proof of the information inequality:

$$\frac{\partial}{\partial\theta} E_\theta\{\hat\theta(Y)\} = \int_{\mathcal{Y}} \left[\hat\theta(y) - E_\theta\{\hat\theta(Y)\}\right] \frac{\partial}{\partial\theta} p_Y(y;\theta)\, dy.$$

To show this:

$$\int_{\mathcal{Y}} \left[\hat\theta(y) - E_\theta\{\hat\theta(Y)\}\right] \frac{\partial}{\partial\theta} p_Y(y;\theta)\, dy = \int_{\mathcal{Y}} \hat\theta(y)\, \frac{\partial}{\partial\theta} p_Y(y;\theta)\, dy - E_\theta\{\hat\theta(Y)\} \int_{\mathcal{Y}} \frac{\partial}{\partial\theta} p_Y(y;\theta)\, dy$$

which, by condition 4, equals

$$\frac{\partial}{\partial\theta}\int_{\mathcal{Y}} \hat\theta(y)\, p_Y(y;\theta)\, dy - E_\theta\{\hat\theta(Y)\}\, \frac{\partial}{\partial\theta}\int_{\mathcal{Y}} p_Y(y;\theta)\, dy = \frac{\partial}{\partial\theta} E_\theta\{\hat\theta(Y)\} - 0.$$

The Information Inequality (Scalar Parameter)

Taking the claim as a given now, we can write:

$$\left[\frac{\partial}{\partial\theta} E_\theta\{\hat\theta(Y)\}\right]^2 = \left[\int_{\mathcal{Y}} \left[\hat\theta(y) - E_\theta\{\hat\theta(Y)\}\right] \frac{\partial}{\partial\theta} p_Y(y;\theta)\, dy\right]^2$$
$$= \left[\int_{\mathcal{Y}} \left[\hat\theta(y) - E_\theta\{\hat\theta(Y)\}\right] \frac{\frac{\partial}{\partial\theta} p_Y(y;\theta)}{p_Y(y;\theta)}\, p_Y(y;\theta)\, dy\right]^2$$
$$= \left[\int_{\mathcal{Y}} \left[\hat\theta(y) - E_\theta\{\hat\theta(Y)\}\right] \left[\frac{\partial}{\partial\theta}\ln p_Y(y;\theta)\right] p_Y(y;\theta)\, dy\right]^2$$
$$= \left[E_\theta\left\{\left[\hat\theta(Y) - E_\theta\{\hat\theta(Y)\}\right] \frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right\}\right]^2$$
$$\overset{\text{Schwarz}}{\leq} E_\theta\left\{\left[\hat\theta(Y) - E_\theta\{\hat\theta(Y)\}\right]^2\right\}\, E_\theta\left\{\left[\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right]^2\right\} = \mathrm{var}_\theta\left[\hat\theta(Y)\right] \cdot I(\theta)$$

which is the result we wanted. Hence,

$$\mathrm{var}_\theta\left[\hat\theta(Y)\right] \geq \frac{\left[\frac{\partial}{\partial\theta} E_\theta\{\hat\theta(Y)\}\right]^2}{I(\theta)}.$$

Remarks

A key consequence of the regularity conditions that we used in our derivation of the bound is that

$$E_\theta\left[\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right] = \int_{\mathcal{Y}} \frac{\partial}{\partial\theta} p_Y(y;\theta)\, dy = 0 \quad \text{for all } \theta \in \Lambda$$

Suppose our observations were $Y \sim U(0,\theta)$ for $\theta > 0$. Obviously, this fails to satisfy our regularity conditions, e.g., the densities $p_Y(y;\theta)$ lack common support for all $\theta \in \Lambda$. But the real problem is that

$$E_\theta\left[\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right] = \int_0^\theta \left(\frac{\partial}{\partial\theta}\ln\frac{1}{\theta}\right)\frac{1}{\theta}\, dy = \int_0^\theta -\frac{1}{\theta^2}\, dy = -\frac{1}{\theta} \neq 0$$

When $E_\theta\left[\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right] \neq 0$, the whole derivation of the information inequality breaks down. Checking the regularity conditions is important.
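A small numerical illustration of the difference (my own sketch, not from the slides; parameter values are arbitrary):

```python
import numpy as np

# Sketch (not from the slides): the mean score is ~0 for the regular Gaussian
# family but not for Y ~ U(0, theta), where the support depends on theta.
rng = np.random.default_rng(1)
theta, sigma = 2.0, 0.5

# Gaussian: the score (y - theta)/sigma^2 averages to ~0 over the observations.
y = theta + sigma * rng.standard_normal(1_000_000)
print(np.mean((y - theta) / sigma**2))    # approx 0

# Uniform: ln p = -ln(theta) on (0, theta), so the score is -1/theta everywhere
# on the support, and its mean is -1/theta, not 0.
print(-1 / theta)                         # -0.5
```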

$Y \sim U(0,\theta)$ for $\theta > 0$

[Figure: the family of densities $p_Y(y;\theta)$ for $Y \sim U(0,\theta)$ plotted versus $y$ and $\theta$; the support $(0,\theta)$ depends on $\theta$.]

The Information Inequality (Scalar Parameter)

Lemma. If, in addition to conditions 1-4, we also have

5. $\frac{\partial^2}{\partial\theta^2} p_Y(y;\theta)$ exists for all $\theta \in \Lambda$ and $y$ in the common support of $p_Y(y;\theta)$, and $\frac{\partial^2}{\partial\theta^2}\int_{\mathcal{Y}} p_Y(y;\theta)\, dy = \int_{\mathcal{Y}} \frac{\partial^2}{\partial\theta^2} p_Y(y;\theta)\, dy$,

then $I(\theta) = -E_\theta\left\{\frac{\partial^2}{\partial\theta^2}\ln p_Y(Y;\theta)\right\}$.

Proof (Lehmann TPE 1998). We can use our calculus result derived earlier to write

$$\frac{\partial^2}{\partial\theta^2}\ln p_Y(y;\theta) = \frac{\frac{\partial^2}{\partial\theta^2} p_Y(y;\theta)}{p_Y(y;\theta)} - \left[\frac{\frac{\partial}{\partial\theta} p_Y(y;\theta)}{p_Y(y;\theta)}\right]^2.$$

The result follows by taking the expectation of both sides and applying condition 5.

Unbiased Estimators: The Cramer-Rao Lower Bound

For the particular case when the estimator $\hat\theta(y)$ is unbiased, we know that $E_\theta\{\hat\theta(Y)\} = \theta$ and, consequently, $\frac{\partial}{\partial\theta} E_\theta\{\hat\theta(Y)\} = 1$. Hence,

$$\mathrm{var}_\theta\left[\hat\theta(Y)\right] \geq \frac{1}{I(\theta)}.$$

This result is known as the Cramer-Rao lower bound (originally described by Fisher in 1922 but not well known until Rao and Cramer worked on it in 1945 and 1946, respectively).

Example: Estimating a Constant in White Gaussian Noise

Suppose we have $n$ observations given by $Y_k = \theta + W_k$, $k = 0,\ldots,n-1$, where $W_k \overset{\text{i.i.d.}}{\sim} \mathcal{N}(0,\sigma^2)$. The unknown parameter $\theta$ can take on any value on the real line and we have no prior pdf. We know from our study of the RBLS theorem that the sample mean estimator $\hat\theta(y) = \bar y = \frac{1}{n}\sum_{k=0}^{n-1} y_k$ is MVU. Let's compute the CRLB to see if it also attains the minimum variance bound. We have

$$p_Y(y;\theta) = \frac{1}{(2\pi\sigma^2)^{n/2}}\exp\left(-\frac{\sum_{k=0}^{n-1}(y_k-\theta)^2}{2\sigma^2}\right)$$

It is not difficult to compute

$$\frac{\partial}{\partial\theta}\ln p_Y(y;\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}(y_k-\theta) = \frac{n}{\sigma^2}(\bar y - \theta)$$

Example: Estimating a Constant in White Gaussian Noise

Since condition 5 holds, we can take another derivative to get:

$$\frac{\partial^2}{\partial\theta^2}\ln p_Y(y;\theta) = -\frac{n}{\sigma^2}$$

Hence

$$I(\theta) = -E_\theta\left\{\frac{\partial^2}{\partial\theta^2}\ln p_Y(Y;\theta)\right\} = \frac{n}{\sigma^2}$$

and we can say that

$$\mathrm{var}_\theta\left[\hat\theta(Y)\right] \geq \frac{\sigma^2}{n}$$

for any unbiased estimator. Note that the lower bound is equal to the variance of our MVU estimator $\hat\theta(y) = \bar y$.
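A short simulation (illustrative sketch; the specific values of $\theta$, $\sigma$, and $n$ are arbitrary choices, not from the slides) confirming that the sample mean's variance sits right at the CRLB:

```python
import numpy as np

# Simulation sketch: the variance of the sample-mean estimator across many
# trials should match the CRLB sigma^2 / n.
rng = np.random.default_rng(2)
theta, sigma, n, trials = 1.0, 2.0, 25, 200_000
y = theta + sigma * rng.standard_normal((trials, n))
theta_hat = y.mean(axis=1)                # sample mean, one estimate per trial
print(theta_hat.var())                    # approx 0.16
print(sigma**2 / n)                       # CRLB: 0.16
```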

Remarks

If an unbiased estimator attains the CRLB, it must be MVU. The converse is not always true. In other words, not all MVU estimators attain the CRLB. An estimator that is unbiased and attains the CRLB is said to be efficient.

When we had one observation, the information was $I(\theta) = \frac{1}{\sigma^2}$. When we had $n$ observations, the information became $I(\theta) = \frac{n}{\sigma^2}$. This additive information property is only true when the observations are independent.

Additive Information from Independent Observations

Lemma. If $X$ and $Y$ are independent random variables satisfying all of the regularity conditions, with densities $p_X(x;\theta)$ and $p_Y(y;\theta)$ parameterized by $\theta$, then $I(\theta) = I_X(\theta) + I_Y(\theta)$, where $I_X(\theta)$, $I_Y(\theta)$, and $I(\theta)$ are the information about $\theta$ contained in $X$, $Y$, and $\{X,Y\}$, respectively.

Corollary. If $X_0,\ldots,X_{n-1}$ are i.i.d., satisfying all of the regularity conditions, and each has information $I(\theta)$ about $\theta$, then the information in $\{X_0,\ldots,X_{n-1}\}$ about $\theta$ is $nI(\theta)$.

Additive Information from Independent Observations

Proof of Lemma. Since $X$ and $Y$ are independent, their joint pdf can be written as a product of the marginals. We can then write

$$I(\theta) = E_\theta\left[\left(\frac{\partial}{\partial\theta}\ln p_X(X;\theta) + \frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right)^2\right] = I_X(\theta) + I_Y(\theta) + 2\, E_\theta\left[\frac{\partial}{\partial\theta}\ln p_X(X;\theta)\right] E_\theta\left[\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right]$$

The term

$$E_\theta\left[\frac{\partial}{\partial\theta}\ln p_X(X;\theta)\right] = \int_{\mathcal{X}} \frac{\frac{\partial}{\partial\theta} p_X(x;\theta)}{p_X(x;\theta)}\, p_X(x;\theta)\, dx = \int_{\mathcal{X}} \frac{\partial}{\partial\theta} p_X(x;\theta)\, dx = 0$$

and the desired result follows immediately.

CRLB for Signals in Zero-Mean White Gaussian Noise

We assume the general system model $Y_k = s_k(\theta) + W_k$ for $k = 0,1,\ldots,n-1$, where $s_k(\theta)$ is a deterministic signal with an unknown real-valued non-random scalar parameter $\theta$ and where $W_k \overset{\text{i.i.d.}}{\sim} \mathcal{N}(0,\sigma^2)$. We only assume that $s_k(\theta)$ doesn't violate any of our regularity conditions. To compute the Fisher information, we can differentiate twice to get:

$$\frac{\partial^2}{\partial\theta^2}\ln p_Y(y;\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left\{[y_k - s_k(\theta)]\,\frac{\partial^2}{\partial\theta^2} s_k(\theta) - \left(\frac{\partial}{\partial\theta} s_k(\theta)\right)^2\right\}$$

We then take the expected value (over the observations) to get

$$I(\theta) = -E_\theta\left\{\frac{\partial^2}{\partial\theta^2}\ln p_Y(Y;\theta)\right\} = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left(\frac{\partial}{\partial\theta} s_k(\theta)\right)^2$$

and the CRLB follows immediately as $1/I(\theta)$. Note the additive information.

Example: Sinusoidal Frequency Estimation in AWGN

Consider the case where $Y_k = a\cos(\theta k + \phi) + W_k$ for $k = 0,1,\ldots,n-1$, where $a$ and $\phi$ are known, $\theta \in (0,\pi)$, and $W_k \overset{\text{i.i.d.}}{\sim} \mathcal{N}(0,\sigma^2)$. You can confirm that the regularity conditions 1-5 are all satisfied here. To compute the CRLB, we can apply our general result for signals in zero-mean AWGN:

$$\mathrm{var}_\theta\left[\hat\theta(Y)\right] \geq \frac{\sigma^2}{\sum_{k=0}^{n-1}\left(\frac{\partial}{\partial\theta} s_k(\theta)\right)^2} = \frac{\sigma^2}{a^2\sum_{k=0}^{n-1}\left(k\sin(\theta k + \phi)\right)^2}$$

Example: Sinusoidal Frequency Estimation in AWGN

[Figure: CRLB versus $\theta$ (frequency) over $(0,\pi)$ for $n = 10$, $\sigma^2/a^2 = 1$, and $\phi = 0$.]
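The curve in the figure can be regenerated with a few lines (a sketch using the slide's parameters $n = 10$, $\sigma^2/a^2 = 1$, $\phi = 0$; the frequency grid is my own choice):

```python
import numpy as np

# Evaluate the frequency-estimation CRLB sigma^2 / (a^2 * sum_k (k sin(theta k + phi))^2)
# on a grid of frequencies theta in (0, pi).
n, sigma2_over_a2, phi = 10, 1.0, 0.0
k = np.arange(n)
thetas = np.linspace(0.05, np.pi - 0.05, 400)
crlb = np.array([sigma2_over_a2 / np.sum((k * np.sin(t * k + phi))**2)
                 for t in thetas])
print(crlb.min(), crlb.max())             # the bound varies strongly with theta
```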

Attainability of the Information Bound

Recall that the only inequality used to derive the information inequality was the Cauchy-Schwarz inequality:

$$\left[\int_a^b f_1(y;\theta)\, f_2(y;\theta)\, dy\right]^2 \leq \int_a^b f_1^2(y;\theta)\, dy \cdot \int_a^b f_2^2(y;\theta)\, dy$$

Under what conditions does this inequality become an equality? If and only if $f_2(y;\theta) = k(\theta) f_1(y;\theta)$ on $y \in (a,b)$. In our problem, we have

$$f_1(y;\theta) = \hat\theta(y) - E_\theta\{\hat\theta(Y)\}, \qquad f_2(y;\theta) = \frac{\partial}{\partial\theta}\ln p_Y(y;\theta)$$

hence, an estimator $\hat\theta(y)$ has variance equal to the information lower bound for all $\theta \in \Lambda$ if and only if

$$\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta) = k(\theta)\left[\hat\theta(Y) - E_\theta\{\hat\theta(Y)\}\right]$$

almost surely for some $k(\theta)$.

Attainability of the Information Bound

To attain the information bound, we require

$$\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta) = k(\theta)\left[\hat\theta(Y) - E_\theta\{\hat\theta(Y)\}\right]$$

almost surely for some $k(\theta)$. We can "undo" the derivative and the logarithm to write

$$p_Y(y;\theta) = h(y)\exp\left\{\int_a^\theta k(t)\left[\hat\theta(y) - f(t)\right] dt\right\} = \underbrace{h(y)\, C(\theta)\exp\{g(\theta)\, T(y)\}}_{\text{one-parameter exponential family}}$$

for all $y \in \mathcal{Y}$. Note that $f(t) := E_t\{\hat\theta(Y)\}$ and $h(y)$ does not depend on $\theta$.

Remarks:

- The information lower bound is achieved by $\hat\theta(y)$ if and only if $\hat\theta(y) = T(y)$ in a one-parameter exponential family (the estimator is the sufficient statistic). See example IV.C.4 in the Poor textbook.
- It can also be shown that $k(\theta)$ here must be equal to $\frac{I(\theta)}{\frac{\partial}{\partial\theta} E_\theta[\hat\theta(Y)]}$.

Attainability of the Information Bound: Unbiased Case

When $\hat\theta(y)$ is unbiased, $E_\theta\{\hat\theta(Y)\} = \theta$. Hence, the necessary and sufficient attainability condition can be written as

$$\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta) = k(\theta)\left[\hat\theta(Y) - \theta\right]$$

almost surely for some $k(\theta)$. Squaring both sides and taking the expectation, we can write

$$E_\theta\left[\left(\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right)^2\right] = k^2(\theta)\, E_\theta\left[\left(\hat\theta(Y) - \theta\right)^2\right] \quad\Longrightarrow\quad I(\theta) = k^2(\theta)\,\frac{1}{I(\theta)}$$

hence $k(\theta) = \pm I(\theta)$. The negative option can be eliminated. The necessary and sufficient attainability condition becomes

$$\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta) = I(\theta)\left[\hat\theta(Y) - \theta\right]$$
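As a concrete check (this ties the condition back to the constant-in-WGN example from earlier in this lecture; the connection is drawn here, not on the original slide), the sample mean satisfies the attainability condition exactly:

$$\frac{\partial}{\partial\theta}\ln p_Y(y;\theta) = \frac{n}{\sigma^2}(\bar y - \theta) = I(\theta)\left[\hat\theta(y) - \theta\right] \quad \text{with } \hat\theta(y) = \bar y \text{ and } I(\theta) = \frac{n}{\sigma^2},$$

so $\hat\theta(y) = \bar y$ is efficient, consistent with the CRLB computation on the earlier slide.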

Multiparameter Estimation Problems

In many problems, we have more than one parameter that we would like to estimate. For example,

$$Y_k = a\cos(\omega k + \phi) + W_k \quad \text{for } k = 0,1,\ldots,n-1$$

where $a > 0$, $\phi \in (-\pi,\pi)$, and $\omega \in (0,\pi)$ are all non-random parameters and $W_k \overset{\text{i.i.d.}}{\sim} \mathcal{N}(0,\sigma^2)$. In this problem $\theta = [a, \phi, \omega]^\top$.

[Figure: one realization of a noisy sinusoid $Y_k$ plotted versus $k = 0,\ldots,20$.]

Fisher Information Matrix

Recall, in the scalar parameter case, the Fisher information was motivated by a computation of the mean squared relative slope of the likelihood function:

$$I(\theta) := E_\theta\left[\left(\frac{\partial}{\partial\theta}\ln p_Y(Y;\theta)\right)^2\right] = \int_{\mathcal{Y}}\left(\frac{\frac{\partial}{\partial\theta} p_Y(y;\theta)}{p_Y(y;\theta)}\right)^2 p_Y(y;\theta)\, dy$$

In multiparameter problems, we are now concerned with the relative slope of the likelihood function with respect to each of the parameters. A natural choice (assuming that all of the required derivatives exist) would be

$$I(\theta) = E_\theta\left[\left(\nabla_\theta \ln p_Y(Y;\theta)\right)\left(\nabla_\theta \ln p_Y(Y;\theta)\right)^\top\right] \in \mathbb{R}^{m \times m}$$

where $\nabla_x$ is the gradient operator defined as $\nabla_x f(x) := \left[\frac{\partial}{\partial x_0} f(x), \ldots, \frac{\partial}{\partial x_{m-1}} f(x)\right]^\top$.

Fisher Information Matrix

Let $p := p_Y(Y;\theta)$. The Fisher information matrix is then

$$I(\theta) = \begin{bmatrix} E_\theta\left[\frac{\partial}{\partial\theta_0}\ln p \cdot \frac{\partial}{\partial\theta_0}\ln p\right] & \cdots & E_\theta\left[\frac{\partial}{\partial\theta_0}\ln p \cdot \frac{\partial}{\partial\theta_{m-1}}\ln p\right] \\ \vdots & \ddots & \vdots \\ E_\theta\left[\frac{\partial}{\partial\theta_{m-1}}\ln p \cdot \frac{\partial}{\partial\theta_0}\ln p\right] & \cdots & E_\theta\left[\frac{\partial}{\partial\theta_{m-1}}\ln p \cdot \frac{\partial}{\partial\theta_{m-1}}\ln p\right] \end{bmatrix}$$

Note that the $ij$th element of the Fisher information matrix is given as

$$I_{ij}(\theta) = E_\theta\left[\frac{\partial}{\partial\theta_i}\ln p_Y(Y;\theta) \cdot \frac{\partial}{\partial\theta_j}\ln p_Y(Y;\theta)\right]$$

hence we can say that $I(\theta)$ is symmetric.

Fisher Information Matrix

Under our regularity conditions,

$$E_\theta\left[\frac{\partial}{\partial\theta_i}\ln p_Y(Y;\theta)\right] = \int_{\mathcal{Y}} \frac{\frac{\partial}{\partial\theta_i} p_Y(y;\theta)}{p_Y(y;\theta)}\, p_Y(y;\theta)\, dy = \int_{\mathcal{Y}} \frac{\partial}{\partial\theta_i} p_Y(y;\theta)\, dy = 0$$

Hence

$$I_{ij}(\theta) = \mathrm{cov}_\theta\left(\frac{\partial}{\partial\theta_i}\ln p_Y(Y;\theta),\, \frac{\partial}{\partial\theta_j}\ln p_Y(Y;\theta)\right).$$

Since $I(\theta)$ is a covariance matrix, $I(\theta)$ is positive semidefinite. The information inequality and CRLB for scalar parameters required us to compute $\frac{1}{I(\theta)}$. We can expect that we might need to compute $I^{-1}(\theta)$ when we have a vector parameter. For $I(\theta)$ to be invertible, we need $I(\theta)$ to be positive definite.

Covariance Matrices and Positive Definiteness

Suppose $X \in \mathbb{R}^m$ is a zero-mean random vector. A covariance matrix $\mathrm{cov}(X,X) = E[XX^\top]$ fails to be positive definite only if one or more random variables $X_i$ can be written as linear combinations of the other random variables. The random variables (parameterized by $\theta$) in the Fisher information matrix are

$$X_i(\theta) := \frac{\partial}{\partial\theta_i}\ln p_Y(Y;\theta), \quad i = 0,\ldots,m-1.$$

What can we do if $\{X_i(\theta)\}_{i=0}^{m-1}$ are linearly dependent? Linear dependence implies that one or more of the $X_i$ are extraneous. We can excise the extraneous random variables to form a smaller set of linearly independent variables $\{X_i'(\theta)\}_{i=0}^{p-1}$ with $p < m$ such that $\mathrm{cov}(X', X')$ is positive definite.

Fisher Information Matrix

When the second derivatives all exist, we can write

$$\frac{\partial^2}{\partial\theta_i\,\partial\theta_j}\ln p_Y(y;\theta) = \frac{\frac{\partial^2}{\partial\theta_i\,\partial\theta_j} p_Y(y;\theta)}{p_Y(y;\theta)} - \frac{\frac{\partial}{\partial\theta_i} p_Y(y;\theta)}{p_Y(y;\theta)} \cdot \frac{\frac{\partial}{\partial\theta_j} p_Y(y;\theta)}{p_Y(y;\theta)}$$

and, under the regularity assumptions, we can write

$$E_\theta\left[\frac{\partial^2}{\partial\theta_i\,\partial\theta_j}\ln p_Y(Y;\theta)\right] = -E_\theta\left[\frac{\partial}{\partial\theta_i}\ln p_Y(Y;\theta) \cdot \frac{\partial}{\partial\theta_j}\ln p_Y(Y;\theta)\right] = -I_{ij}(\theta).$$

Hence, we can say that

$$I_{ij}(\theta) = -E_\theta\left[\frac{\partial^2}{\partial\theta_i\,\partial\theta_j}\ln p_Y(Y;\theta)\right]$$

This expression is often more convenient to compute than the former expression for $I_{ij}(\theta)$.

Example: Fisher Information Matrix of Signal in AWGN

We assume the general system model $Y_k = s_k(\theta) + W_k$ for $k = 0,1,\ldots,n-1$, where $s_k(\theta) : \Lambda \mapsto \mathbb{R}$ is a deterministic signal with an unknown vector parameter $\theta$ and where $W_k \overset{\text{i.i.d.}}{\sim} \mathcal{N}(0,\sigma^2)$. We assume $\sigma^2$ is not an unknown parameter and that all of the regularity conditions are satisfied. To compute the Fisher information matrix, we can write

$$\frac{\partial^2}{\partial\theta_i\,\partial\theta_j}\ln p_Y(Y;\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left\{[Y_k - s_k(\theta)]\,\frac{\partial^2}{\partial\theta_i\,\partial\theta_j} s_k(\theta) - \left(\frac{\partial}{\partial\theta_i} s_k(\theta)\right)\left(\frac{\partial}{\partial\theta_j} s_k(\theta)\right)\right\}$$

Since $E_\theta[Y_k] = s_k(\theta)$, the $ij$th element of the FIM can be written as

$$I_{ij}(\theta) = -E_\theta\left[\frac{\partial^2}{\partial\theta_i\,\partial\theta_j}\ln p_Y(Y;\theta)\right] = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\frac{\partial}{\partial\theta_i} s_k(\theta)\,\frac{\partial}{\partial\theta_j} s_k(\theta)$$
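A numerical sketch of this formula (the helper function, its name, and the finite-difference Jacobian are my own illustrative choices, not from the slides): with $J_{ki} = \frac{\partial s_k}{\partial\theta_i}$, the FIM is $I(\theta) = \frac{1}{\sigma^2} J^\top J$.

```python
import numpy as np

# With J[k, i] = d s_k / d theta_i, the FIM for a signal in white Gaussian
# noise is I(theta) = (1/sigma^2) * J^T J. J is approximated here by central
# finite differences (an illustrative choice).
def fisher_matrix(s, theta, sigma2, h=1e-6):
    """s: callable mapping a parameter vector to the length-n signal s_k(theta)."""
    theta = np.asarray(theta, dtype=float)
    n, m = len(s(theta)), len(theta)
    J = np.empty((n, m))
    for i in range(m):
        e = np.zeros(m)
        e[i] = h
        J[:, i] = (s(theta + e) - s(theta - e)) / (2 * h)   # central difference
    return J.T @ J / sigma2

# Example: s_k = a cos(omega k + phi) with theta = [a, phi] and omega known.
k = np.arange(10)
omega = 1.0
s = lambda th: th[0] * np.cos(omega * k + th[1])
print(fisher_matrix(s, [1.0, 0.0], sigma2=1.0))
```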

Example: Fisher Information for Amplitude and Phase

Consider the case where $Y_k = a\cos(\omega k + \phi) + W_k$ for $k = 0,1,\ldots,n-1$, where $\omega$ is known, $a > 0$ and $\phi \in (-\pi,\pi)$ are unknown, and $W_k \overset{\text{i.i.d.}}{\sim} \mathcal{N}(0,\sigma^2)$ with $\sigma^2$ known. Let $\theta = [a, \phi]^\top$ and compute the Fisher information matrix:

$$I_{00}(\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left(\frac{\partial}{\partial a} s_k(\theta)\right)^2 = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\cos^2(\omega k + \phi) = \frac{n}{2\sigma^2} + \frac{1}{2\sigma^2}\sum_{k=0}^{n-1}\cos(2(\omega k + \phi)) \approx \frac{n}{2\sigma^2}$$

$$I_{11}(\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left(\frac{\partial}{\partial\phi} s_k(\theta)\right)^2 = \frac{1}{\sigma^2}\sum_{k=0}^{n-1} a^2\sin^2(\omega k + \phi) \approx \frac{na^2}{2\sigma^2}$$

$$I_{01}(\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left(\frac{\partial}{\partial a} s_k(\theta)\right)\left(\frac{\partial}{\partial\phi} s_k(\theta)\right) = -\frac{1}{\sigma^2}\sum_{k=0}^{n-1} a\sin(\omega k + \phi)\cos(\omega k + \phi) \approx 0$$

Example: Fisher Information for Amplitude and Phase

Remarks: The Fisher information matrix in this example is

$$I(\theta) \approx \frac{n}{2\sigma^2}\begin{bmatrix} 1 & 0 \\ 0 & a^2 \end{bmatrix}$$

Clearly $I(\theta)$ is positive definite when $a \neq 0$. Note that, since the observations are i.i.d., $I(\theta)$ satisfies the additive information property (as expected). We got lucky that the off-diagonal terms are (at least approximately) equal to zero here; the matrix inverse is easy to compute. This will not be true in general.

Information Inequality for Multiparameter Estimation

Under the multiparameter regularity conditions (see Lehmann TPE 1998 p. 127) and also assuming $I(\theta)$ is positive definite, we can say that

$$\mathrm{cov}_\theta\left[\hat\theta(Y)\right] \succeq \beta^\top(\theta)\, I^{-1}(\theta)\, \beta(\theta)$$

where the matrix inequality $A \succeq B$ means that $A - B$ is positive semidefinite and

$$\beta^\top(\theta) := \left[\frac{\partial}{\partial\theta_0} E_\theta\left[\hat\theta(Y)\right], \ldots, \frac{\partial}{\partial\theta_{m-1}} E_\theta\left[\hat\theta(Y)\right]\right].$$

Note that $\beta(\theta) \in \mathbb{R}^{m \times m}$ since $E_\theta[\hat\theta(Y)] \in \mathbb{R}^m$. What is $\beta^\top(\theta)$ for an unbiased estimator? It should be clear that this is equivalent to our scalar parameter result when we have only one parameter.

Multiparameter Cramer-Rao Lower Bound

If we constrain our attention to unbiased estimators, the multiparameter Cramer-Rao lower bound (CRLB) can be simply expressed as

$$\mathrm{cov}_\theta\left[\hat\theta(Y)\right] \succeq I^{-1}(\theta)$$

since $\beta^\top(\theta) := \left[\frac{\partial}{\partial\theta_0} E_\theta[\hat\theta(Y)], \ldots, \frac{\partial}{\partial\theta_{m-1}} E_\theta[\hat\theta(Y)]\right]$ in the information inequality is just the $m \times m$ identity matrix when the estimator is unbiased.

Example: Amplitude and Phase Information Bound

We can compute the inverse of the Fisher information matrix easily:

$$I^{-1}(\theta) = \frac{2\sigma^2}{n}\begin{bmatrix} 1 & 0 \\ 0 & a^{-2} \end{bmatrix}$$

If we assume an unbiased estimator, then it is easy to show that

$$\beta^\top(\theta) = \left[\frac{\partial}{\partial a} E_\theta\left[\hat\theta(Y)\right],\; \frac{\partial}{\partial\phi} E_\theta\left[\hat\theta(Y)\right]\right] = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}$$

and the information inequality (the CRLB in this case) is simply

$$\mathrm{cov}_\theta\left[\hat\theta(Y)\right] \succeq \frac{2\sigma^2}{n}\begin{bmatrix} 1 & 0 \\ 0 & a^{-2} \end{bmatrix}$$

The diagonal elements of $\mathrm{cov}_\theta[\hat\theta(Y)]$ reveal the minimum variance for each parameter: $\mathrm{var}_a[\hat a(Y)] \geq \frac{2\sigma^2}{n}$ and $\mathrm{var}_\phi[\hat\phi(Y)] \geq \frac{2\sigma^2}{na^2}$.
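A numerical check of these closed forms (the values of $n$, $a$, $\omega$, $\phi$, and $\sigma^2$ are my own arbitrary choices, not from the slides):

```python
import numpy as np

# The inverse of the exact 2x2 FIM should have diagonal close to the
# approximate closed forms [2 sigma^2 / n, 2 sigma^2 / (n a^2)].
n, a, omega, phi, sigma2 = 10, 2.0, 1.0, 0.3, 1.0
k = np.arange(n)
J = np.column_stack([np.cos(omega * k + phi),             # ds/da
                     -a * np.sin(omega * k + phi)])       # ds/dphi
I = J.T @ J / sigma2                                      # exact 2x2 FIM
print(np.linalg.inv(I).diagonal())                        # approx [0.2, 0.05]
print(2 * sigma2 / n, 2 * sigma2 / (n * a**2))            # 0.2, 0.05
```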

Example: Unknown Amplitude, Phase, and Frequency

Let $\theta = [a, \phi, \omega]^\top$. We can compute

$$I_{02}(\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left(\frac{\partial}{\partial a} s_k(\theta)\right)\left(\frac{\partial}{\partial\omega} s_k(\theta)\right) = -\frac{1}{\sigma^2}\sum_{k=0}^{n-1} a k\cos(\omega k + \phi)\sin(\omega k + \phi) \approx 0$$

$$I_{12}(\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left(\frac{\partial}{\partial\phi} s_k(\theta)\right)\left(\frac{\partial}{\partial\omega} s_k(\theta)\right) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1} a^2 k\sin^2(\omega k + \phi) \approx \frac{a^2}{2\sigma^2}\cdot\frac{(n-1)n}{2}$$

$$I_{22}(\theta) = \frac{1}{\sigma^2}\sum_{k=0}^{n-1}\left(\frac{\partial}{\partial\omega} s_k(\theta)\right)^2 = \frac{1}{\sigma^2}\sum_{k=0}^{n-1} a^2 k^2\sin^2(\omega k + \phi) \approx \frac{a^2}{2\sigma^2}\cdot\frac{n(n-1)(2n-1)}{6}$$

where we have used the identities $\sum_{k=0}^{n-1} k = \frac{n(n-1)}{2}$ and $\sum_{k=0}^{n-1} k^2 = \frac{n(n-1)(2n-1)}{6}$. The Fisher information matrix is then

$$I(\theta) \approx \frac{n}{2\sigma^2}\begin{bmatrix} 1 & 0 & 0 \\ 0 & a^2 & \frac{a^2(n-1)}{2} \\ 0 & \frac{a^2(n-1)}{2} & \frac{a^2(n-1)(2n-1)}{6} \end{bmatrix}$$

Note the coupling between the unknown phase $\phi$ and unknown frequency $\omega$.

Example: Unknown Amplitude, Phase, and Frequency

If we consider unbiased estimators, then the information inequality (CRLB) will be $\mathrm{cov}_\theta[\hat\theta(Y)] \succeq I^{-1}(\theta)$. Although it is possible to symbolically invert $I(\theta)$ in this case, let's look at a numerical example: $n = 20$ and $a^2 = \sigma^2 = 1$. When we had only two unknown parameters $\theta = [a, \phi]^\top$, the CRLB was

$$\mathrm{cov}_\theta\left[\hat\theta(Y)\right] \succeq I^{-1}(\theta) = \begin{bmatrix} 0.1 & 0 \\ 0 & 0.1 \end{bmatrix}$$

When we have three unknown parameters $\theta = [a, \phi, \omega]^\top$, the CRLB can be computed as

$$\mathrm{cov}_\theta\left[\hat\theta(Y)\right] \succeq I^{-1}(\theta) = \begin{bmatrix} 0.1 & 0 & 0 \\ 0 & 0.371 & -0.029 \\ 0 & -0.029 & 0.003 \end{bmatrix}$$

Note the increase in $\mathrm{var}_\theta[\hat\phi(Y)]$ as a consequence of the unknown frequency.
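The three-parameter matrix and its inverse can be reproduced directly (a sketch using the slide's values $n = 20$, $a^2 = \sigma^2 = 1$):

```python
import numpy as np

# Build the (approximate) 3x3 FIM from the previous slide and invert it.
n, a2, sigma2 = 20, 1.0, 1.0
I3 = (n / (2 * sigma2)) * np.array([
    [1.0, 0.0,              0.0],
    [0.0, a2,               a2 * (n - 1) / 2],
    [0.0, a2 * (n - 1) / 2, a2 * (n - 1) * (2 * n - 1) / 6],
])
print(np.round(np.linalg.inv(I3), 3))
# [[ 0.1    0.     0.   ]
#  [ 0.     0.371 -0.029]
#  [ 0.    -0.029  0.003]]
```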

Multiparameter Information Inequality: Nuisance Parameters

Lemma (Lehmann TPE 1998 pp. 127-128).

$$\frac{1}{I_{ii}(\theta)} \leq \left[I^{-1}(\theta)\right]_{ii}$$

with equality if and only if $I_{ij}(\theta) = 0$ for all $j \neq i$.

As the example demonstrates and the Lemma confirms, the presence of additional unknown parameters never makes the problem of estimating a particular parameter easier. In most cases, additional unknown parameters make the estimation of a particular parameter more difficult. When these additional unknown parameters exist only to make the estimation of the desired parameters more difficult, they are called nuisance parameters.

Conclusions

The information bound is a very general lower bound on the variance of an estimator. It applies to biased or unbiased estimators of a real-valued non-random scalar or vector parameter. It is useful for finding MVU estimators by "guessing and checking" as well as for determining how well your estimator is working.

The Cramer-Rao lower bound is a special case of the general bound and applies only to unbiased estimators. An unbiased estimator achieving the CRLB is efficient and MVU. The converse is not always true.

Other bounds not covered here:

- Chapman-Robbins inequality (finite differences)
- Bhattacharyya inequality (higher-order derivatives)

Lots of extensions were also not covered here, for example, the information inequality for functions of parameters, i.e., $g(\theta)$, or complex parameters/observations.
