
Introduction to Nonlinear Regression

Andreas Ruckstuhl
IDP Institut für Datenanalyse und Prozessdesign
ZHAW Zürcher Hochschule für Angewandte Wissenschaften

October 2010 †

Contents
1. The Nonlinear Regression Model
2. Methodology for Parameter Estimation
3. Approximate Tests and Confidence Intervals
4. More Precise Tests and Confidence Intervals
5. Profile t-Plot and Profile Traces
6. Parameter Transformations
7. Forecasts and Calibration
8. Closing Comments
A. Gauss-Newton Method

† The author thanks Werner Stahel for his valuable comments.
E-Mail Address: Andreas.Ruckstuhl@zhaw.ch; Internet: http://www.idp.zhaw.ch

1. The Nonlinear Regression Model

Goals
The nonlinear regression block in the Weiterbildungslehrgang (WBL) in angewandter Statistik at the ETH Zurich should
1. introduce problems that are relevant to the fitting of nonlinear regression functions,
2. present graphical representations for assessing the quality of approximate confidence intervals, and
3. introduce some parts of the statistics software R that can help with solving concrete problems.

a  The Regression Model. Regression studies the relationship between a variable of interest Y and one or more explanatory or predictor variables x^(j). The general model is

    Y_i = h⟨x_i^(1), x_i^(2), ..., x_i^(m); θ_1, θ_2, ..., θ_p⟩ + E_i .

Here, h is an appropriate function that depends on the explanatory variables and on the parameters, which we summarize in the vectors x_i = [x_i^(1), x_i^(2), ..., x_i^(m)]^T and θ = [θ_1, θ_2, ..., θ_p]^T. The unstructured deviations from the function h are described by the random errors E_i. The random errors are assumed to be normally distributed,

    E_i ~ N⟨0, σ²⟩, independent.

b  The Linear Regression Model. In (multiple) linear regression, functions h are considered that are linear in the parameters θ_j,

    h⟨x_i^(1), x_i^(2), ..., x_i^(m); θ_1, θ_2, ..., θ_p⟩ = θ_1 x̃_i^(1) + θ_2 x̃_i^(2) + ... + θ_p x̃_i^(p) ,

where the x̃^(j) can be arbitrary functions of the original explanatory variables x^(j). (Here the parameters are usually denoted as β_j instead of θ_j.)

c  The Nonlinear Regression Model. In nonlinear regression, functions h are considered that cannot be written as linear in the parameters. Often such a function is derived from theory. In principle, there are unlimited possibilities for describing the deterministic part of the model. As we will see, this flexibility often means a greater effort to make statistical statements.

Example d  Puromycin. The speed with which an enzymatic reaction occurs depends on the concentration of a substrate. According to the information from Bates and Watts (1988), it was examined how treatment of the enzyme with an additional substance called Puromycin influences this reaction speed. The initial speed of the reaction is chosen as the variable of interest; it is measured via radioactivity. (The unit of the variable of interest is counts/min²; the number of registrations on a Geiger counter per time period measures the quantity of the substance present, and the reaction speed is proportional to the change per time unit.)

[Figure 1.d: Puromycin Example. (a) Data (treated enzyme; untreated enzyme) and (b) typical course of the regression function.]

The relationship of the variable of interest with the substrate concentration x (in ppm) is described via the Michaelis-Menten function

    h⟨x; θ⟩ = θ_1 x / (θ_2 + x) .

An infinitely large substrate concentration (x → ∞) results in the "asymptotic" speed θ_1. It has been suggested that this variable is influenced by the addition of Puromycin. The experiment is therefore carried out once with the enzyme treated with Puromycin and once with the untreated enzyme. Figure 1.d shows the result. In this section the data of the treated enzyme is used.

Example e  Oxygen Consumption. To determine the biochemical oxygen consumption, river water samples were enriched with dissolved organic nutrients, with inorganic materials, and with dissolved oxygen, and were bottled in different bottles (Marske, 1967; see Bates and Watts, 1988). Each bottle was then inoculated with a mixed culture of microorganisms and sealed in a climate chamber with constant temperature. The bottles were periodically opened and their dissolved oxygen content was analyzed. From this the biochemical oxygen consumption [mg/l] was calculated. The model used to connect the cumulative biochemical oxygen consumption Y with the incubation time x is based on exponential growth decay, which leads to

    h⟨x; θ⟩ = θ_1 (1 − exp⟨−θ_2 x⟩) .

Figure 1.e shows the data and the regression function to be applied.

Example f  From Membrane Separation Technology (Rapold-Nydegger, 1994). The ratio of protonated to deprotonated carboxyl groups in the pores of cellulose membranes depends on the pH value x of the outer solution. The protonation of the carboxyl carbon atoms can be captured with 13C-NMR. We assume that the relationship can be written with the extended "Henderson-Hasselbach Equation" for polyelectrolytes,

    log10⟨(θ_1 − y) / (y − θ_2)⟩ = θ_3 + θ_4 x ,
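
The regression functions of Examples 1.d to 1.f are easily written down in R. The following lines are a minimal sketch; the function names are chosen here for illustration, and the parameter values passed to curve() are arbitrary plausible choices, not estimates.

## Michaelis-Menten function of Example 1.d
h.puromycin <- function(x, theta) theta[1] * x / (theta[2] + x)

## exponential growth-decay model of Example 1.e (oxygen consumption)
h.oxygen <- function(x, theta) theta[1] * (1 - exp(-theta[2] * x))

## extended Henderson-Hasselbach model of Example 1.f, solved for y
h.membrane <- function(x, theta)
  (theta[1] + theta[2] * 10^(theta[3] + theta[4] * x)) /
  (1 + 10^(theta[3] + theta[4] * x))

## typical course of the Michaelis-Menten function, cf. Figure 1.d(b)
curve(h.puromycin(x, c(200, 0.1)), from = 0, to = 1.2,
      xlab = "Concentration", ylab = "Velocity")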

where the unknown parameters are θ_1, θ_2 and θ_3 > 0 and θ_4 < 0. Solving for y leads to the model

    Y_i = h⟨x_i; θ⟩ + E_i = (θ_1 + θ_2 · 10^(θ_3 + θ_4 x_i)) / (1 + 10^(θ_3 + θ_4 x_i)) + E_i .

[Figure 1.e: Oxygen consumption example. (a) Data and (b) typical shape of the regression function.]
[Figure 1.f: Membrane Separation Technology. (a) Data and (b) a typical shape of the regression function.]

The regression function h⟨x_i; θ⟩ for a reasonably chosen θ is shown in Figure 1.f next to the data.

g  A Few Further Examples of Nonlinear Regression Functions:
• Hill Model (Enzyme Kinetics): h⟨x_i; θ⟩ = θ_1 x_i^θ_3 / (θ_2 + x_i^θ_3).
  For θ_3 = 1 this is also known as the Michaelis-Menten Model (1.d).
• Mitscherlich Function (Growth Analysis): h⟨x_i; θ⟩ = θ_1 + θ_2 exp⟨θ_3 x_i⟩.
• From kinetics (chemistry) we get the function h⟨x_i^(1), x_i^(2); θ⟩ = exp⟨−θ_1 x_i^(1) exp⟨−θ_2 / x_i^(2)⟩⟩.

• Cobbs-Douglas Production Function:

    h⟨x_i^(1), x_i^(2); θ⟩ = θ_1 (x_i^(1))^θ_2 (x_i^(2))^θ_3 .

Since useful regression functions are often derived from the theory of the application area in question, a general overview of nonlinear regression functions is of limited benefit. A compilation of functions from publications can be found in Appendix 7 of Bates and Watts (1988).

h  Linearizable Regression Functions. Some nonlinear regression functions can be linearized through transformation of the variable of interest and the explanatory variables. For example, a power function

    h⟨x; θ⟩ = θ_1 x^θ_2

can be transformed into a function that is linear in the parameters,

    ln⟨h⟨x; θ⟩⟩ = ln⟨θ_1⟩ + θ_2 ln⟨x⟩ = β_0 + β_1 x̃ ,

where β_0 = ln⟨θ_1⟩, β_1 = θ_2 and x̃ = ln⟨x⟩. We call the regression function h linearizable if we can transform it into a function that is linear in the (unknown) parameters via transformations of the arguments and a monotone transformation of the result.

Here are some more linearizable functions (see also Daniel and Wood, 1980):

    h⟨x; θ⟩ = 1/(θ_1 + θ_2 exp⟨−x⟩)             <->  1/h⟨x; θ⟩ = θ_1 + θ_2 exp⟨−x⟩
    h⟨x; θ⟩ = θ_1 x/(θ_2 + x)                    <->  1/h⟨x; θ⟩ = 1/θ_1 + (θ_2/θ_1) · (1/x)
    h⟨x; θ⟩ = θ_1 x^θ_2                          <->  ln⟨h⟨x; θ⟩⟩ = ln⟨θ_1⟩ + θ_2 ln⟨x⟩
    h⟨x; θ⟩ = θ_1 exp⟨θ_2 g⟨x⟩⟩                  <->  ln⟨h⟨x; θ⟩⟩ = ln⟨θ_1⟩ + θ_2 g⟨x⟩
    h⟨x; θ⟩ = exp⟨−θ_1 x^(1) exp⟨−θ_2/x^(2)⟩⟩    <->  ln⟨−ln⟨h⟨x; θ⟩⟩⟩ = ln⟨θ_1⟩ + ln⟨x^(1)⟩ − θ_2/x^(2)
    h⟨x; θ⟩ = θ_1 (x^(1))^θ_2 (x^(2))^θ_3        <->  ln⟨h⟨x; θ⟩⟩ = ln⟨θ_1⟩ + θ_2 ln⟨x^(1)⟩ + θ_3 ln⟨x^(2)⟩

The last one is the Cobbs-Douglas Model from 1.g.

i  The Statistically Complete Model. A linear regression with the linearized regression function in the example just referred to is based on the model

    ln⟨Y_i⟩ = β_0 + β_1 x̃_i + E_i ,

where the random errors E_i all have the same normal distribution. If we back-transform this model, we get

    Y_i = θ_1 · x^θ_2 · Ẽ_i

with Ẽ_i = exp⟨E_i⟩. The errors Ẽ_i, i = 1, ..., n, now contribute multiplicatively and are lognormally distributed! The assumptions about the random deviations are thus drastically different than for a model that is based directly on h,

    Y_i = θ_1 · x^θ_2 + E_i ,

with random deviations E_i that, as usual, contribute additively and have a specific normal distribution.
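
The difference between the two error assumptions can be made concrete in R. The sketch below is a minimal illustration under the assumption of a hypothetical data frame d with columns x and y; it fits the power function once via the linearized model and once in its nonlinear form with additive errors.

## hypothetical data frame d with columns x and y
## Model 1: linearized fit, i.e. multiplicative lognormal errors
##          Y_i = theta1 * x^theta2 * E~_i
fit.lin <- lm(log(y) ~ log(x), data = d)
theta1.lin <- exp(coef(fit.lin)[[1]])   # theta1 = exp(beta0)
theta2.lin <- coef(fit.lin)[[2]]        # theta2 = beta1

## Model 2: nonlinear fit with additive normal errors
##          Y_i = theta1 * x^theta2 + E_i
fit.nonlin <- nls(y ~ theta1 * x^theta2, data = d,
                  start = list(theta1 = theta1.lin, theta2 = theta2.lin))

The two fits generally give different estimates; residual analysis has to decide which error assumption, and hence which fit, is appropriate.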

A linearization of the regression function is therefore advisable only if the assumptions about the random deviations can be better satisfied: in our example, if the errors actually act multiplicatively rather than additively and are lognormally rather than normally distributed. These assumptions must be checked with residual analysis.

j*  Note. In linear regression it has been shown that the variance can be stabilized with certain transformations (e.g. log⟨·⟩, √·). If this is not possible, in certain circumstances one can also perform a weighted linear regression. The process is analogous in nonlinear regression.

k  The introductory examples so far. We have spoken almost exclusively of regression functions that depend on only one original variable. This was primarily because it was possible to fully illustrate the model graphically. The ensuing theory also works well for regression functions h⟨x; θ⟩ that depend on several explanatory variables x = [x^(1), x^(2), ..., x^(m)].

2. Methodology for Parameter Estimation

a  The Principle of Least Squares. To get estimates for the parameters θ = [θ_1, θ_2, ..., θ_p]^T, one applies, as in linear regression calculations, the principle of least squares. The sum of the squared deviations

    S(θ) := Σ_{i=1}^n (y_i − η_i⟨θ⟩)²    with    η_i⟨θ⟩ := h⟨x_i; θ⟩

should thus be minimized. The notation in which h⟨x_i; θ⟩ is replaced by η_i⟨θ⟩ is reasonable because [x_i, y_i] is given by the measurement or observation of the data and only the parameters θ remain to be determined.

Unfortunately, the minimum of the sum of squares, and thus the estimate, cannot be given explicitly as in linear regression. Iterative numerical procedures help further. The basic ideas behind the common algorithm will be sketched out here. They also form the basis for the easiest way to derive tests and confidence intervals.
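
Before turning to the geometric picture, the sum of squares S(θ) of 2.a can be written down directly in R. A minimal sketch for the treated Puromycin data (the data set Puromycin, with columns conc, rate and state, ships with R; the trial value of θ is arbitrary):

d <- subset(Puromycin, state == "treated")

S <- function(theta) {
  eta <- theta[1] * d$conc / (theta[2] + d$conc)  # eta_i<theta> = h<x_i; theta>
  sum((d$rate - eta)^2)                           # S(theta)
}

S(c(200, 0.1))   # sum of squared deviations at an arbitrary trial value of theta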

b  Geometric Illustration. The observed values Y = [Y_1, Y_2, ..., Y_n]^T determine a point in n-dimensional space. The same holds for the "model values" η⟨θ⟩ = [η_1⟨θ⟩, η_2⟨θ⟩, ..., η_n⟨θ⟩]^T for given θ.

Take note! The usual geometric representation of data that is standard in, for example, multivariate statistics considers the observations given by m variables x^(j), j = 1, 2, ..., m, as points in m-dimensional space. Here, though, we consider the Y- and η-values of all n observations as points in n-dimensional space.

Unfortunately our intuition stops with three dimensions, and thus with three observations. So we try it for a situation limited in this way, first for simple linear regression. As stated, the observed values Y = [Y_1, Y_2, Y_3]^T determine a point in 3-dimensional space. For given parameters β_0 = 5 and β_1 = 1 we can calculate the model values η_i⟨β⟩ = β_0 + β_1 x_i and represent the corresponding vector η⟨β⟩ = β_0 1 + β_1 x as a point. We now ask where all points lie that can be achieved by variation of the parameters. These are the possible linear combinations of the two vectors 1 and x and thus form the plane "spanned by 1 and x". In estimating the parameters according to the principle of least squares, geometrically represented, the squared distance between Y and η⟨β⟩ is minimized. So, we want the point on the plane that has the least distance to Y. This is also called the projection of Y onto the plane. The parameter values that correspond to this point η̂ are therefore the estimated parameter values β̂ = [β̂_0, β̂_1]^T.

Now a nonlinear function, e.g. h⟨x; θ⟩ = θ_1 exp⟨1 − θ_2 x⟩, should be fitted to the same three observations. We can again ask where all points η⟨θ⟩ lie that can be achieved through variations of the parameters θ_1 and θ_2. They lie on a two-dimensional curved surface (called the model surface in the following) in three-dimensional space. The estimation problem again consists of finding the point η̂ on the model surface that lies nearest to Y. The parameter values that correspond to this point η̂ are then the estimated parameter values θ̂ = [θ̂_1, θ̂_2]^T.

c  Solution Approach for the Minimization Problem. The main idea of the usual algorithm for minimizing the sum of squared deviations (see 2.a) is as follows: if a preliminary best value θ^(ℓ) exists, we approximate the model surface with the plane that touches the surface at the point η⟨θ^(ℓ)⟩ = h⟨x; θ^(ℓ)⟩. Now we seek the point on this plane that lies closest to Y. This amounts to the estimation in a linear regression problem. This new point lies on the plane, but not on the surface that corresponds to the nonlinear problem. However, it determines a parameter vector θ^(ℓ+1), and with this we go into the next round of iteration.

d  Linear Approximation. To determine the approximating plane, we need the partial derivatives

    A_i^(j)⟨θ⟩ := ∂η_i⟨θ⟩ / ∂θ_j ,

which we can summarize in an n×p matrix A. The approximation of the model surface η⟨θ⟩ by the "tangential plane" at a parameter value θ* is

    η_i⟨θ⟩ ≈ η_i⟨θ*⟩ + A_i^(1)⟨θ*⟩ (θ_1 − θ_1*) + ... + A_i^(p)⟨θ*⟩ (θ_p − θ_p*)

or, in matrix notation,

    η⟨θ⟩ ≈ η⟨θ*⟩ + A⟨θ*⟩ (θ − θ*) .

If we now add back in the random error, we get a linear regression model

    Ỹ = A⟨θ*⟩ β + E

with the "preliminary residuals" Ỹ_i = Y_i − η_i⟨θ*⟩ as variable of interest, the columns of A as regressors, and the coefficients β_j = θ_j − θ_j* (a model without intercept β_0).
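
The linear approximation of 2.d can be carried out in a few lines of R. The following minimal sketch, assuming the data frame d and the function h.puromycin from the sketches above and an arbitrary preliminary value theta.star, computes the derivative matrix A⟨θ*⟩ by finite differences, the preliminary residuals, and the least squares correction β, which is exactly the step that the Gauss-Newton algorithm of the next paragraph repeats.

theta.star <- c(200, 0.1)            # arbitrary preliminary parameter value
p <- length(theta.star)

## derivative matrix A<theta.star> (n x p), here by central finite differences
A <- sapply(1:p, function(j) {
  d.j <- rep(0, p); d.j[j] <- 1e-6
  (h.puromycin(d$conc, theta.star + d.j) -
   h.puromycin(d$conc, theta.star - d.j)) / 2e-6
})

## "preliminary residuals" and least squares correction beta
## (linear regression without intercept, cf. 2.d)
y.tilde <- d$rate - h.puromycin(d$conc, theta.star)
beta <- coef(lm(y.tilde ~ A - 1))
theta.new <- theta.star + beta       # improved parameter value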

e  Gauss-Newton Algorithm. The Gauss-Newton algorithm consists of, beginning with a start value θ^(0) for θ, solving the linear regression problem just introduced for θ* = θ^(0) to find a correction β and from this an improved value θ^(1) = θ^(0) + β. For this, the approximated model is calculated again, i.e. the "preliminary residuals" Y − η⟨θ^(1)⟩ and the partial derivatives A⟨θ^(1)⟩ are determined, and this gives us θ^(2). This iteration step is repeated until the correction β becomes negligible. (Further details can be found in Appendix A.)

It cannot be guaranteed that this procedure actually finds the minimum of the sum of squares. The chances are better, the better the p-dimensional model surface at the minimum θ̂ = (θ̂_1, ..., θ̂_p)^T can be locally approximated by a p-dimensional "plane" and the closer the start value θ^(0) is to the solution being sought.

* Algorithms conveniently determine the derivative matrix A numerically. In more complex problems the numerical approximation can be insufficient and cause convergence problems. It is then advantageous if expressions for the partial derivatives can be arrived at analytically. With these the derivative matrix can be reliably determined numerically and the procedure is more likely to converge (see also Chapter 6).

f  Initial Values. An iterative procedure requires a starting value in order for it to be applied at all. Good starting values help the iterative procedure to find a solution more quickly and surely. Some possibilities to get these more or less easily are briefly presented here.

g  Initial Value from Prior Knowledge. As already noted in the introduction, nonlinear models are often based on theoretical considerations from the application area in question. Already existing prior knowledge from similar experiments can be used to get an initial value. To be sure that the chosen start value fits, it is advisable to graphically represent the regression function h⟨x; θ⟩ for various possible starting values θ = θ^(0) together with the data (e.g., as in Figure 2.h, right).

h  Start Values via Linearizable Regression Functions. Often, because of the distribution of the errors, one is forced to remain with the nonlinear form in models with linearizable regression functions. However, the linearized model can deliver starting values.

In the Puromycin example the regression function is linearizable: the reciprocal values of the two variables fulfill

    ỹ = 1/y ≈ 1/h⟨x; θ⟩ = 1/θ_1 + (θ_2/θ_1) · (1/x) = β_0 + β_1 x̃ .

The least squares solution for this modified problem is β̂ = [β̂_0, β̂_1]^T = (0.00511, 0.000247)^T (Figure 2.h(a)). This gives the initial values

    θ_1^(0) = 1/β̂_0 = 196 ,    θ_2^(0) = β̂_1/β̂_0 = 0.048 .
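
In R, the start values for the Puromycin example can be obtained exactly as described in 2.h, and the Gauss-Newton iterations are then carried out by nls(). A minimal sketch, again using the Puromycin data that ship with R; the object names are chosen here for illustration.

d <- subset(Puromycin, state == "treated")

## linearized model:  1/y = beta0 + beta1 * (1/x)
fit.lm <- lm(I(1/rate) ~ I(1/conc), data = d)
b <- coef(fit.lm)
theta.start <- c(theta1 = 1/b[[1]], theta2 = b[[2]]/b[[1]])  # roughly (196, 0.048)

## nonlinear least squares fit, started at theta.start
fit.pur <- nls(rate ~ theta1 * conc / (theta2 + conc), data = d,
               start = as.list(theta.start))
summary(fit.pur)

By default nls() uses a Gauss-Newton type algorithm and stops when a relative-offset convergence criterion is met.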

[Figure 2.h: Puromycin Example. Left: Regression line in the linearized problem. Right: Regression function h⟨x; θ⟩ for the initial values θ = θ^(0) and for the least squares estimate θ = θ̂.]

i  Initial Values via Geometric Meaning of the Parameter. It is often helpful to consider the geometrical features of the regression function.

In the Puromycin example we can thus arrive at an initial value in another, instructive way: θ_1 is the y-value for x → ∞. Since the regression function is monotonically increasing, we can use the maximal y_i-value or a visually determined "asymptotic value" θ_1^(0) = 207 as initial value for θ_1. The parameter θ_2 is the x-value at which y reaches half of the asymptotic value θ_1. This gives θ_2^(0) = 0.06.

The initial values thus result from the geometrical meaning of the parameters and a coarse determination of the corresponding aspects of a curve "fitted by eye".

Example j  Membrane Separation Technology. In the Membrane Separation example we let x → −∞, so h⟨x; θ⟩ → θ_1 (since θ_4 < 0); for x → ∞, h⟨x; θ⟩ → θ_2. Figure 1.f(a) shows, together with the data, θ_1^(0) ≈ 163.7 and θ_2^(0) ≈ 159.5. Once θ_1 and θ_2 are known in this way, we can linearize the regression function through

    ỹ := log10⟨(θ_1^(0) − y) / (y − θ_2^(0))⟩ = θ_3 + θ_4 x .

We speak of a conditionally linearizable function. The linear regression leads to the initial values θ_3^(0) = 1.83 and θ_4^(0) = −0.36.

With these initial values the algorithm converges to the solution θ̂_1 = 163.7, θ̂_2 = 159.8, θ̂_3 = 2.675 and θ̂_4 = −0.512. The functions h⟨·; θ^(0)⟩ and h⟨·; θ̂⟩ are shown in Figure 2.j(b).

* The property of conditional linearity of a function can also be useful for developing an algorithm specially suited for this situation (see e.g. Bates and Watts, 1988).
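
The conditionally linear start values of Example 2.j can be computed along the same lines. A minimal sketch, assuming a hypothetical data frame dmem with columns pH and delta (the variable names used in Table 3.d below); the membrane data themselves do not ship with R.

t1.0 <- 163.7;  t2.0 <- 159.5        # theta1^(0), theta2^(0), read off Figure 1.f(a)

## conditional linearization:  y~ = log10((t1.0 - y)/(y - t2.0)) = theta3 + theta4 * x
y.tilde <- log10((t1.0 - dmem$delta) / (dmem$delta - t2.0))
fit.lin <- lm(y.tilde ~ pH, data = dmem)   # approx. theta3^(0) = 1.83, theta4^(0) = -0.36

fit.mem <- nls(delta ~ (T1 + T2 * 10^(T3 + T4 * pH)) / (10^(T3 + T4 * pH) + 1),
               data = dmem,
               start = list(T1 = t1.0, T2 = t2.0,
                            T3 = coef(fit.lin)[[1]], T4 = coef(fit.lin)[[2]]))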

[Figure 2.j: Membrane Separation Technology Example. (a) Regression line used for determining the initial values for θ_3 and θ_4. (b) Regression function h⟨x; θ⟩ for the initial value θ = θ^(0) and for the least squares estimate θ = θ̂.]

3. Approximate Tests and Confidence Intervals

a  The estimator θ̂ gives the value of θ that fits the data optimally. We now ask which parameter values θ are compatible with the observations. The confidence region is the set of all these values. For an individual parameter θ_j the confidence region is the confidence interval.

The results that now follow are based on the fact that the estimator θ̂ is asymptotically multivariate normally distributed. For an individual parameter that leads to a "z-test" and the corresponding confidence interval; for several parameters the corresponding chi-square test works and gives elliptical confidence regions.

b  The asymptotic properties of the estimator can be derived from the linear approximation. The problem of nonlinear regression is indeed approximately equal to the linear regression problem mentioned in 2.d,

    Ỹ = A⟨θ*⟩ β + E ,

if the parameter vector θ* that is used for the linearization lies near the solution. If the estimation procedure has converged (i.e. θ* = θ̂), then β = 0; otherwise this would not be the solution. The standard errors of the coefficients β, and more generally the covariance matrix of β̂, then correspond approximately to the corresponding values for θ̂.

* A bit more precisely: the standard errors characterize the uncertainties that are generated by the random fluctuations in the data. The available data have led to the estimate θ̂. If the data were somewhat different, then θ̂ would still be approximately correct; thus we accept that it is good enough for the linearization. The estimate of β for the new data set would then lie about as far from the estimate for the available data as corresponds to the distribution of the parameters in the linearized problem.

c  Asymptotic Distribution of the Least Squares Estimator. From these considerations it follows: asymptotically the least squares estimator θ̂ is normally distributed (and consistent), and therefore

    θ̂ ~ N⟨θ, V⟨θ⟩/n⟩, asymptotically,

with asymptotic covariance matrix V⟨θ⟩ = σ² (A⟨θ⟩^T A⟨θ⟩)^{-1}, where A⟨θ⟩ is the n×p matrix of the partial derivatives (see 2.d).

To determine the covariance matrix V⟨θ⟩ explicitly, A⟨θ⟩ is calculated at the point θ̂ instead of the unknown point θ, and for the error variance σ² the usual estimator is plugged in:

    V̂⟨θ⟩ = σ̂² (A⟨θ̂⟩^T A⟨θ̂⟩)^{-1}    with    σ̂² = S⟨θ̂⟩/(n − p) = (1/(n − p)) Σ_{i=1}^n (y_i − η_i⟨θ̂⟩)² .

With this the distribution of the estimated parameters is approximately determined, from which, as in linear regression, standard errors and confidence intervals can be derived, or confidence ellipses (or ellipsoids) if several variables are considered at once.

The denominator n − p in σ̂² is introduced in linear regression to make the estimator unbiased. Tests and confidence intervals are then determined not with the normal and chi-square distributions but with the t and F distributions. There it is taken into account that the estimation of σ² causes an additional random fluctuation. Even if the distribution is no longer exact, the approximations get more exact if we do this in nonlinear regression as well. Asymptotically the difference goes to zero.

Example d  Membrane Separation Technology. Table 3.d shows a computer output for the Membrane Separation example. The estimates of the parameters are in the column "Estimate", followed by the estimated approximate standard errors and the test statistics ("t value"), which are approximately t_{n−p} distributed. In the last row the estimated standard deviation σ̂ of the random errors E_i is given.

From this output, the confidence intervals for the parameters can be determined as in linear regression: the approximate 95% confidence interval for the parameter θ_1 is

    163.706 ± q^{t_35}_{0.975} · 0.1262 = 163.706 ± 0.256 .

Formula: delta ~ (T1 + T2 * 10^(T3 + T4 * pH)) / (10^(T3 + T4 * pH) + 1)

Parameters:
    Estimate  Std. Error   t value  Pr(>|t|)
T1  163.7056      0.1262  1297.256   < 2e-16
T2  159.7846      0.1594  1002.194   < 2e-16
T3    2.6751      0.3813     7.015  3.65e-08
T4   -0.5119      0.0703    -7.281  1.66e-08

Residual standard error: 0.2931 on 35 degrees of freedom

Number of iterations to convergence: 7
Achieved convergence tolerance: 5.517e-06

Table 3.d: Membrane Separation Technology Example: R summary of the fit.
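
In R, the quantities of 3.c and 3.d can be extracted directly from the summary of an nls fit. A minimal sketch, assuming the fit object fit.mem of the membrane model from the sketch after 2.j:

s <- summary(fit.mem)

## estimated covariance matrix  V^<theta> = sigma^2 * (A^T A)^(-1)
V  <- s$sigma^2 * s$cov.unscaled      # the same matrix is returned by vcov(fit.mem)
se <- sqrt(diag(V))                   # approximate standard errors, cf. Table 3.d

## approximate 95% confidence intervals:  estimate +/- t-quantile * std. error
est <- coef(fit.mem)
q.t <- qt(0.975, df.residual(fit.mem))
cbind(estimate = est, lower = est - q.t * se, upper = est + q.t * se)

## for T1 this reproduces 163.706 +/- 2.03 * 0.1262 = 163.706 +/- 0.256

More precise intervals that do not rely on the linear approximation (cf. Chapters 4 and 5) can typically be obtained by profiling, e.g. via confint() on the nls fit, depending on the installed packages and R version.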

Example e  Puromycin. To check whether the treatment of the enzyme with Puromycin influences the parameters of the postulated form (1.d), a general model for the data with and without the treatment can be formulated as follows:

    Y_i = (θ_1 + θ_3 z_i) x_i / (θ_2 + θ_4 z_i + x_i) + E_i ,

where z is the indicator variable for the treatment (z_i = 1 if treated, otherwise z_i = 0).

Table 3.e shows that the parameter θ_4 is not significantly different from 0 at the 5% level, since the P value of 0.167 is larger than the level (5%). However, the treatment has a clear influence, which is expressed through θ_3; the 95% confidence interval covers the region 52.398 ± 9.5513 · 2.09 = [32.4, 72.4] (the value 2.09 corresponds to the 0.975 quantile of the t_19 distribution).

Formula: velocity ~ (T1 + T3 * (treated == T)) * conc/(T2 + T4 * (treated == T) + conc)

Parameters:
    Estimate  Std. Error  t value  Pr(>|t|)
T1   160.280       6.896   23.242  2.04e-15
T2     0.048       0.008    5.761  1.50e-05
T3    52.404       9.551    5.487  2.71e-05
T4     0.016       0.011    1.436     0.167

Residual standard error: 10.4 on 19 degrees of freedom

Number of iterations to convergence: 6
Achieved convergence tolerance: 4.267e-06

Table 3.e: R summary of the fit for the Puromycin example.

f  Confidence Intervals for Function Values. Besides the parameters, the function value h⟨x_0; θ⟩ for a given x_0 is often of interest. In linear regression the function value h⟨x_0; β⟩ = x_0^T β =: η_0 is estimated by η̂_0 = x_0^T β̂ and the estimated (1 − α) confidence interval for it is

    η̂_0 ± q^{t_{n−p}}_{1−α/2} · se⟨η̂_0⟩    with    se⟨η̂_0⟩ = σ̂ √( x_0^T (X^T X)^{-1} x_0 ) .

With analogous considerations and asymptotic approximation we can specify confidence intervals for the function values h⟨x_0; θ⟩ for nonlinear h. If the function η_0⟨θ⟩ := h⟨x_0; θ⟩ is approximated at the point θ̂, we get

    η_0⟨θ⟩ ≈ η_0⟨θ̂⟩ + a_0^T (θ − θ̂)    with    a_0 = ∂h⟨x_0; θ⟩ / ∂θ .

(If x_0 is equal to an observed x_i, then a_0 equals the corresponding row of the matrix A from 2.d.) The confidence interval for the function value η_0⟨θ⟩ := h⟨x_0; θ⟩ is then approximately

    η_0⟨θ̂⟩ ± q^{t_{n−p}}_{1−α/2} · se⟨η_0⟨θ̂⟩⟩    with    se⟨η_0⟨θ̂⟩⟩ = σ̂ √( â_0^T (A⟨θ̂⟩^T A⟨θ̂⟩)^{-1} â_0 ) .

In this formula the unknown values are again replaced by their estimates.
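
The asymptotic interval for a function value can be computed by hand from the summary of the fit. A minimal sketch for the oxygen consumption example, assuming a hypothetical nls fit fit.o2 of the model of Example 1.e and the function h.oxygen defined earlier; since predict() for nls fits does not return standard errors, the gradient a_0 is obtained by finite differences.

x0 <- 6                                   # incubation time of interest (days)
th <- coef(fit.o2)
s  <- summary(fit.o2)

## gradient a0 = dh<x0; theta>/dtheta at theta-hat, by central finite differences
a0 <- sapply(seq_along(th), function(j) {
  d.j <- rep(0, length(th)); d.j[j] <- 1e-6
  (h.oxygen(x0, th + d.j) - h.oxygen(x0, th - d.j)) / 2e-6
})

## standard error and approximate 95% confidence interval for h<x0; theta>
eta0    <- h.oxygen(x0, th)
se.eta0 <- s$sigma * sqrt(drop(t(a0) %*% s$cov.unscaled %*% a0))
eta0 + qt(0.975, df.residual(fit.o2)) * c(-1, 1) * se.eta0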

[Figure 3.g: Left: Confidence band for an estimated line for a linear problem. Right: Confidence band for the estimated curve h⟨x; θ⟩ in the oxygen consumption example.]

g  Confidence Band. The expression for the (1 − α) confidence interval for η_0⟨θ⟩ := h⟨x_0; θ⟩ also holds for arbitrary x_0. As in linear regression, it is natural to represent the limits of these intervals as a "confidence band" that is a function of x_0, as Figure 3.g shows for the two examples of Puromycin and oxygen consumption.

Confidence bands for linear and nonlinear regression functions behave differently: for linear functions this confidence band is thinnest at the center of gravity of the explanatory variables and gets gradually wider further out (see Figure 3.g, left). In the nonlinear case, the bands can be arbitrary. Because the functions in the "Puromycin" and "Oxygen Consumption" examples must go through zero, the interval shrinks to a point there. Both models have a horizontal asymptote and therefore the band reaches a constant width for large x (see Figure 3.g, right).

h  Prediction Interval. The confidence band just considered indicates where the ideal function values h⟨x⟩, and thus the expected values of Y for given x, lie. The question in which region future observations Y_0 for given x_0 will lie is not answered by this. However, this is often more interesting than the question of the ideal function value; for example, we would like to know in which region the measured value of oxygen consumption would lie for an incubation time of 6 days.

Such a statement is a prediction about a random variable and differs in principle from a confidence interval, which says something about a parameter, i.e. a fixed but unknown number. Corresponding to the question posed, we call the region we are now seeking a prediction interval or prognosis interval. More about this in Chapter 7.

i  Variable Selection. In nonlinear regression, unlike linear regression, variable selection is not an important topic, because
• a variable does not correspond to each parameter, so usually the number of parameters is different from the number of variables, and
• there are seldom problems where we need to clarify whether an explanatory variable is necessary or not: the model is derived from the subject-matter theory.

However, there is sometimes the reasonable question of whether a subset of the parameters in the nonlinear regression model can appropriately describe the data (see the Puromycin example).
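
Returning to the confidence band of 3.g: it is obtained by simply evaluating the interval of the previous sketch on a grid of x_0 values. A minimal continuation, using the same assumed objects fit.o2, h.oxygen, th and s:

x.grid <- seq(0.1, 8, length.out = 100)
band <- t(sapply(x.grid, function(x0) {
  a0 <- sapply(seq_along(th), function(j) {
    d.j <- rep(0, length(th)); d.j[j] <- 1e-6
    (h.oxygen(x0, th + d.j) - h.oxygen(x0, th - d.j)) / 2e-6
  })
  se0 <- s$sigma * sqrt(drop(t(a0) %*% s$cov.unscaled %*% a0))
  h.oxygen(x0, th) + qt(0.975, df.residual(fit.o2)) * c(-1, 1) * se0
}))
## draw the band on top of an existing plot of the data and the fitted curve
matlines(x.grid, band, lty = 2, col = 1)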

4. More Precise Tests and Confidence Intervals

a  The quality of the approximate confidence region depends strongly on the quality of the linear approximation. The convergence properties of the optimization algorithms are also influenced by the quality of the linear approximation. With a somewhat larger computational effort, the linearity can be checked graphically and, at the same time, we get a more precise confidence interval.

b  F Test for Model Comparison. To test a null hypothesis θ = θ* for the whole parameter vector, or also θ_j = θ_j* for an individual component, we can use an F test for model comparison as in linear regression. Here, we compare the sum of squares S⟨θ*⟩ that arises under the null hypothesis with the sum of squares S⟨θ̂⟩. (For n → ∞ the F test is the same as the so-called likelihood ratio test, and the sum of squares is, up to a constant, equal to the log likelihood.)

Now we consider the null hypothesis θ = θ* for the whole parameter vector. The test statistic is

    T = ((n − p)/p) · (S⟨θ*⟩ − S⟨θ̂⟩) / S⟨θ̂⟩  ~  F_{p, n−p}  (asymptotically).

From this we get a confidence region

    { θ | S⟨θ⟩ ≤ S⟨θ̂⟩ (1 + (p/(n − p)) q) } ,

where q = q^{F_{p,n−p}}_{1−α} is the (1 − α) quantile of the F distribution with p and n − p degrees of freedom.

In linear regression we get the same exact confidence region if we use the (multivariate) normal distribution of the estimator β̂. In the nonlinear case the results are different. The region that is based on the F test is not based on the linear approximation in 2.d and is thus (much) more exact.

c  Exact Confidence Regions for p = 2. If p = 2, we can find the exact confidence region by calculating S⟨θ⟩ on a grid of θ values and determining the borders of the region through interpolation, as is familiar for contour plots. Figure 4.c shows the contours together with the elliptical regions that result from the linear approximation, for the Puromycin example (left) and the oxygen consumption example (right).

For p > 2 contour plots do not exist. In the next chapter we will be introduced to graphical tools that also work for higher dimensions. They depend on the following
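
As a practical complement to the F test of 4.b: for nested models the test can be carried out in R by comparing two nls fits. A minimal sketch for the question of Example 3.e (is θ_4 needed?), with start values taken roughly from Table 3.e; the object names are chosen here for illustration.

Puro <- transform(Puromycin, z = as.numeric(state == "treated"))

fit.full <- nls(rate ~ (T1 + T3 * z) * conc / (T2 + T4 * z + conc), data = Puro,
                start = list(T1 = 160, T2 = 0.05, T3 = 50, T4 = 0.02))
fit.red  <- nls(rate ~ (T1 + T3 * z) * conc / (T2 + conc), data = Puro,
                start = list(T1 = 160, T2 = 0.05, T3 = 50))

anova(fit.red, fit.full)       # F test based on the difference of the sums of squares

## the same F statistic computed from the sums of squares, cf. the formula above
S.red  <- sum(resid(fit.red)^2)
S.full <- sum(resid(fit.full)^2)
F.stat <- (S.red - S.full) / (S.full / df.residual(fit.full))
1 - pf(F.stat, 1, df.residual(fit.full))   # P value for H0: theta4 = 0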
