Adaptable Multivariate Calibration Models For Spectral .

2y ago
37 Views
2 Downloads
905.64 KB
24 Pages
Last View : 20d ago
Last Download : 3m ago
Upload by : Maxine Vice
Transcription

1* .Adaptable MultivariateCalibration Models for Spectral ApplicationsEdward V. ThomasSandia National LaboratoriesAlbuquerque, NM 87185-0829b &EpGEl EG’EEC 7 w!!AbstractMultivariate calibration techniques have been used in a wide variety of spectroscopicsituations.In many of these situations spectral variation can be partitioned intomeaningful classes. For example, suppose that multiple spectra are obtained from eachof a number of different objects wherein the level of the analyte of interest varies withineach object over time. In such situations the total spectral variation observed across allmeasurementshas two distinct general sources of variation: intra-object and inter-object.One might want to develop a global multivariate calibration model that predicts theanalyte of interest accurately both within and across objects, including new objects notinvolved in developing the calibration model. However, this goal might be hard torealize if the inter-object spectral variation is complex and difficult to model. If the intraobject spectral variation is consistent across objects, an effective alternative approachmight be to develop a generic intra-object model that can be adapted to each obj ectseparately.This paper contains recommendationsanalysis in such situations.noninvasive measurementfor experimental protocols and dataThe approach is illustrated with an example involving theof glucose using near-infrared reflectance spectroscopy.Extensions to calibration maintenance and calibration transfer are discussed.OSTI

DISCIAIMERThis report was prepared as an account of work sponsoredby an agency of the United States Government. Neither theUnited States Government nor any agency thereof, nor anyof their employees, make any warranty, express or implied,or assumes any legal liability or responsibility for theaccuracy, completeness, or usefulness of any information,apparatus, product, or process disclosed, or represents thatits use would not infringe privately owned rights. Referenceherein to any specific commercial product, process, ormanufacturer,orservice by trade name, trademark,otherwise does not necessarily constitute or imply itsendorsement, recommendation, or favoring by the UnitedStates Government or any agency thereof. The views andopinions of authors expressed herein do not necessarilystate or reflect those of the UnitedStates Government or.any agency thereof.

DISCLAIMERPortions of this document may be illegiblein electronic image products. Images areproduced from the best available originaldocument.

.IntroductionMultivariateparticularlyin spectroscopy.on multivariatecontextcalibration has received significant attention in analytical chemistry,calibration.are associatedmanufacturingother contexts.withindustries,Examplesof multivariateapplicationscalibrationin the agriculturalmedical sciences4, the pharmaceuticalin a spectroscopicand foodindustries,industries,and in manySee the review by Lavine6 for more examples.The developmentmeasurementMartens and Naesl provide an excellent general referenceof a multivariate calibration model in spectroscopyinvolves theof n specimens at q different wavelengths comprising the calibration set. Inthe case of a linear multivariatecalibration model, the resultant predictive model can beexpressed in the form,j bo bl.x1 b2. x2 . bxq,q,(1)where is the predicted value of the analyte of interest for a new specimen given thespectral measurements(xi’s) associated with the new specimen and the model parameters(hi’s). Common methods for obtaining the model parameters include partial least-squaresregression (PLS) and principal components regression (PCR)l.The efficacy of thepredictive model depends on the applicability of the model form and how well thecalibration set represents the compositionsmodel will be applied.of the new specimens on which the predictiveThe efficacy of the predictive model also depends on how wellthe calibration set spans instrumental/environmentalconditions expected in the future.This is particularly critical when spectral effects induced by instrumental variation (e.g.,source fluctuations)and the instrument’s local environment (e.g., temperature)nontrivial when compared to effects introduced by varying the compositional2arefactors7.

. ,In many situations, it would be desirable to apply the predictive model to spectraobtained across and within different objects. It is assumed that the objects are dynamic inthe sense that the level of the analyte of interest can vary from spectrum to spectrumwithin an object. In such situations, the nature of the spectral variation across objectsmight make it difficult to develop a single model of the form in Equation 1 that is validacross objects.However, if the intra-object spectral variation is similar across objectsand amenable to a model of the form in Equation 1, there is a viable alternative approach.This approach, which is the topic of this paper, is to develop a generic model (operablewithin an object) that can be adapted to each object separately.recommendationsThis paper containsfor experimental protocols and data analysis in such situations.approach is illustrated with an example involving the noninvasive measurementTheofglucose using near-infrared reflectance spectroscopy where the objects are differentindividuals.Extensions of this approach to calibration maintenance and transfer arediscussed.TheoryConsider q-dimensionalspectral measurementsthat are obtained within andacross a population of objects. Let xv [xtil, Xvz,., xu ] denote thejt spectrumassociated with the ith object. This spectrum can be represented byxo p- ai zq,where P [,U1, P2,., ] is the average spectrum across the population,ai [ail, ai2,., aiq] is the specific effect of the ith object on the spectrum, and&ti [&i,, &i2,., &i ] is the specific effect of thej spectrum of the ith object. It is3(2)

,.assumed that the ZYterm describes random spectral effects that, in a sense, have acommon distributionacross objects. That is, these effects are just as likely to beassociated with one object as well as another. In contrast, the ai effects are specific to anobject. It is also assumed that the level of the associated analyte of interest varies acrossthe measurementsof each object (e.g., the analyte of interest could be changing in time).Furthermore it is assumed that a unit-change in the analyte of interest induces an identicalspectral effect across objects.The usual strategy for developing a global model (one that is applicable acrossobjects) is to use a modeling method such as PLS or PCR in conjunction with anappropriate calibration set containing spectra from multiple objects. The calibration setconsists of Xand Y, where X [X1,X2, . . . X1], Y 1, y2, . . . JYIl,xi [Xii,%2,. . . xinJisthe nix q matrix (i.e. number of spectra by number of pixels) of the calibration spectrafrom the iti of I objects, and i [yiI, i2,,.,yin) is the ni x 1 vector of the associatedanalyte reference values. The dimensions of X and Y are Nxq and Nxl respectively,where N ni . A linear calibration model is obtained by using a method such as PLSisl(or PCR) withXandY. The resulting predictor of the analyte level associated with thej’ spectrum of the i kobject (denoted by yy) can be expressed as(3)The set of coefficientscoefficients.{bk}k.1,. are sometimes referred to as final regressionNote that Equation 3 can be expanded (via Equation 2) as9 j bo bk”(pk aik sgkk l4) (4)

rIn the usual strategy for developing a global model, the inter-object spectral effects andintra-object spectral effects are not considered independently.That is, the model isdeveloped in view of the combined inter- and intra-object effects ( ai and Zv). As longas both the inter- and intra-object effects are amenable to modeling, this strategy can beeffective.On the other hand, if the inter-object spectral effects are for some reasondifficult to model (e.g., the calibration data do not sufficiently span the inter-objecteffects), then this approach may be ineffective.An alternative approach involves modeling the intra-object spectral effectsseparately and then adapting the model to a target object. In many cases, the intra-objecteffects are small relative to the inter-object effects and are distributed similarly acrossobjects. The latter condition is assumed here. In this approach the calibration data aremean-centered by object. That is Xm [Xlm, Xzm,., XIm ] and [ylm, y ,.,y ],Xpwhere [xzy , x;Y/w‘[J’’il-Yi7Yi2yi —. 1n] J dJi x”ZyJ.ni zj l1i, . xinim l [xil‘Yi ””” Yi.,‘zi xi2‘Xi .-- xini‘xi](5)(6)‘Yil?i’liThe mean-centeringoperation on the spectrajnlremoves the average spectrum @) and the object-specificspectral effect ( i) from theoriginal calibration spectra leaving only the specific effect of thej h spectrum of the itkobject (see Equation 1). That is(7)W’ and ’ will heretofore be referred to as generic calibration data.5

.Standard multivariatethe generic calibrationanalyteof interestcalibration techniques like PLS and PCR can be applied todata to develop a calibration model that relates variation of theto intra-objectoperations ensure that both spectraleffects.Note that the mean-centeringand c have a mean of zero. Thus it is unnecessarytoinclude an intercept in these modeling activities, resulting in a predictive model of theform,(8)This model is referred to as a generic model as it is applicable to intra-object spectralvariation from all objects. The model coefficients ({g )bl,,,., J are differentiatedfromthose in Equation 3. Because the generic model is developed with regard to intra-objectspectral variation only, these model coefficients will not generally be the same as thoseassociated with the global model ({b } l,,,., J. Note that the quality of the modeldepends on the level of the intra-object variation of the analyte of interest (i.e. thevariation in c).For maximum benefit, it is important to observe each object in thecalibration set at a relatively wide range of analyte levels.In order to be a viable predictor of the analyte of interest, the generic model mustbe adapted to each new target object. The global model (in Equation 4) does thisadaptation implicitly through the fitted model coefficients that are developed inrecognition of inter-obj ect spectral effects as well as the intra-object spectral effects. InEquation 4, bk “pk is a constant that does not depend on i orj.Furthermore,k la bk “ ik is an object-specificconstant.Thus, Equation 4 can be rewritten ask l6

jv di bk”& k.(9)k lCast in this formulation,it is clear that di bk “a ik provides an adaptation to the it k lobject through the interaction of the regression coefficients ({bk}) and the object-specificspectral effects (at). Hence the adaptation is linear with respect to the object-specificspectral effects.In the case of the generic model, the proposed adaptation takes a similar form.That is, for the P* prediction sample associated with the target object (where the Atarget object is not normally represented in the calibration set), we seek to find anappropriate value of c such that9 tp ct (lo)gk”&tpkk lis a good predictor of yp. Equivalently,we want to find a value offi such that9ip(11)‘ft gk”xtpkk lis a good predictor ofy@ given the spectral measurement, xp . A direct way toaccomplish this is to use one or more representativespectra and associated referencevalues from the tt target object. Let Xt [x ,x z,.,Xtii] and yt ,M2,.representative.Ytit]denote%spectra and associated reference values from the t fitarget object.Adaptation to the t target object is provided by(12)The predictor based onfi given in Equation 12 is constructed so that, over the nt7

.representativespectrzq the average prediction will equal the average of the associatedreference analyte values. It is important to emphasize that this method of adaptationdepends on an accurate reference value for each representativespectrum that is used toadapt. Inaccuracies in the reference analyte values will adversely affect the quality of theadaptation.An important benefit of the generic modeling approach is the potential for thedetection of outliers in prediction that is focused entirely on intra-object spectral effects,In the case of the developmentof global models, outlier detection metrics are likely to bebased heavily on inter-object effects and, therefore, not be sufficiently responsive tounusual intra-object effects. In the generic modeling approach, inter-object effects areancillary.While outlier detection metrics in the generic modeling approach can be veryeffective for identi inganomalous spectra, some care is required in their construction.In order to illustrate how outlier metrics can be constructed in the genericmodeling context, let dp J – mean(XJ represent the deviation of thep k predictionspectrum from the average of the associated nf adaptation spectra with all spectraassociated with the target object. Note that the elements of dp can be rewritten mdpk Gpk– mean(ajk),since the object-specificmean(XJ.(13)spectral effects are removed from Xp via the subtraction ofIn the sense that object-specificspectral effects have been removed, dpconforms to the modeling space given by .To continue, let the integrated unmodelledresidual of the p spectrum from the target object be defined by,Rp e@Tep,where ep is the unmodelled portion of dp (see e.g., p. 291 in Martens and Naesl).8(14)

.Normally, a prediction spectrum is deemed anomalous if its integrated unmodelledresidual is unusually large when compared to the distribution of similarly computedvalues from the calibration set. Here, however, due to the use of the adaptation spectra,n one should compare StP —.nt lR@ with the distribution of Rti’s that are derived fromntthe calibration set. The normalizing factor —nt lvariance ofdpknt lis inflated by the factor —is needed due to the fact that thewhen compared to the variance of kthatntis associated with the calibration set.Example – NoninvasiveAccurate noninvasive measurementinfrared spectroscopynumber of researchersof in vivo glucose levels in diabetics via near-and commerciallevels and the complexity of human tissue.Arnold et a19 discuss a number of the issues that confront those whousing multivariatepain and inconvenienceexamplecalibrationassociatedis to demonstratewith current monitoringthe feasibilityof couplingtechnologythe skin.that involvesThe purposeof thisa generic model with subject-specific adaptation to provide clinically uselid noninvasive measurements9formonitor their glucose levels without thethe glucose in blood obtained by pricking.with spectralThe ultimate objective of these activities is to provide a mechanismdiabetic subjects to accurately and convenientlymeasuringAentities have been heavily involved in this areaattempt to measure glucose noninvasivelymeasurements.of Glucosehas proved to be a very difficult task due to the relatively smallspectral effect of glucose at physiological(see e.g., Heise8).Measurementof glucose.

r.Researchers at Sandia National Laboratories, the University of New Mexico, andRio Grande MedicalTechnologies,associatedwith the noninvasivereflectancespectroscopy.Inc. have conductedmeasurementa number of clinical studiesof invivoglucose using near-infraredOne such study is discussed here.the commercial party involved in this collaboration,To protect the interests ofmany details of the study (which areimportant but unrelated to the focus of this paper) are not discussed here. Nevertheless,some of the relevant specifics concerning the study can be provided and are as follows.Calibrationdata were obtained from 18 diabetic subjects who were repeatedlymeasured over a span of 7 weeks.The intent of observing the subjects for such a longperiod of time was to develop calibration data that spanned significant levels of naturalintra-subj ect physiologicalsampling variation.the spectrometervariation (including but not limited to glucose variation) andIn addition, the study protocol involved the deliberate perturbationand its local environmentto induce instrumental/environmentalinto the generic calibration data. These perturbationswere carefillyexpected long-term operating conditions of the instrument.extremely important for developingofeffectsselected to span theActivities, such as these, arecalibration data that will facilitate valid predictionsinto the fi-dme7711.Spectral and reference data were acquired twice per week from most subjects.few subjectswere unable to keep all of their appointmentsreference data.During each appointment,to provide5 separate spectral measurementsspectralAandat differentspatial positions on the underside of the forearm were acquired over a 15-minute periodusing reflectancewere involved).samplingfrom 4200-7200In addition,two capillary10wavenumbersglucose(390 discretereferencewavelengthsmeasurementswere

obtained via blood draws from each subject during each data acquisitionblood draws were performed immediatelydata.The spectra,backgroundbasedcorrected.Thebefore and after the acquisition of the spectralon the logarithmTime-basedperiod.of the reflectedinterpolationintensities,were notwas used to assign an appropriatecapillaryglucose referencevalue to each spectrum.A total of 1161 spectra (otheracquiredspectra were deemed outliers and were discarded)and associatedreferenceglucose values comprise the calibration data.The total variation in the spectra within the calibration set is due to a combinationof inter- and intra-subjecteffects.In the case of this example,include those effects associated with the deliberate perturbationits local environment.of the calibrationvariance components analysisspectral variation obtained by this analysis.inter-subjectthe intra-subjectof theThe estimate of intra-subject spectral variation is an aggregate measure obtained across all subjects.spectral variation dominatesandinto intra- and inter-datal 1. Figure 1 displays estimates of the standard deviationsinter- and intra-subjectthe inter-subjecteffectsof the spectrometerThe total spectral variation was decomposedsubject spectral variation via a ral variation.effects were difficult to model in this particularexperimentnumber of subjects was modest) and in many other related experimentsThe(where thewhere the totalnumber of subj ects was much larger. However, the intra-subject effects were found to beconsistentin nature across subjects.Thus, a generic modelingapproach was deemedappropriate.In order to test the efficacycalibrationmodel was developedof the genericmodelingby using PCR (no intercept)11approach,a genericon the spectraland

.capillary glucose reference data that were mean-centeredThe resultinggeneric model coefficientsby subject (see Equation8).({g } l,.,., J are shown in Figure 2.In aqualitative sense, this model corresponds well with the relatively strong glucose bands inThe generic model was then adapted to twothe vicinity of 4300 cm-l and 4400 cm-l.additional diabetic subjects who were distinct from the 18 subjects whose data were usedto develop the generic calibration data/model.additionalsubjectsmeasurementsspannedof the originalmorethanThe period of observationsix18 subjects.months,beginningwithThus, the two additionalobserved for more than four months following the acquisitionfor these twothesubjectsinitialwereof the generic calibrationdata. As in the case of acquiring the calibration data, 5 separate spectral measurementsat-different spatial positions on the underside of the forearm were acquired over a 15-minuteperiod.In addition, capillary glucose reference measurementsof the two subjectsduringeach data acquisitionperiodwere acquired fi-om eachaccordingto the protocoldescribed earlier.During the first 7 weeks of observation and coinciding with the measurementsofthe original 18 subjects, the two additional subjects were observed twice per week

Multivariate calibration has received significant attention in analytical chemistry, particularly in spectroscopy. Martens and Naesl provide an excellent general reference on multivariate calibration. Examples of multivariate calibration in a spectroscopic context are associated w

Related Documents:

6.7.1 Multivariate projection 150 6.7.2 Validation scores 150 6.8 Exercise—detecting outliers (Troodos) 152 6.8.1 Purpose 152 6.8.2 Dataset 152 6.8.3 Analysis 153 6.8.4 Summary 156 6.9 Summary:PCAin practice 156 6.10 References 157 7. Multivariate calibration 158 7.1 Multivariate modelling (X, Y): the calibration stage 158 7.2 Multivariate .

Multivariate Calibration Quick Guide 3 You are now ready to setup the calibration model. Select the Soybean Oil project node in the Project explorer. Choose New Multivariate Calibration from the Quantify menu. The calibration wizard opens and guides you through

Multivariate calibration models are secondary analytical techniques in that they require primary analytical data for calibration. Thus, a robust multivariate calibration model requires a sufficient number of representative sampl

Bruksanvisning för bilstereo . Bruksanvisning for bilstereo . Instrukcja obsługi samochodowego odtwarzacza stereo . Operating Instructions for Car Stereo . 610-104 . SV . Bruksanvisning i original

selection; Multivariate calibration. INTRODUCTION Multivariate calibration models used to analyze spectroscop-ic data have the general form y ¼ Xb ð1Þ where X contains m calibration samples measured at n wavelengths, b is an n 3 1 model vector, and y is an m 3 1 vector containing the analy

Quantify – Multivariate Calibration Overview Calibration Model Development Steps Load calibration spectra Merge all spectra into one data view Create a project (and a folder) Add labels with concentrations (Label Editor) Start the calibration wizard Create and optimize a calibration

Calibration (from VIM3) Continued NOTE 1 A calibration may be expressed by a statement, calibration function, calibration diagram, calibration curve, or calibration table. In some cases, it may consist of an additive or multiplicative correction of the indication with associated measurement uncertainty. NOTE 2 Calibration should not be .

jogensis is the only recorded species of the genus by M.O.P. Iyengar in 1958. It shows the least indulgence of It shows the least indulgence of subsequent researchers in study of this group.