Partial Least Squares (PLS): Path Modeling - Uni-freiburg.de

11m ago
2 Views
1 Downloads
546.14 KB
31 Pages
Last View : 1m ago
Last Download : 3m ago
Upload by : Tia Newell
Transcription

Partial Least Squares (PLS): Path Modeling Method Talk Winter Term 2015/16 Pascal Stichler

Outline 1 Introduction to PLS 2 Putting PLS in Context 3 Model Definition 4 Solution Algorithm 5 Model Evaluation 6 Wrap-Up PLS 2

Today’s Lecture Objectives PLS 1 Evaluate when to use PLS 2 Learn how PLS works and how to use it 3 Investigate how to evaluate a PLS model, interpret the results and adjust the model accordingly 3

Outline 1 Introduction to PLS 2 Putting PLS in Context 3 Model Definition 4 Solution Algorithm 5 Model Evaluation 6 Wrap-Up PLS: Introduction to PLS 4

PLS: A silver bullet? Partial Least Squares Path Modeling is a statistical data analysis methodology that exists at the intersection of Regression Models, Structural Equation Models, and Multiple Table Analysis methods [9] Goal: Use theoretical knowledge about structure of latent variables to predict indicators based on data I Doing so with least possible distribution assumptions I PLS-PM is known under several names: PLS-PM, PLS-SEM, component-based structural equation modeling, projection to latent structures, soft modeling etc. I Developed by Herman Wold in the mid 1960s under the term of "soft modeling" [14] I After initial introduction and discussions it received little attention until the late 1990s, however since then sharply rising interest PLS: Introduction to PLS 5

Why use PLS? PLS-PM is worth considering when . Structural model I . . . you have a theoretical model that involves latent variables I . . . the phenomenon you investigate is relatively new and measurement models need to be newly developed I . . . the structural equation model is complex with a large number of latent variables and indicator variables [12] Observed variables I . . . you have small sample sets (e. g. more variables than observations) [7] I . . . you have non-normal distributed data I . . . you have multicollinearity problems I . . . you have formative and reflective measures (to be discussed) I . . . you need minimum requirements regarding measurement scales (e. g. ratio and nominal variables) I . . . you need minimum requirements regarding residuals distribution [1] PLS: Introduction to PLS 6

Outline 1 Introduction to PLS 2 Putting PLS in Context 3 Model Definition 4 Solution Algorithm 5 Model Evaluation 6 Wrap-Up PLS: Putting PLS in Context 7

General Overview Types of PLS: I PLS-Path Modeling: Component-based modeling based on theoretical structure model Mainly used in: social sciences, econometrics, marketing and strategic management I PLS-Regression: Regression based approach investigating the linear relationship between multiple independent variables and dependent variable(s) Mainly used in: chemometrics, bioinformatics, sensometrics, neuroscience and anthropology I OPLS: Orthognal projection improves interpretability I PLS-DA: Used when Xr is categorial I CB-SEM: Covariance-based structural equation modelling PLS: Putting PLS in Context LVp LVr Xp Xr I Predictors Xp X I Responses Xr X with Xp Xr I Exogenous latent variables LVp LV I Endogenous latent variables LVr LV with LVp LVr 8

PLS-PM vs. CB-SEM Both methods differ from statistical point of view. Hence, neither of the techniques is generally superior to the other and neither of them is appropriate for all situations. In general, the strenghts of PLS-SEM are CB-SEM’s weaknesses, and visa versa. [3] PLS-PM (PLS-SEM) Variance-based I The goal is prediction and theory development I Formatively measured constructs are part of the structural model CB-SEM Covariance-based I The goal is theory testing, theory confirmation, or the comparison of alternative theories I The structural model is complex I Error terms require additional specification, such as the covariation I The sample size is small and/or the data are non-normally distributed I The structural model has non-recursive relationships I The plan is to use latent variable scores in subsequent analyses I The research requires a global goodness-of-fit criterion I Available Software: SmartPLS, PLSGraph, R packages (plspm) etc. I Available Software: LISREL, AMOS, EQS etc. Based on [8], [4], [11] PLS: Putting PLS in Context 9

Theory Testing Comparison CB-SEM PLS-PM Prediction PLS: Putting PLS in Context 10

Outline 1 Introduction to PLS 2 Putting PLS in Context 3 Model Definition 4 Solution Algorithm 5 Model Evaluation 6 Wrap-Up PLS: Model Definition 11

Exemplary Model Measurement Model/Outer Model Measurement Model/Outer Model X11 X12 LV1 X31 X13 LV3 X21 X22 X32 X33 LV2 X23 Structural Model/Inner Model Formal definition: I X data set with n observations and m variables I X can be divided into J exclusive blocks with K variables each X1,1 . . . XJ ,K etc. cj Yj I Each block Xj associated with LVj ; estimation of variable ("score") denoted by LV I LV1 and LV3: reflective blocks; LV2: formative block [9] PLS: Model Definition 12

Structural Model (Inner Model) 1 Linear Relationship All relationships are considered linear relationships and can be noted as LVj β0 βji LVi εj i j The coefficients βji represent the path coefficients 2 Recursive Model mandatory Causality flow must be unidirectional (no loops) 3 Regression Specification (Predictor Specification) E (LVj LVi ) β0i βji LVi i j Specifying that the regression has to be linear under the assumption that cov(LVj , εj ) 0 and εj 0 PLS: Model Definition 13

Measurement Model (Outer Model) Reflective Indicators X11 X12 X13 X21 λ11 λ12 LV1 λ13 I Linear relationships: Xjk λ0jk λjk LVj εjk (λjk is called loading) I Regression Specification: E (Xjk LVj ) λ0jk λjk LVj I Characteristics: - Unidimensional - Correlated - Xjk "fully relevant" PLS: Model Definition Formative Indicators X22 X23 X31 λ21 λ22 MIMIC* LV2 λ23 X32 X33 λ31 λ32 LV3 λ33 LVj λ0j λjk Xjk εj equivalent to reflective and formative (depending on indicator) E (LVj Xjk ) λ0j λjk Xjk equivalent to reflective and formative (depending on indicator) - Multidimensional - Uncorrelated - Xjk "partly relevant" In R package plspm not possible *multiple effect indicators for multiple causes 14

Weight Relations (Scores) I The latent variables are only virtual entities I However, as all linear relations depend on the latent variables, they need a representation: Weight Relations cj Yj Score: LV wjk Xjk k I The score, as a representation of the latent variable, is calculated as the sum of its indicators (similar to the approach in principal component analysis) I Because of this PLS is called a component-based approach PLS: Model Definition 15

Outline 1 Introduction to PLS 2 Putting PLS in Context 3 Model Definition 4 Solution Algorithm 5 Model Evaluation 6 Wrap-Up PLS: Solution Algorithm 16

PLS-PM Algorithm Overview 1 Stage: Get the weights to compute latent variable scores Most important and most difficult 2 Stage: Estimate the path coefficients (inner model) Usually done via OLS 3 Stage: Obtain the loadings (outer model) Calculation of correlations PLS: Solution Algorithm 17

Stage 1: Latent Variable Scores Start: initial arbitrary outer weights (e.g. wjk 1) Check for convergence of outer weights Mode A: Simple regression Mode B: Multiple regression (Mode C:) Combination PLS: Solution Algorithm Step 1: Compute the external approximation of latent variables Yj wjk Xjk k Step 4: Step 2: Calculate new outer weights wjk Obtain inner weights eij Inner weighting schemes: Centroid scheme Factor scheme Path scheme Step 3: Compute the internal approximation of latent variables Zj i j eijYi 18

Stage 2 & 3 2. Stage: Path Coefficients The path coefficient estimates βbji Bji are calculated usually using ordinary least squares in the multiple regression of Yi on the Yj ’s related with it Yj βbji Yi i j In case high multicollinearity occurs PLS regression can also be applied [11] 3. Stage: Loadings For convenience and simplicity reasons, loadings are preferably calculated as correlations between a latent variable and its indicators: λc jk cor (Xjk , Yj ) PLS: Solution Algorithm 19

PLS-PM usage in R (package plspm) Parameters to define the PLS Path Model Data path matrix blocks scaling modes Data for the model Definition of inner model List definitng the blocks of variables of the outer model List defining the measurment scale of variables for non-metric data Vector defining the measuremnt mode of each block Parameters related to the PLS-PM algorithm scheme scaled tol maxiter plscomp Inner path weighting scheme Indicates whether the data should be standardized Tolerance threshold for checking convergence of the iterative stages maximum number of iterations Indicates the number of PLS components when handling non-metric data Additional parameters boot.val br dataset PLS: Solution Algorithm Indicates whether bootstrap validation must be performed Number of bootstrap resamples Indicates whether the data matrix should be retrieved 20

Outline 1 Introduction to PLS 2 Putting PLS in Context 3 Model Definition 4 Solution Algorithm 5 Model Evaluation 6 Wrap-Up PLS: Model Evaluation 21

Interpreting the Results In PLS the real challenge is interpreting the results and making well-founded adjustments the model [9], p. 54 Steps of Model Assessment: 1 Assessment Measurement Model (Outer Model) 2 Assessment Structural Model (Inner Model) (It is important to keep this order due to model dependencies) PLS: Model Evaluation 22

1. Measurement Model Assessment (Outer Model) I Formative Blocks: Evaluation relatively straightforward I Reflective Blocks: Evaluation rather complex Formative Blocks: Variables are considered as causing the latent variable I They do not necessarily measure the same underlying construct I Not supposed to be correlated I Compare outer weights to check which indicator contributes most efficiently I Elimination of variables should Test theory applied Reflective Blocks: Variables are considered as measuring the same underlying construct I Hence they need a strong mutual association I Further they should be strongly related to its latent variable 1 Unidimensionality of indicators 2 Indicators well explained 3 Constructs differ from each other be based on multicollinearity PLS: Model Evaluation 23

Deep Dive: Reflective Indicators 1 Unidimensionality of indicators: All for one and one for all (a) Cronbach’s alpha Measures the average inter-variable correlation (considered good if 0.7) (b) Dillon-Goldstein’s rho Focus on the variance of the sum of variables (considered a better indicator than Cronbach’s alpha ([1], p.320) (considered good if 0.7) (see [11], [13] p. 50 for formal definition) (c) First eigenvalue First eigenvalue of correlation matrix should be larger than one and second one significantly smaller (preferably smaller than 1) 2 Loadings & Communalities: Indicators well explained I I Loadings are considered for each indicator (considered good if 0.7) Communalities (squared loadings): amount of indicator variance explained by its corresponding LV PLS: Model Evaluation 24

Deep Dive: Reflective Indicators 3 Cross-loadings: Constructs differ from each other cross-loadings ˆ loadings of an indicator with the rest of the latent variables Goal: Ensure that shared variance between construct and its indicators is higher than for other constructs (no "traitor" indicators) Loadings should always be highest for the respective block [. . . ] crossloadings PLS: Model Evaluation 25

2. Structural Model Assessment (Inner Model) Standard OLS regression output: 3 further indicators of model quality: I R 2 determination coeffcient: Amount of variance of endogenous LVs explained by its independent LVs (considered low below 0.3 and high above 0.6) I Redundancy Index: Amount of variance in the endogenous block that explained by 2 2 its independent LVs (defined as Rd (LVj , xjk ) loadingjk Rj ) I Goodness-of-Fit (GoF): No single criterion exists for overall quality of a model. GoF as a pseudo criterion: q GoF communality R 2 (considered good if 0.7) [10] [11] I Validation: Resampling (bootstrapping, jackknifing) possible; more traditional approaches are not (as there are no assumptions made on the distribution) PLS: Model Evaluation 26

Outline 1 Introduction to PLS 2 Putting PLS in Context 3 Model Definition 4 Solution Algorithm 5 Model Evaluation 6 Wrap-Up PLS: Wrap-Up 27

Summary: PLS Advisable for the following conditions (based on [8]) Focus Distribution Sample size Prediction and theory development Minimum assumptions made regarding indicator distribution Small sample size possible (however questioned in literature [2], [6], [5]) Model definition Indicators Measurement Model Structural Model Define blocks of variables and respective latent variables Define relations (formative/reflective) Define internal model Interpreting the results Measurement Model (formative) Measurement Model (reflective) Structural Model Validation PLS: Wrap-Up Eliminate multicollinearity Unidimensionality, loadings & communalities and cross-loadings Consider R 2 , redundancy index and GoF Apply resampling (bootstrapping, jackknifing) 28

Bibliography I W. W. C HIN. The partial least squares approach to structural equation modeling. In: Modern methods for business research, Vol. 295, No. 2 (1998), pp. 295–336. D. G OODHUE, W. L EWIS, and R. T HOMPSON. PLS, small sample size, and statistical power in MIS research. In: System Sciences, 2006. HICSS’06. Proceedings of the 39th Annual Hawaii International Conference on. Vol. 8. IEEE. 2006, 202b–202b. J. F. H AIR J R et al. A primer on partial least squares structural equation modeling (PLS-SEM). Sage Publications, 2013. J. F. H AIR, C. M. R INGLE, and M. S ARSTEDT. PLS-SEM: Indeed a silver bullet. In: Journal of Marketing Theory and Practice, Vol. 19, No. 2 (2011), pp. 139–152. G. A. M ARCOULIDES, W. W. C HIN, and C. S AUNDERS. A critical look at partial least squares modeling. In: Mis Quarterly (2009), pp. 171–175. PLS: Wrap-Up 29

Bibliography II G. A. M ARCOULIDES and C. S AUNDERS. Editor’s comments: PLS: a silver bullet? In: MIS quarterly, Vol. 30, No. 2 (2006), pp. iii–ix. B.-H. M EVIK and R. W EHRENS. The pls package: principal component and partial least squares regression in R. In: Journal of Statistical Software, Vol. 18, No. 2 (2007), pp. 1–24. W. R EINARTZ, M. H AENLEIN, and J. H ENSELER. An empirical comparison of the efficacy of covariance-based and variance-based SEM. In: International Journal of research in Marketing, Vol. 26, No. 4 (2009), pp. 332–344. G. S ANCHEZ. PLS path modeling with R. In: Online, January (2013). M. T ENENHAUS, S. A MATO, and V E SPOSITO V INZI. A global goodness-of-fit index for PLS structural equation modelling. In: Proceedings of the XLII SIS scientific meeting. Vol. 1. CLEUP Padova. 2004, pp. 739–742. PLS: Wrap-Up 30

Bibliography III M. T ENENHAUS et al. PLS path modeling. In: Computational statistics & data analysis, Vol. 48, No. 1 (2005), pp. 159–205. N. U RBACH and F. A HLEMANN. Structural equation modeling in information systems research using partial least squares. In: Journal of Information Technology Theory and Application, Vol. 11, No. 2 (2010), pp. 5–40. V. E. V INZI, L. T RINCHERA, and S. A MATO. PLS path modeling: from foundations to recent developments and open issues for model assessment and improvement. In: Handbook of partial least squares. Springer, 2010, pp. 47–82. H. W OLD et al. Estimation of principal components and related models by iterative least squares. In: Multivariate analysis, Vol. 1 (1966), pp. 391–420. PLS: Wrap-Up 31

I PLS-PM is known under several names: PLS-PM, PLS-SEM, component-based structural equation modeling, projection to latent structures, soft modeling etc. I Developed by Herman Wold in the mid 1960s under the term of "soft modeling" [14] I After initial introduction and discussions it received little attention until

Related Documents:

Finley Engineering Vantagepoint Training: FEC Equipment Hub Training, Last Update: 1/12/2021 Page 5 . PLS CADD Key PLS CADD Key Date PLS CADD Lite Key PLS CADD Lite Key Date PLS Pole Key PLS Pole Key Date PLS Super Stick Key PLS Super Stick Key Date PLS

PLS CADD II This module will enable the user to maximize the capabilities of PLS CADD. Use of PLS CADD / Lite and the link between PLS Pole /Tower Using method 4 structures (Finite element modelling) Overview of line optimisation process Optimum spotting Management of preferences, backup

PLS CADD, PLS Tower, PLS Pole. COURSE DESCRIPTION AND LEARNING OBJECTIVES This 5 day course is aimed at teaching line designers how to use the industry standard line design programmes . All Delegates are required to bring their own laptop to the training session. Day 2 : PLS Tower Design of lattice transmission tower

The PROFESSIONAL STANDARD. 2 Our Edge PLS manufactures the highest quality layout tools for the professional contractor. We are dedicated to designing rugged and accurate laser tools for a lifetime of increased . Applications PLS 3 PLS 5 PLS 180 PLS 360 FT90 PLS 6 PLS 480 HVL 100 H2 HV2

PLS CADD, PLS Tower, PLS Pole a and joint venture company. COURSE DESCRIPTION AND LEARNING OBJECTIVES . the commencement of the training course and receive a full refund. General. Title: External_PLS-CADD-Course-V6.pdf Author: Nova Tiger CEO Created Date: 12/13/2018 2:41:06 PM .

A Simple Explanation of Partial Least Squares Kee Siong Ng April 27, 2013 1 Introduction Partial Least Squares (PLS) is a widely used technique in chemometrics, especially in the case where the number of independent variables is signi cantly larger than the number of data points.File Size: 214KB

Linear Least Squares ! Linear least squares attempts to find a least squares solution for an overdetermined linear system (i.e. a linear system described by an m x n matrix A with more equations than parameters). ! Least squares minimizes the squared Eucliden norm of the residual ! For data fitting on m data points using a linear

AppSec-DC 2012 Hacking NET Applications: The Black Arts Jon McCoy www.DigitalBodyGuard.com