Multiple Linear Regression & AIC - University Of Alberta

1y ago

14 Views

2 Downloads

697.42 KB

17 Pages

Last View : 30d ago

Last Download : 3m ago

Upload by : Randy Pettway

Report this link

Download PDF

Transcription

Multiple Linear Regression & AIC “I've come loaded with statistics, for I've noticed that a man can't prove anything without statistics. No man can.” Mark Twain (Humorist & Writer)

Linear Regression Linear relationships Regression Analysis PART 1: find a relationship between response variable (Y) and a predictor variable (X) (e.g. Y X) response (y) PART 2: use relationship to predict Y from X m Equation of a line: 𝑦 𝑚𝑥 𝑏 𝑚 slope of the line 𝑏 𝑦-intercept b 𝑅𝐼𝑆𝐸 𝑅𝑈𝑁 Simple Linear Regression in R: predictor (x) lm(response predictor) summary(lm(response predictor))

Multiple Linear Regression Linear relationship developed from more than 1 predictor variable Simple linear regression: y b m*x y β0 β1 * x1 Multiple linear regression: y β0 β1*x1 β2*x2 βn*xn βi is a parameter estimate used to generate the linear curve Simple linear model: β1 is the slope of the line Multiple linear model: β1 , β2, etc. work together to generate a linear curve β0 is the y-intercept (both cases)

Multiple Linear Regression Relating model back to data table Response variable (Y) ID DBH VOL AGE DENSITY 1 11.5 1.09 23 0.55 2 5.5 0.52 24 0.74 3 11.0 1.05 27 0.56 4 7.6 0.71 23 0.71 5 10.0 0.95 22 0.63 6 8.4 0.78 29 0.63 Multiple linear regression: y β0 β1*x1 β2*x2 DENSITY Intercept β1*AGE β2*VOL β1, β2 : What I need to multiply AGE and VOL by (respectively) to get the value in DENSITY (predicted) Remember the difference between the observed and predicted DENSITY are our regression residuals Smaller residuals Better Model Predictor variable 1 (x1) Predictor variable 2 (x2)

Multiple Linear Regression Output from R Estimate of model parameters (βi values) Standard error of estimates Coefficient of determination a.k.a “Goodness of fit” Measure of how close the data are to the fitted regression line R2 and Adjusted R2 The significance of the overall relationship described by the model Tests the null hypothesis that the coefficient is equal to zero (no effect) A predictor that has a low p-value is likely to be a meaningful addition to your model because changes in the predictor's value are related to changes in the response variable A large p-value suggests that changes in the predictor are not associated with changes in the response

Multiple Linear Regression Adjusted R-squared Test statistic: 𝑅2 𝑛 𝑖 1 𝑛 𝑖 1 𝑦𝑖 𝑦 𝑦𝑖 𝑦 2 2 𝑆𝑆𝑟𝑒𝑔𝑟𝑒𝑠𝑠𝑖𝑜𝑛 𝑅 𝑆𝑆𝑡𝑜𝑡𝑎𝑙 2 𝑅 2 𝑎𝑑𝑗 𝑅 2 1 𝑅 2 𝑝 𝑛 𝑝 1 𝑝 number of predictor variables (regressors, not including intercept) 𝑛 sample size Adjusted 𝑅2 is always positive Ranges from 0 to 1 with values closer to 1 indicating a stronger relationship Adjusted 𝑅2 is the value of 𝑅2 which has been penalized for the number of variables added to the model Therefore Adjusted 𝑅2 is always smaller than 𝑅2

Multiple Linear Regression Adjusted R-squared Why do we have to Adjust 𝑅 2 ? For multiple linear regression there are 2 problems: Problem 1: Every time you add a predictor to a model, the R-squared increases, even if due to chance alone. It never decreases. Consequently, a model with more terms may appear to have a better fit simply because it has more terms. Problem 2: If a model has too many predictors and higher order polynomials, it begins to model the random noise in the data. This condition is known as over-fitting the model and it produces misleadingly high R-squared values and a lessened ability to make predictions. Therefore for Multiple Linear Regression you need to report the Adjusted 𝑅2 which accounts for the number of predictors you had to added

Akaike’s Information Criterion (AIC) How do we decide what variable to include? In the 1970s he used information theory to build a numerical equivalent of Occam's razor Occam’s razor: All else being equal, the simplest explanation is the best one Hirotugu Akaike, 1927-2009 In statistics, this means a model with fewer parameters is to be preferred to one with more Of course, this needs to be weighed against the ability of the model to actually predict anything AIC considers both the fit of the model and the number of parameters used – More parameters result in a penalty

Akaike’s Information Criterion (AIC) How do we decide what variable to include? The model fit (AIC value) is measured ask likelihood of the parameters being correct for the population based on the observed sample The number of parameters is derived from the degrees of freedom that are left AIC value roughly equals the number of parameters minus the likelihood of the overall model – Therefore the smaller the AIC value the better the model Allows us to balance over- and under-fitting in our modelled relationships – We want a model that is as simple as possible, but no simpler – A reasonable amount of explanatory power is traded off against model size – AIC measures the balance of this for us

Akaike’s Information Criterion (AIC) AIC in R Stepwise model comparison is an iterative model evaluation that will either: 1. Starts with a single variable, then adds variables one at a time (“forward”) 2. Starts with all variables, iteratively removing those of low importance (“backward”) 3. Run in both directions (“both”) The order of the variables matters – therefore it is best to run the stepwise model comparison in all directions and compare AIC values Akaike’s Information Criterion in R to determine predictors: step(lm(response predictor1 predictor2 predictor3), direction "backward") step(lm(response predictor1 predictor2 predictor3), direction "forward") step(lm(response predictor1 predictor2 predictor3), direction "both")

Akaike’s Information Criterion (AIC) AIC Output from stepwise procedure AIC value for full model (starting point) Backward Selection If I remove VOL What happens to my AIC? ( β0 AGE DBH) Now remove DBH What happens to my AIC? ( β0 AGE) Now remove AGE What happens to my AIC? ( β0 ) Best model (lowest AIC) is when VOL is removed Test this model again Forward Selection Start with DENSITY β0 AGE DBH What is the best model to use? (What has the lowest AIC?) What are the parameter estimates of this model? If I remove AGE What happens to my AIC? ( β0 DBH) If I add VOL What happens to my AIC? ( β0 DBH VOL) If I remove DBH What happens to my AIC? ( β0 VOL)

Multiple Linear Regression Assumptions 1. For any given value of X, the distribution of Y must be normal BUT Y does not have to be normally distributed as a whole 2. For any given value of X, of Y must have equal variances You can again check this by using the Shaprio Test, Bartlett Test, and residual plots on the residuals of your model What we have all ready been doing! No assumptions for X – but be conscious of your data

Collinearity a.k.a Multicollinearity Problem with predictors Occurs when predictor variables are related (linked) to one another Does not reduce the predictive power or reliability of the model as a whole Meaning that one predictor can be linearly predicted from the others with a non-trivial degree of accuracy E.g. Climate (mean summer precipitation and heat moisture index) BUT the coefficient estimates may change erratically in response to small changes in the model or the data A model with correlated predictors CAN indicate how well the combination of predictors predicts the outcome variable BUT it may not give valid results about any individual predictor, or about which predictors are redundant with respect to others

Collinearity a.k.a Multicollinearity What to do If you suspect your predictor variables are correlated you can calculate a matrix of correlation values between your variables to confirm Here VOL and DBH are highly correlated But it is up to you to judge what might be a problematic relationship

Collinearity a.k.a Multicollinearity What to do Whether or not you choose to use Multiple Regression Models depends on the question you want to answer Are you interested in establishing a relationship? Are you interested in which predictors are driving that relationship? There are alternative techniques that can deal with highly correlated variables – these are mostly multivariate - Regression Trees can handle correlated data well

Important to Remember A multiple linear relationship DOES NOT imply causation! Adjusted 𝑅2 implies a relationship rather than one or multiple factors causing another factor value Be careful of your interpretations!

Related Documents:

Sound Waves Practice Problems PSI AP Physics 1 Name ...

PSI AP Physics 1 Name_ Multiple Choice 1. Two&sound&sources&S 1∧&S p;Hz&and250&Hz.&Whenwe& esult&is:& (A) great&&&&&(C)&The&same&&&&&

383 Views

3y ago

Introduction to Regression Procedures

independent variables. Many other procedures can also ﬁt regression models, but they focus on more specialized forms of regression, such as robust regression, generalized linear regression, nonlinear regression, nonparametric regression, quantile regression, regression modeling of survey data, regression modeling of

160 Views

2y ago

Elenco Libri della Biblioteca dei ragazzi 2012-13

Argilla Almond&David Arrivederci&ragazzi Malle&L. Artemis&Fowl ColferD. Ascoltail&mio&cuore Pitzorno&B. ASSASSINATION Sgardoli&G. Auschwitzero&il&numero&220545 AveyD. di&mare Salgari&E. Avventurain&Egitto Pederiali&G. Avventure&di&storie AA.&VV. Baby&sitter&blues Murail&Marie]Aude Bambini&di&farina FineAnna

218 Views

3y ago

Lecture 14 Multiple Linear Regression and Logistic Regression

LINEAR REGRESSION 12-2.1 Test for Significance of Regression 12-2.2 Tests on Individual Regression Coefficients and Subsets of Coefficients 12-3 CONFIDENCE INTERVALS IN MULTIPLE LINEAR REGRESSION 12-3.1 Confidence Intervals on Individual Regression Coefficients 12-3.2 Confidence Interval

92 Views

2y ago

LINEAR REGRESSION - York University

Probability & Bayesian Inference CSE 4404/5327 Introduction to Machine Learning and Pattern Recognition J. Elder 3 Linear Regression Topics What is linear regression? Example: polynomial curve fitting Other basis families Solving linear regression problems Regularized regression Multiple linear regression

18 Views

1y ago

Taico&Incentive&Services&Inc.&&&&&&&&&&&&&&&&&&&&845&228&4438 ...

The program, which was designed to push sales of Goodyear Aquatred tires, was targeted at sales associates and managers at 900 company-owned stores and service centers, which were divided into two equal groups of nearly identical performance. For every 12 tires they sold, one group received cash rewards and the other received

69 Views

10m ago

CHAPTER 6:UNIFORMCIRCULARM OTION ANDGRAVITATION

College"Physics" Student"Solutions"Manual" Chapter"6" " 50" " 728 rev s 728 rpm 1 min 60 s 2 rad 1 rev 76.2 rad s 1 rev 2 rad , π ω π " 6.2 CENTRIPETAL ACCELERATION 18." Verify&that ntrifuge&is&about 0.50&km/s,∧&Earth&in&its& orbit is&about p;linear&speed&of&a .

187 Views

3y ago

Asset management in Italy: a snapshot in an evolutive context

asset management industry, that in the future will need to move these resources within its boundaries. handling compliance some Regulatory challenges In the past few years, regulatory compliance has constantly been at the top of asset manager’s agenda. Currently, the most debated regulation is the upcoming Market in Financial Instruments Directive (MiFID II), as it covers many areas of the .

54 Views

3y ago

Recent Views

Chapter 15 Rooming Houses - MassLegalHelp

Individual renters usually have their own separate room and their own agreement with the landlord. For example, you may stay for just a few days, but another renter may stay for 3 months. Rooming houses with 4 or more renters at the same time must be licensed. Some cities and towns have local protections for renters in rooming houses. Rooming House

2y ago

356 Views

Americans rent, buy, sell and think about home.

median rent among Generation X is 1,062 per month. The youngest renters, Generation Z, are typically paying the least at 882 per month.9 This echoes the notion that Generation Z renters are opting to rent the smallest apartments or homes, which translates to lower monthly rental payments. Approximately half of renters (47 percent) are paying for

1y ago

174 Views

Disaster assistance process overview

A guide through the post-disaster recovery process. KEY ASSISTANCE SOURCES TIPS HOMEOWNERS/RENTERS INSURANCE If you have homeowners or renters insurance, this provides you funds to repair or replace property damaged as a result of covered perils during a disaster. Additional types of insurance, such as auto or other peril-specific

1y ago

109 Views

Personal insurance - Car & Business insurance King Price Insurance

The king's insurance options 5 Things you need to know 7 The stuff you need to do 14 How to claim 16 Our commitment to you 20 Car insurance 22 Car warranty 37 Shortfall cover 45 Scratch and dent 46 Tyre and rim 48 Motorbike insurance 53 Trailer and caravan insurance 64 Watercraft insurance 68 Home contents insurance 77 Buildings insurance 89

1y ago

673 Views

Texas Demographic Trends and Projections and the 2020 Census

Income disparities place African Americans and Latinos at greater risk during times of income loss. Renters, renters w/low incomes, Blacks, and households w/children face greater risk of eviction. Persistently low health insurance coverage in the state increases vulnerability of Texans with employer based insurance.

1y ago

137 Views

Gold Tier - MAPFRE Insurance

Foy Insurance of MA, LLC 198 Frank Consolati Insurance Agency, Inc. 198 County Insurance Agency, Inc. 198 Woodrow W Cross Agency 214 Woodland Insurance Agency, Inc. 214 Tegeler Insurance Services of CT, Inc. 214 Pantano/VonKahle Insurance Agency, Inc. 214 . Hanson Insurance Agency, Inc. 287 J.H. Slattery Insurance Agency, Inc. 287

1y ago

565 Views

Texas - milestonepnc

State Auto - Homeowners TEXAS 05/2017 State Auto Insurance Company UG-1.0 I - UNDERWRITING GUIDELINES A. Entire State Eligibility Guidelines Premier Protection Plus Standard Available Forms HO0004 - Renters HO0005 - Homeowner Expanded HO0006 - Condominium HO0003 - Homeowner HO0004 - Renters HO0005 - Homeowner Expanded

1y ago

112 Views

Consumer Guide to Auto Insurance - csimt.gov

consumer guide to auto insurance contents introduction to auto insurance 1 understanding your auto insurance policy 2 required auto insurance 3 optional types of auto insurance 4-5 getting the right coverage 6 accidents and violations 7 how to shop for auto insurance 8 shopping tips 9 frequently asked questions 10-11 insurance complaints/when you have a problem 12

2y ago

805 Views

Industry Observations Insurance Industry

Jun 30, 2019 · 6/17/2019 Commercial Insurance Branch of Extraco Banks, N.A. Higginbotham Insurance Group, Inc. Insurance Brokers NA 6/13/2019 Links Insurance Services, LLC World Insurance Associates LLC Property and Casualty Insurance NA 6/13/2019 Abram Interstate Insurance Services, Inc. Risk Placement Services,

2y ago

619 Views

Life Insurance Buyer's Guide Life Insurance - National Association of .

Life Insurance uers uide Naional ssociaion of Insurance Commissioners Compare the Different Types of Insurance Policies There are many types of life insurance pol-icies. You should choose a policy with fea-tures that fit your individual needs. Some things to consider are: Term Insurance vs. Cash Value In-surance. Term insurance is intended to

1y ago

520 Views

your guide to understanding auto ins in nh - New Hampshire

Hampshire Insurance Department does not mandate or set Auto Insurance Rates. Auto Insurance Rates will vary by insurance company. This guide is intended to give New Hampshire consumers basic information on auto insurance. It suggests ways to: Lower the cost of your auto insurance, shop for Auto insurance and, file an auto insurance claim.

1y ago

449 Views

18.01.41 - REPLACEMENT OF LIFE INSURANCE AND ANNUITIES - Idaho

Department of Insurance Replacement of Life Insurance and Annuities. Page 3. 04. Existing Life Insurance or Annuity. "Existing Life Insurance or Annuity" means any life insurance or annuity in force, including life insurance under a binding or conditional receipt or a lif e insurance policy or annuity that is within an unconditional refund period.

1y ago

407 Views

EXAMINATION REPORT OF THE ADMIRAL INSURANCE COMPANY AS OF . - Delaware

Berkley Regional Specialty Insurance Comp 31295 DE Carolina Casualty Insurance Company 10510 IA Clermont Insurance Company 33480 IA Continental Western Insurance Company 10804 IA Firemen's Insurance Com pany of Wash, D.C. 21784 DE Gemini Insurance Company 10833 DE Great Divide Insurance Company 25224 ND

1y ago

258 Views

American International Group, Inc. - Federal Reserve

American General Life Insurance Company AGL U.S. Life Insurance Company AGC Life Insurance Company AGC Life U.S. Life Insurance Company The United States Life Insurance Company in the City of New York U.S. Life U.S. Life Insurance Company The Variable Annuity Life Insurance Company VALIC U.S. Life Insurance Company

1y ago

269 Views

Japan's Insurance Market - Toa Re

with 61.6% of net premiums written, of which automobile insurance totaled 48.8% and compulsory automobile liability insurance totaled 12.8%. Fire insurance accounted for 13.7%, miscellaneous casualty insurance including liability insurance accounted for 11.6%, accident insurance accounted for 9.8%, and marine insurance accounted for 3.2%.

1y ago

179 Views

Multiple Linear Regression & AIC - University Of Alberta

It looks like you're using an ad-blocker