1 Theory Of Convex Functions - Princeton University

2y ago

15 Views

3 Downloads

1.39 MB

14 Pages

Last View : 23d ago

Last Download : 3m ago

Upload by : Aiyana Dorn

Report this link

Download PDF

Transcription

ORF 523Lecture 7Instructor: A.A. AhmadiScribe: G. HallSpring 2015, Princeton UniversityTuesday, March 1, 2016When in doubt on the accuracy of these notes, please cross check with the instructor’s notes,on aaa. princeton. edu/ orf523 . Any typos should be emailed to gh4@princeton.edu.In the previous couple of lectures, we’ve been focusing on the theory of convex sets. In thislecture, we shift our focus to the other important player in convex optimization, namely,convex functions. Here are some of the topics that we will touch upon: Convex, concave, strictly convex, and strongly convex functions First and second order characterizations of convex functions Optimality conditions for convex problems11.1Theory of convex functionsDefinitionLet’s first recall the definition of a convex function.Definition 1. A function f : Rn R is convex if its domain is a convex set and for all x, yin its domain, and all λ [0, 1], we havef (λx (1 λ)y) λf (x) (1 λ)f (y).Figure 1: An illustration of the definition of a convex function1

In words, this means that if we take any two points x, y, then f evaluated at any convexcombination of these two points should be no larger than the same convex combinationof f (x) and f (y). Geometrically, the line segment connecting (x, f (x)) to (y, f (y)) must sit above thegraph of f . If f is continuous, then to ensure convexity it is enough to check the definition withλ 21 (or any other fixed λ (0, 1)). This is similar to the notion of midpoint convexsets that we saw earlier. We say that f is concave if f is convex.1.2Examples of univariate convex functionsThis is a selection from [1]; see this reference for more examples. eax log(x) xa (defined on R ), a 1 or a 0 xa (defined on R ), 0 a 1 x a , a 1 x log(x) (defined on R )Can you formally verify that these functions are convex?1.3Strict and strong convexityDefinition 2. A function f : Rn R is Strictly convex if x, y, x 6 y, λ (0, 1)f (λx (1 λ)y) λf (x) (1 λ)f (y). Strongly convex, if α 0 such that f (x) α x 2 is convex.2

Lemma 1. Strong convexity Strict convexity Convexity.(But the converse of neither implication is true.)Proof: The fact that strict convexity implies convexity is obvious.To see that strong convexity implies strict convexity, note that strong convexity of f impliesf (λx (1 λ)y) α λx (1 λ)y 2 λf (x) (1 λ)f (y) λα x 2 (1 λ)α y 2 .Butλα x 2 (1 λ)α y 2 α λx (1 λ)y 2 0, x, y, x 6 y, λ (0, 1),because x 2 is strictly convex (why?). The claim follows.To see that the converse statements are not true, observe that f (x) x is convex but notstrictly convex and f (x) x4 is strictly convex but not strongly convex (why?).1.4Examples of multivariate convex functions Affine functions: f (x) aT x b (for any a Rn , b R). They are convex, but notstrictly convex; they are also concave: λ [0, 1], f (λx (1 λ)y) aT (λx (1 λ)y) b λaT x (1 λ)aT y λb (1 λ)b λf (x) (1 λ)f (y).In fact, affine functions are the only functions that are both convex and concave. Some quadratic functions: f (x) xT Qx cT x d.– Convex if and only if Q 0.– Strictly convex if and only if Q 0.– Concave if and only if Q 0; strictly concave if and only if Q 0.– The proofs are easy if we use the second order characterization of convexity (coming up). Any norm: Recall that a norm is any function f that satisfies:a. f (αx) α f (x), α R3

b. f (x y) f (x) f (y)c. f (x) 0, x, f (x) 0 x 0Proof: λ [0, 1],f (λx (1 λ)y) f (λx) f ((1 λ)y) λf (x) (1 λ)f (y).where the inequality follows from triangle inequality and the equality follows from thehomogeneity property. (We did not even use the positivity property.) (a) An affine function(b) A quadratic function(c) The 1-normFigure 2: Examples of multivariate convex functions1.5Convexity convexity along all linesTheorem 1. A function f : Rn R is convex if and only if the function g : R R givenby g(t) f (x ty) is convex (as a univariate function) for all x in domain of f and ally Rn . (The domain of g here is all t for which x ty is in the domain of f .)Proof: This is straightforward from the definition. The theorem simplifies many basic proofs in convex analysis but it does not usuallymake verification of convexity that much easier as the condition needs to hold for alllines (and we have infinitely many). Many algorithms for convex optimization iteratively minimize the function over lines.The statement above ensures that each subproblem is also a convex optimization problem.4

2First and second order characterizations of convexfunctionsTheorem 2. Suppose f : Rn R is twice differentiable over an open domain. Then, thefollowing are equivalent:(i) f is convex.(ii) f (y) f (x) f (x)T (y x), for all x, y dom(f ).(iii) 2 f (x) 0, for all x dom(f ).Intepretation:Condition (ii): The first order Taylor expansion at any point is a global under estimator ofthe function.Condition (iii): The function f has nonnegative curvature everywhere. (In one dimensionf 00 (x) 0, x dom(f ).)Proof ([2],[1]):We prove (i) (ii) then (ii) (iii).5

(i) (ii) If f is convex, by definitionf (λy (1 λ)x) λf (y) (1 λ)f (x), λ [0, 1], x, y dom(f ).After rewriting, we havef (x λ(y x)) f (x) λ(f (y) f (x)) f (y) f (x) f (x λ(y x)) f (x), λ (0, 1]λAs λ 0, we getf (y) f (x) f T (x)(y x).(1)(ii) (i) Suppose (1) holds x, y dom(f ). Take any x, y dom(f ) and letz λx (1 λ)y.We havef (x) f (z) f T (z)(x z)(2)f (y) f (z) f T (z)(y z)(3)Multiplying (2) by λ, (3) by (1 λ) and adding, we getλf (x) (1 λ)f (y) f (z) f T (z)(λx (1 λ)y z) f (z) f (λx (1 λ)y). (ii) (iii) We prove both of these claims first in dimension 1 and then generalize.(ii) (iii) (n 1) Let x, y dom(f ), y x. We havef (y) f (x) f 0 (x)(y x)0and f (x) f (y) f (y)(x y)6(4)(5)

f 0 (x)(y x) f (y) f (x) f 0 (y)(y x)using (4) then (5). Dividing LHS and RHS by (y x)2 givesf 0 (y) f 0 (x) 0, x, y, x 6 y.y xAs we let y x, we getf 00 (x) 0, x dom(f ).(iii) (ii) (n 1) Suppose f 00 (x) 0, x dom(f ). By the mean value version of Taylor’stheorem we have1f (y) f (x) f 0 (x)(y x) f 00 (z)(y x)2 , for some z [x, y].20 f (y) f (x) f (x)(y x).Now to establish (ii) (iii) in general dimension, we recall that convexity is equivalentto convexity along all lines; i.e., f : Rn R is convex if g(α) f (x0 αv) is convex x0 dom(f ) and v Rn . We just proved this happens iffg 00 (α) v T 2 f (x0 αv)v 0, x0 dom(f ), v Rn and α s.t. x0 αv dom(f ). Hence, f is convex iff 2 f (x) 0for all x dom(f ). Corollary 1. Consider an unconstrained optimization problemmin f (x)s.t. x Rn ,where f is convex and differentiable. Then, any point x̄ that satisfies f (x̄) 0 is a globalminimum.Proof: From the first order characterization of convexity, we havef (y) f (x) f T (x)(y x), x, y7

In particular,f (y) f (x̄) f T (x̄)(y x), y.Since f (x̄) 0, we getf (y) f (x̄), y. Remarks: Recall that f (x) 0 is always a necessary condition for local optimality in anunconstrained problem. The theorem states that for convex problems, f (x) 0 isnot only necessary, but also sufficient for local and global optimality. In absence of convexity, f (x) 0 is not sufficient even for local optimality (e.g.,think of f (x) x3 and x̄ 0). Another necessary condition for (unconstrained) local optimality of a point x was 2 f (x) 0. Note that a convex function automatically passes this test.33.1Strict convexity and uniqueness of optimal solutionsCharacterization of Strict ConvexityRecall that a fuction f : Rn R is strictly convex if x, y, x 6 y, λ (0, 1),f (λx (1 λ)y) λf (x) (1 λ)f (y).Like we mentioned before, if f is strictly convex, then f is convex (this is obvious from thedefinition) but the converse is not true (e.g., f (x) x, x R).Second order sufficient condition: 2 f (x) 0, x Ω f strictly convex on Ω.The converse is not true though (why?).First order characterization: A function f is strictly convex on Ω Rn if and only iff (y) f (x) f T (x)(y x), x, y Ω, x 6 y.8

There are similar characterizations for strongly convex functions. For example, f isstrongly convex if and only if there exists m 0 such thatf (y) f (x) T f (x)(y x) m y x 2 , x, y dom(f ),or if and only if there exists m 0 such that 2 f (x) mI, x dom(f ). One of the main uses of strict convexity is to ensure uniqueness of the optimal solution.We see this next.3.2Strict Convexity and Uniqueness of Optimal SolutionsTheorem 3. Consider an optimization problemmin f (x)s.t. x Ω,where f : Rn R is strictly convex on Ω and Ω is a convex set. Then the optimal solution(assuming it exists) must be unique.Proof: Suppose there were two optimal solutions x, y Rn . This means that x, y Ω andf (x) f (y) f (z), z Ω.But consider z x y.2(6)By convexity of Ω, we have z Ω. By strict convexity, we have x yf (z) f211 f (x) f (y)2211 f (x) f (x) f (x).22But this contradicts (6). Exercise: For each function below, determine whether it is convex, strictly convex, stronglyconvex or none of the above. f (x) (x1 3x2 )29

f (x) (x1 3x2 )2 (x1 2x2 )2 f (x) (x1 3x2 )2 (x1 2x2 )2 x31 f (x) x (x R) f (x) x (x Rn )3.3Quadratic functions revisitedLet f (x) xT Ax bx c where A is symmetric. Then f is convex if and only if A 0,independently of b, c. (why?)Consider now the unconstrained optimization problemmin f (x)x(7)We can establish the following claims: A 0 (f not convex) problem (7) is unbounded below.Proof: Let x̄ be an eigenvector with a negative eigenvalue λ. ThenAx̄ λx̄ x̄T Ax̄ λx̄T x̄ 0f (αx̄) α2 x̄T Ax̄ αbx̄ cSo f (αx̄) when α . A 0 f is strictly convex. There is a unique solution to (7):1x A 1 b (why?)2 A 0 f is convex. If A is positive semidefinite but not positive definite, theproblem can be unbounded (e.g., take A 0, b 6 0), or it can be bounded below andhave infinitely many solutions (e.g., f (x) (x1 x2 )2 ). One can distinguish betweenthe two cases by testing whether b lies in the range of A. Furthermore, it is impossiblefor the problem to have a unique solution if A has a zero eigenvalue (why?).10

Figure 3: An illustration of the different possibilities for unconstrained quadratic minimization3.3.1Least squares revisitedRecall the least squares problemmin Ax b 2 .xUnder the assumption that the columns of A are linearly independent,x (AT A) 1 AT bis the unique global solution because the objective function is strictly convex (why?).4Optimality conditions for convex optimizationTheorem 4. Consider an optimization problemmin. f (x)s.t. x Ω,where f : Rn R is convex and differentiable and Ω is convex. Then a point x is optimalif and only if x Ω and f (x)T (y x) 0, y Ω.Remarks:11

What does this condition mean?– If you move from x towards any feasible y, you will increase f locally.– The vector f (x) (assuming it is nonzero) serves as a hyperplane that “supports” the feasible set Ω at x. (See figure below.)Figure 4: An illustration of the optimality condition for convex optimization The necessity of the condition holds independent of convexity of f . Convexity is usedin establishing sufficiency. If Ω Rn , the condition above reduces to our first order unconstrained optimalitycondition f (x) 0 (why?). Similarly, if x is in the interior of Ω and is optimal, we must have f (x) 0. (Takey x α f (x) for α small enough.)Proof:(Sufficiency) Suppose x Ω satisfies f (x)T (y x) 0, y Ω.(8)By the first order characterization of convexity, we have:f (y) f (x) f T (x)(y x), y Ω.12(9)

Then(8) (9) f (y) f (x), y Ω x is optimal.(Necessity) Suppose x is optimal but for some y Ω we had f T (x)(y x) 0.Consider g(α) : f (x (α(y x)). Because Ω is convex, α [0, 1], x α(y x) Ω.Observe thatg 0 (α) (y x)T f (x α(y x)). g 0 (0) (y x)T f (x) 0.This implies that δ 0, s.t. g(α) g(0), α (0, δ) f (x α(y x)) f (x), α (0, δ).But this contradicts the optimality of x. Here’s a special case of this theorem that comes up often.Theorem 5. Consider the optimization problemmin f (x)(10)s.t. Ax b,where f is a convex function and A Rm n . A point x Rn is optimal to (10) if and onlyif it is feasible and µ Rm s.t. f (x) AT µ.Proof: Since this is a convex problem, our optimality condition tells us that a feasible x isoptimal iff f T (x)(y x) 0, y s.t. Ay b.13

Any y with Ay b can be written as y x v, where v is a point in the nullspace of A;i.e., Av 0. Therefore, a feasible x is optimal if and only if f T (x)v 0, v s.t. Av 0.For any v, since Av 0 implies A( v) 0, we see that f T (x)v 0. Hence the optimalitycondition reads f T (x)v 0, v s.t. Av 0.This means that f (x) is in the orthogonal complement of the nullspace of A which weknow from linear algebra equals the row space of A (or equivalently the column space ofAT ). Hence µ Rm s.t. f (x) AT µ. NotesFurther reading for this lecture can include Chapter 3 of [1].References[1] S. Boyd and L. Vandenberghe. Convex Optimization. Cambridge University Press,http://stanford.edu/ boyd/cvxbook/, 2004.[2] A.L. Tits. Lecture notes on optimal control. University of Maryland, 2013.14

Lemma 1. Strong convexity )Strict convexity )Convexity. (But the converse of neither implication is true.) Proof: The fact that strict convexity implies convexity is obvious. To see that strong convexity implies strict convexity, note that strong convexity of fimplies f( x (1 )y) jj x (1 )yjj2 f(x) (1 )f(y) jjxjj2 (1 ) jjyjj2: But

Related Documents:

Optimizing Convex Functions over Non-Convex Domains - Columbia University

Convex obj non-convex domain. Introduction Generic Problem: min Q(x); s:t: x 2F; Q(x) convex, especially: convex quadratic F nonconvex Examples: F is a mixed-integer set F is constrained in a nasty way, e.g. x 1 3 sin(x 2) 2cos(x 3) 4 Bienstock, Michalka Columbia Convex obj non-convex domain.

8 Views

11m ago

Convex Optimization Solutions Manual

Solution. We prove the rst part. The intersection of two convex sets is convex. There-fore if Sis a convex set, the intersection of Swith a line is convex. Conversely, suppose the intersection of Swith any line is convex. Take any two distinct points x1 and x2 2 S. The intersection of Swith the line through x1 and x2 is convex.

31 Views

3y ago

The Structure of Properly Convex Manifolds - Florida State University

What is convex projective geometry? Motivation from hyperbolic geometry Nice Properties of Hyperbolic Space Convex: Intersection with projective lines is connected. Properly Convex: Convex and closure is contained in an afﬁne patch ()Disjoint from some projective hyperplane. Strictly Convex: Properly convex and boundary contains

8 Views

1y ago

42 GEOMETRIC INTERSECTION

3 convex-convex n [DK85] convex-convex (n;lognp lognq) [DK90] INTERSECTION DETECTION OF CONVEX POLYGONS Perhaps the most easily understood example of how the structure of geometric objects can be exploited to yield an e cient intersection test is that of detecting the intersection of two convex polygons. There are a number of solutions to this .

32 Views

3y ago

Convex Optimization Theory Athena Scientiﬁc, 2009

Convex Optimization Theory Athena Scientiﬁc, 2009 by Dimitri P. Bertsekas Massachusetts Institute of Technology Supplementary Chapter 6 on Convex Optimization Algorithms This chapter aims to supplement the book Convex Optimization Theory, Athena Scientiﬁc, 2009 with material on convex optimization algorithms. The chapter will be .

21 Views

2y ago

1 Convex Sets, and Convex Functions

Proof:Let us denote the set of all convex combinations of ppoints of Sby Cp(S). Then the set of all possible convex combinations of points of S is C(S) : [1 p 1Cp(S). If x2 C(S) then it is a convex com

80 Views

2y ago

Optimization for Machine Learning

Convex optimization – Boyd & Vandenberghe Nonlinear programming – Bertsekas Convex Analysis – Rockafellar Fundamentals of convex analysis – Urruty, Lemarechal Lectures on modern convex optimization – Nemirovski Optimization for Machine Learning – Sra, Nowozin, Wright Theory of Convex Optimization for Machine Learning – Bubeck .

21 Views

2y ago

Convex Optimization - People

Convex optimization { Boyd & Vandenberghe (BV) Introductory lectures on convex optimisation { Nesterov Nonlinear programming { Bertsekas Convex Analysis { Rockafellar Numerical optimization { Nocedal & Wright Lectures on modern convex optimization { Nemirovski Optimization for Machine Learning { Sra, Nowozin, Wright

20 Views

2y ago

Recent Views

Technological Revolutions and Stock Prices National Bureau of Economic .

stock prices up, but the discount rate eﬀect prevails eventually, pushing the stock prices down. The resulting pattern in the new economy stock prices looks like a bubble but it obtains under rational expectations through a general equilibrium eﬀect. The bubble-like pattern in stock prices arises in part due to an ex post selection bias.

1y ago

121 Views

Dynamic correlation between stock market and oil prices: The case of .

between stock market and oil prices is still growing. Nevertheless, there are very few studies on the dynamic correlation between these two markets. A first approach on the dynamic co-movements between oil prices and stock markets was performed by Ewing and Thomson (2007), using the cyclical components of oil prices and stock prices.

1y ago

122 Views

Prices Effective January 1, 2020 Machine Prices and Speci cations

Prices Effective January 1, 2020 Machine Prices and Speci cations Prices Effective January 1, 2020 ZERO TURN-4 SERIES REVISED MAY 18, 2020. Machine Prices and Secications Prices Eectie anuar , 2 ZT1. Prices F.O.B. Selma, Alabama and Subject to Change Without Notice. ESTATE SERIES

1y ago

119 Views

Vanguard U.S. Stock Index Small-Capitalization Funds .

† Stock market risk, which is the chance that stock prices overall will decline. Stock markets tend to move in cycles, with periods of rising prices and periods of falling prices.The Fund’s target index tracks a subset of the U.S. stock market, which could cause the Fund to perform di

2y ago

260 Views

Forecasting Prices on the Stock Exchange Using a Trading System

Forecasting prices in stock markets is a matter of great interest both in the academic field and in business. The forecasting of stock prices and stock returns is possible using various techniques and methods. Many researchers study price trends in stock markets with the help of artificial neural networks [1-2] or fuzzy-trends [3, 4]. The

1y ago

132 Views

A Hybrid Prediction Method for Stock Price Using LSTM and . - Hindawi

the relationship between stock prices and these factors. Although these factors will temporarily change the stock price, in essence, these factors will be reﬂected in the stock price and will not change the long-term trend of the stock price. erefore, stock prices can be predicted simply with historical data.

1y ago

159 Views

Columbus,Ohio 1890

Slicing Steaks 3563 Beef Tender, Select In Stock 3852 Angus XT Shoulder Clod, Choice In Stock 3853 Angus XT Chuck Roll, Choice 20/up In Stock 3856 Angus XT Peeled Knuckle In Stock 3857 Angus XT Inside Rounds In Stock 3858 Angus XT Flats, Choice In Stock 3859 Angus XT Eye Of Round, Choice In Stock 3507 Point Off Bnls Beef Brisket, Choice In Stock

2y ago

268 Views

Determine your Pricing Point Taxonomy to Help you Pricing Strategy: Using

Lowering of prices to match competitor prices can be done on a more precise level with Taxonomy. Price matching can be done based on the comparison of further classification via Taxonomy. Stock Availability: In stock/ Out of stock When both the competitor and us have the same product in stock, we ought to markdown to match prices when

1y ago

117 Views

COVID-19 and Energy COVID-19 and the Oil Price - Stock Market Nexus .

stock market. 1. Introduction Oil prices play a key role in stock market performance of oil-importing economies. A decline in oil prices reduces the cost of production and increases economic growth (Narayan et al., 2014). The effect of this is a rise in stock prices due to higher future earnings and dividends (Filis, 2010; Jones &

1y ago

125 Views

Relationship between Financial Ratios in the Stock Prices of .

projected financial ratios. Miri and Abraham (2010) linear and non-linear relationship between the ratio of stock prices in the financial and non-metallic minerals industry Tehran Stock Exchange for the years 2003 to 2007 were reviewed. The results showed that linear and nonlinear relationships between financial ratios and stock prices there is no

10m ago

99 Views

Stock prediction using a Hidden Markov Model versus a Long Short-Term .

the close price of a stock is used when training and predicting stock prices. All data was retrieved from Yahoo. Historical stock prices from 1 January 1990 until 1 June 2019 results in approximately 7000 data points (trading days) per stock. Data preparation In order to create enough data for the two models to

1y ago

121 Views

Learning from Peers Stock Prices and Corporate Investment - USI

stock prices then an increase in the -rm s own stock price informativeness reduces the sensitivity of its investment to its peer stock price (prediction 1). Indeed, as the signal conveyed by its own . stock price (prediction 2), but not otherwise. The same prediction holds for an increase in the correlation of the fundamentals of a -rm .

1y ago

128 Views

Do stock prices respond to changes in corporate income tax rates?

In the event studies, I regress stock returns on market returns and other factors over a time span well before the events of a tax change, creating a model of how the stock returns behave. Then I use the deviation of stock prices from the model's prediction around the events of the tax change to establish the stock's abnormal returns.

1y ago

117 Views

The Impact of Persian News on Stock Returns Through Text Mining Techniques

Persian news - on the stock prices has been neglected. Consequently, this study aimed to fill this gap. To this aim, the stock index values were collected from the Tehran Stock Exchange along with the . Stock market prediction is a way to understand the future fluctuations of a company's stock price (Jishag et al., 2020). Generally, two .

1y ago

225 Views

Buying Your First Stock - Stock-Trak

Stock Market Game Time: 15 Minutes Requires: StockTrak Curriculum , Computer Access Buying Your First Stock This lesson is an introduction to buying a stock. Students will be introduced to basic vocabulary that is involved with a buying and owning a stock. Stu-dents will be going through the entire process of buying a stock from looking

1y ago

164 Views

1 Theory Of Convex Functions - Princeton University

It looks like you're using an ad-blocker