Lazifying Conditional Gradient Algorithms


Journal of Machine Learning Research 20 (2019) 1-42. Submitted 3/18; Revised 2/19; Published 3/19.

Lazifying Conditional Gradient Algorithms

Gábor Braun (gabor.braun@isye.gatech.edu)
Sebastian Pokutta (sebastian.pokutta@isye.gatech.edu)
Daniel Zink (daniel.zink@gatech.edu)
School of Industrial & Systems Engineering
Georgia Institute of Technology
Atlanta, GA 30332, USA

Editor: Mark Schmidt

© 2019 Gábor Braun, Sebastian Pokutta, and Daniel Zink. License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v20/18-114.html.

Abstract

Conditional gradient algorithms (also often called Frank–Wolfe algorithms) are popular due to their simplicity, requiring only a linear optimization oracle, and more recently they have also gained significant traction for online learning. While simple in principle, in many cases the actual implementation of the linear optimization oracle is costly. We show a general method to lazify various conditional gradient algorithms, which in actual computations leads to several orders of magnitude of speedup in wall-clock time. This is achieved by using a faster separation oracle instead of a linear optimization oracle, relying only on few linear optimization oracle calls.

Keywords: Frank–Wolfe algorithm, conditional gradient, caching, linear optimization oracle, convex optimization

1. Introduction

Convex optimization is an important technique both from a theoretical and an applications perspective. Gradient descent based methods are widely used due to their simplicity and easy applicability to many real-world problems. We are interested in solving constrained convex optimization problems of the form

    min f(x)  subject to  x ∈ P,                                (1)

where f is a smooth convex function and P is a polytope, with access to f limited to first-order information, i.e., we can obtain f(x) and ∇f(x) for a given x ∈ P, and access to P via a linear minimization oracle, which returns LP_P(c) = argmin_{x ∈ P} cx for a given linear objective c.

When solving Problem (1) using gradient descent approaches, a projection step is typically required to maintain feasibility. This projection back into the feasible region P is potentially computationally expensive, especially for complex feasible regions in very large dimensions. As such, projection-free methods have gained a lot of attention recently, in particular the Frank–Wolfe algorithm (Frank and Wolfe, 1956) (also known as conditional gradient descent (Levitin and Polyak, 1966); see also Jaggi (2013) for an overview) and its online version (Hazan and Kale, 2012), due to their simplicity. We recall the basic Frank–Wolfe algorithm in Algorithm 1. These methods eschew the projection step and rather use a

linear optimization oracle to stay within the feasible region.

Algorithm 1 Frank–Wolfe Algorithm (Frank and Wolfe, 1956)
Input: smooth convex function f with curvature C, start vertex x1 ∈ P, linear minimization oracle LP_P
Output: points xt in P
1: for t = 1 to T − 1 do
2:     vt ← LP_P(∇f(xt))
3:     xt+1 ← (1 − γt) xt + γt vt  with  γt := 2/(t + 2)
4: end for

While convergence rates and regret bounds are often suboptimal, in many cases the gain due to only having to solve a single linear optimization problem over the feasible region in every iteration still leads to significant computational advantages (see e.g., Hazan and Kale, 2012, Section 5). This led to conditional gradient algorithms being used for, e.g., online optimization and more generally machine learning. The property that these algorithms naturally generate sparse distributions over the extreme points of the feasible region is often helpful as well. Further increasing the relevance of these methods, it was shown recently that conditional gradient methods can also achieve linear convergence (see e.g., Garber and Hazan (2013); Lacoste-Julien and Jaggi (2015); Garber and Meshi (2016)) and that the number of total gradient evaluations can be reduced while maintaining the optimal number of oracle calls, as shown in Lan and Zhou (2014).

Unfortunately, for complex feasible regions even solving the linear optimization problem might be time-consuming, and as such the cost of solving the LP might be non-negligible. This could be the case, e.g., when linear optimization over the feasible region is a hard problem or when solving large-scale optimization or learning problems.
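To make the update rule of Algorithm 1 concrete, the following is a minimal Python sketch. The function names (`frank_wolfe`, `lp_oracle`, `grad`) and the probability-simplex toy problem are our illustrative assumptions, not part of the paper:

```python
import numpy as np

def frank_wolfe(grad, lp_oracle, x1, T):
    """Vanilla Frank-Wolfe (Algorithm 1): one LP call per iteration."""
    x = x1
    for t in range(1, T):
        v = lp_oracle(grad(x))           # v_t = argmin_{v in P} <grad f(x_t), v>
        gamma = 2.0 / (t + 2.0)          # step size gamma_t = 2/(t+2)
        x = (1 - gamma) * x + gamma * v  # convex step keeps x in P
    return x

# Toy run: minimize f(x) = ||x - (0.8, 0.2)||^2 over the probability simplex,
# whose LP oracle simply picks the vertex (standard basis vector) minimizing c.
target = np.array([0.8, 0.2])
x_fw = frank_wolfe(lambda z: 2 * (z - target),
                   lambda c: np.eye(2)[np.argmin(c)],
                   np.array([0.0, 1.0]), 2000)
```

Note that every iteration pays for one full LP call; removing this per-iteration cost is exactly what the lazification below addresses.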
As such it is natural to ask the following questions:

(i) Does the linear optimization oracle have to be called in every iteration?
(ii) Does one need approximately optimal solutions for convergence?
(iii) Can one reuse information across iterations?

We will answer these questions in this work, showing that (i) the LP oracle is not required to be called in every iteration, (ii) much weaker guarantees are sufficient, and (iii) we can reuse information. To significantly reduce the cost of oracle calls while maintaining identical convergence rates up to small constant factors, we replace the linear optimization oracle by a (weak) separation oracle (Oracle 1), which approximately solves a separation problem

Oracle 1 Weak Separation Oracle LPsep_P(c, x, Φ, K)
Input: linear objective c ∈ R^n, point x ∈ P, accuracy K ≥ 1, objective value Φ > 0
Output: either (1) a vertex y ∈ P with c(x − y) > Φ/K, or (2) false: c(x − z) ≤ Φ for all z ∈ P

within a multiplicative factor and returns improving vertices. We stress that the weak separation oracle is significantly weaker than approximate minimization, which has been

already considered in Jaggi (2013). In fact, there is no guarantee that the improving vertices returned by the oracle are close to the optimal solution of the linear minimization problem. It is this relaxation of dual bounds and approximate optimality that will provide a significant speedup, as we will see later. However, if the oracle does not return an improving vertex (returns false), then this fact can be used to derive a reasonably small dual bound of the form f(xt) − f(x*) ≤ ∇f(xt)(xt − x*) ≤ Φt for some Φt > 0. While the accuracy K is presented here as a formal argument of the oracle, an oracle implementation might restrict to a fixed value K ≥ 1, which often makes implementation easier. We point out that the cases (1) and (2) potentially overlap if K > 1. This is intentional, and in this case it is unspecified which of the cases the oracle should choose (it does not matter for the algorithms).

This new oracle encapsulates the smart use of the original linear optimization oracle, even though for some problems it could potentially be implemented directly, without relying on a linear programming oracle. Concretely, a weak separation oracle can be realized by a single call to a linear optimization oracle and as such is no more complex than the original oracle. However, it has two important advantages: it allows for caching and early termination. Caching refers to storing previous solutions and first searching among them to satisfy the oracle's separation condition. The underlying linear optimization oracle is called only when none of the cached solutions satisfies the condition. Algorithm 2 formalizes this process. Early termination is the technique of stopping the linear optimization algorithm before it finishes, at an appropriate stage when a suitable oracle answer can be easily recovered from its internal data; this is clearly an implementation-dependent technique.
The two techniques can be combined, e.g., Algorithm 2 could use an early-terminating linear oracle or another implementation of the weak separation oracle in line 4.

Algorithm 2 LPsep_P(c, x, Φ, K) via LP oracle
Input: linear objective c ∈ R^n, point x ∈ P, accuracy K ≥ 1, objective value Φ > 0
Output: either (1) a vertex y ∈ P with c(x − y) > Φ/K, or (2) false: c(x − z) ≤ Φ for all z ∈ P
1: if a cached y ∈ P with c(x − y) > Φ/K exists then
2:     return y    {cache call}
3: else
4:     y ← argmin_{z ∈ P} cz    {LP call}
5:     if c(x − y) > Φ/K then
6:         add y to cache
7:         return y
8:     else
9:         return false
10:     end if
11: end if

We call lazification the technique of replacing a linear programming oracle with a much weaker one, and we will demonstrate significant speedups in wall-clock performance (see e.g., Figure 12), while maintaining identical theoretical convergence rates.

To exemplify our approach we provide conditional gradient algorithms employing the weak separation oracle for the standard Frank–Wolfe algorithm as well as the variants in
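Algorithm 2 can be rendered in a few lines of Python. This is an illustrative sketch: the class name, the `lp_oracle` callable (assumed to return argmin_{z ∈ P} cz over the vertices of P), and the simplex example are our assumptions, not the paper's code:

```python
import numpy as np

class WeakSeparationOracle:
    """Algorithm 2: weak separation via an LP oracle, with a vertex cache."""
    def __init__(self, lp_oracle, K=1.0):
        self.lp_oracle = lp_oracle  # assumed: c -> argmin_{z in P} <c, z>
        self.K = K
        self.cache = []             # previously returned improving vertices

    def __call__(self, c, x, phi):
        for y in self.cache:                    # cache call
            if c @ (x - y) > phi / self.K:
                return y
        y = self.lp_oracle(c)                   # LP call, only as fallback
        if c @ (x - y) > phi / self.K:
            self.cache.append(y)
            return y
        return None                             # "false": c(x-z) <= phi for all z in P

# Toy usage on the 2-dimensional probability simplex (vertices e1, e2):
oracle = WeakSeparationOracle(lambda c: np.eye(2)[np.argmin(c)])
v = oracle(np.array([0.0, 1.0]), np.array([0.0, 1.0]), 0.5)  # improving vertex
```

Repeating the same query is then answered from the cache without touching the LP oracle, which is where the wall-clock savings described in the text come from.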

Hazan and Kale (2012); Garber and Meshi (2016); Garber and Hazan (2013), which have been chosen because they require modified convergence arguments that go beyond those needed for the vanilla Frank–Wolfe algorithm. Complementing the theoretical analysis, we report computational results demonstrating the effectiveness of our approach via a significant reduction in wall-clock time compared to the linear optimization counterparts.

Related Work

There has been extensive work on Frank–Wolfe and conditional gradient algorithms, so we restrict our review to the work most closely related to ours. The Frank–Wolfe algorithm was originally introduced in Frank and Wolfe (1956) (also known as conditional gradient descent (Levitin and Polyak, 1966)) and has been intensely studied, in particular in terms of achieving stronger convergence guarantees as well as affine-invariant versions. We demonstrate our approach for the vanilla Frank–Wolfe algorithm (Frank and Wolfe, 1956) (see also Jaggi (2013)) as an introductory example. We then consider more complicated variants that require non-trivial changes to the respective convergence proofs to demonstrate the versatility of our approach. This includes the linearly convergent variant via local linear optimization (Garber and Hazan, 2013) as well as the pairwise conditional gradient variant of Garber and Meshi (2016), which is especially efficient in terms of implementation. However, our technique also applies to the Away-Step Frank–Wolfe algorithm, the Fully-Corrective Frank–Wolfe algorithm, the Pairwise Conditional Gradient algorithm, as well as the Block-Coordinate Frank–Wolfe algorithm. Recently, in Freund and Grigas (2016), guarantees for arbitrary step-size rules were provided, and an analogous analysis can also be performed for our approach.
On the other hand, the analysis of inexact variants, e.g., with approximate linear minimization, does not apply to our case, as our oracle is significantly weaker than approximate minimization, as pointed out earlier. For more information, we refer the interested reader to the excellent overview in Jaggi (2013) for Frank–Wolfe methods in general, as well as Lacoste-Julien and Jaggi (2015) for an overview with respect to global linear convergence.

It was also recently shown in Hazan and Kale (2012) that the Frank–Wolfe algorithm can be adjusted to the online learning setting, and in this work we provide a lazy version of this algorithm. Combinatorial convex online optimization has been investigated in a long line of work (see e.g., Kalai and Vempala (2005); Audibert et al. (2013); Neu and Bartók (2013)). It is important to note that our regret bounds hold in the structured online learning setting, i.e., our bounds depend on the ℓ1-diameter or sparsity of the polytope, rather than its ambient dimension, for arbitrary convex functions (see e.g., Cohen and Hazan (2015); Gupta et al. (2016)). We refer the interested reader to Hazan (2016) for an extensive overview.

A key component of the new oracle is the ability to cache and reuse old solutions, which accounts for the majority of the observed speedup. The idea of caching oracle calls was already explored in various other contexts, such as cutting plane methods (see e.g., Joachims et al. (2009)) as well as the Block-Coordinate Frank–Wolfe algorithm in Shah et al. (2015); Osokin et al. (2016). Our lazification approach (which uses caching) is, however, much more lazy, requiring no multiplicative approximation guarantee; see (Osokin et al., 2016, Proof of Theorem 3, Appendix F) and Lacoste-Julien et al. (2013) for a comparison to our setup.

Contribution

The main technical contribution of this paper is a new approach whereby, instead of finding the optimal solution, the oracle is used only to find a good enough solution or a certificate that such a solution does not exist, both ensuring the desired convergence rate of the conditional gradient algorithms.

Our contribution can be summarized as follows:

(i) Lazifying approach. We provide a general method to lazify conditional gradient algorithms. For this we replace the linear optimization oracle with a weak separation oracle, which allows us to reuse feasible solutions from previous oracle calls, so that in many cases the oracle call can be skipped. In fact, once a simple representation of the underlying feasible region is learned, no further oracle calls are needed. We also demonstrate how parameter-free variants can be obtained.

(ii) Lazified conditional gradient algorithms. We exemplify our approach by providing lazy versions of the vanilla Frank–Wolfe algorithm as well as of the conditional gradient methods in Hazan and Kale (2012); Garber and Hazan (2013); Garber and Meshi (2016).

(iii) Weak separation through augmentation. We show, in the case of 0/1 polytopes, how to implement a weak separation oracle with at most k calls to an augmentation oracle that, on input c ∈ R^n and x ∈ P, provides either an improving solution x′ ∈ P with cx′ < cx or ensures optimality, where k denotes the ℓ1-diameter of P. This is useful when the solution space is sparse.

(iv) Computational experiments. We demonstrate computational superiority by extensive comparisons of the weak separation based versions with their original versions. In all cases we report significant speedups in wall-clock time, often of several orders of magnitude.

It is important to note that in all cases we inherit the same requirements, assumptions, and properties of the baseline algorithm that we lazify.
This includes applicable function classes, norm requirements, as well as smoothness and (strong) convexity requirements. We also maintain identical convergence rates up to (small) constant factors.

A previous version of this work appeared as an extended abstract in Braun et al. (2017); this version has been significantly revised over the conference version, including a representative subset of more extensive computational results, full proofs for all described variants, as well as a variant that uses an augmentation oracle instead of a linear optimization oracle (see Section 7).

Outline

We briefly recall notation and notions in Section 2 and consider conditional gradient algorithms in Section 3. In Section 4 we consider parameter-free variants of the proposed algorithms, and in Section 5 we examine online versions. Finally, in Section 7 we show a realization of a weak separation oracle with an even weaker oracle in the case of combinatorial problems, and we provide extensive computational results in Section 8.

2. Preliminaries

Let ‖·‖ be an arbitrary norm on R^n, and let ‖·‖* denote the dual norm of ‖·‖. A function f is L-Lipschitz if |f(y) − f(x)| ≤ L‖y − x‖ for all x, y ∈ dom f. A convex function f is smooth with curvature at most C if

    f(γy + (1 − γ)x) ≤ f(x) + γ∇f(x)(y − x) + Cγ²/2

for all x, y ∈ dom f and 0 ≤ γ ≤ 1. A function f is S-strongly convex if

    f(y) ≥ f(x) + ∇f(x)(y − x) + (S/2)‖y − x‖²

for all x, y ∈ dom f. Unless stated otherwise, Lipschitz continuity and strong convexity will be measured in the norm ‖·‖. Moreover, let B_r(x) := {y : ‖x − y‖ ≤ r} be the ball around x with radius r with respect to ‖·‖. In the following, P will denote the feasible region, a polytope, and the vertices of P will be denoted by v1, ..., vN.

3. Lazy Conditional Gradient

We start with the most basic Frank–Wolfe algorithm as a simple example for lazifying by means of a weak separation oracle. We then lazify the more complex Frank–Wolfe algorithms in Garber and Hazan (2013) and Garber and Meshi (2016). Throughout this section ‖·‖ denotes the ℓ2-norm.

3.1. Lazy Conditional Gradient: a basic example

We start with lazifying the original Frank–Wolfe algorithm (arguably the simplest conditional gradient algorithm), adapting the baseline argument from (Jaggi, 2013, Theorem 1). While the vanilla version has suboptimal convergence rate O(1/T), its simplicity makes it an illustrative example of the main idea of lazification. The lazy algorithm (Algorithm 3) maintains an upper bound Φt on the convergence rate, guiding its eagerness for progress when searching for an improving vertex vt. If the weak separation oracle provides an improving vertex vt we refer to this as a positive call, and if the oracle claims there are no improving vertices we call it a negative call.

The step size γt is chosen to (approximately) minimize Φt in Line 2; roughly γt ≈ Φt−1/(KC).

Theorem 1 Assume f is convex and smooth with curvature C.
Then Algorithm 3 with step size γt = 2(K² + 1)/(K(t + K² + 2)) and f(x1) − f(x*) ≤ Φ0 has convergence rate

    f(xt) − f(x*) ≤ 2 max{C, Φ0}(K² + 1)/(t + K² + 2),

where x* is a minimum point of f over P.

Proof We prove by induction that

    f(xt) − f(x*) ≤ Φt−1.

Algorithm 3 Lazy Conditional Gradient
Input: smooth convex function f with curvature C, start vertex x1 ∈ P, weak linear separation oracle LPsep_P, accuracy K ≥ 1, step sizes γt, initial upper bound Φ0
Output: points xt in P
1: for t = 1 to T − 1 do
2:     Φt ← (Φt−1 + Cγt²/2)/(1 + γt/K)
3:     vt ← LPsep_P(∇f(xt), xt, Φt, K)
4:     if vt = false then
5:         xt+1 ← xt
6:     else
7:         xt+1 ← (1 − γt)xt + γt vt
8:     end if
9: end for

The claim is clear for t = 1 by the choice of Φ0. Assuming the claim is true for t, we prove it for t + 1. We distinguish two cases depending on the return value of the weak separation oracle in Line 3.

In case of a positive call, i.e., when the oracle returns an improving solution vt, we have ∇f(xt)(xt − vt) > Φt/K, which is used in the second inequality below. The first inequality follows by smoothness of f, and the second by the induction hypothesis together with the fact that vt is an improving solution:

    f(xt+1) − f(x*) ≤ f(xt) − f(x*) + γt ∇f(xt)(vt − xt) + Cγt²/2
                    ≤ Φt−1 − γt Φt/K + Cγt²/2
                    = Φt.

In case of a negative call, i.e., when the oracle returns no improving solution, we have in particular ∇f(xt)(xt − x*) ≤ Φt, hence by Line 5

    f(xt+1) − f(x*) = f(xt) − f(x*) ≤ ∇f(xt)(xt − x*) ≤ Φt.

Finally, using the specific values of γt, we prove the upper bound

    Φt−1 ≤ 2 max{C, Φ0}(K² + 1)/(t + K² + 2)

by induction on t. The claim is obvious for t = 1. The induction step is an easy computation relying on the definition of Φt in Line 2:

    Φt = (Φt−1 + Cγt²/2)/(1 + γt/K)
       ≤ (2 max{C, Φ0}(K² + 1)/(t + K² + 2) + max{C, Φ0} γt²/2)/(1 + γt/K)
       ≤ 2 max{C, Φ0}(K² + 1)/(t + K² + 3).

Here the last inequality follows from the concrete value of γt.
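As a concrete illustration, the following hedged Python sketch combines Algorithm 3 with a cached weak separation oracle in the spirit of Algorithm 2. The function name `lazy_cg`, the inlined cache, and the simplex test problem are our illustrative assumptions; the step-size rule and the Φ update follow Theorem 1:

```python
import numpy as np

def lazy_cg(grad, lp_oracle, x1, T, C, phi0, K=1.0):
    """Sketch of Algorithm 3 (Lazy Conditional Gradient) with the weak
    separation oracle realized as a vertex cache in front of the LP oracle."""
    x, phi, cache = x1, phi0, []
    for t in range(1, T):
        gamma = 2 * (K**2 + 1) / (K * (t + K**2 + 2))    # step size from Theorem 1
        phi = (phi + C * gamma**2 / 2) / (1 + gamma / K) # Line 2: tighten bound
        c = grad(x)
        v = next((y for y in cache if c @ (x - y) > phi / K), None)  # cache call
        if v is None:
            y = lp_oracle(c)                             # LP call, only as fallback
            if c @ (x - y) > phi / K:
                cache.append(y)
                v = y
        if v is not None:
            x = (1 - gamma) * x + gamma * v              # positive call: FW step
        # negative call: x_{t+1} = x_t, and phi certifies the dual bound
    return x

# Toy run: minimize f(x) = ||x - (0.8, 0.2)||^2 over the probability simplex.
target = np.array([0.8, 0.2])
x_lazy = lazy_cg(lambda z: 2 * (z - target),
                 lambda c: np.eye(2)[np.argmin(c)],
                 np.array([0.0, 1.0]), T=2000, C=4.0, phi0=2.0)
```

After the two simplex vertices have been cached, every remaining iteration is answered from the cache, so the LP oracle is called only a constant number of times in this toy example.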

Note that by design the algorithm converges at the worst-case rate that we postulated, due to the negative calls during which it does not move. Clearly, this is highly undesirable, and therefore the algorithm should be understood as the textbook variant of lazy conditional gradient. We will present an improved, parameter-free variant of Algorithm 3 in Section 4 that converges at the best possible rate that the non-lazy variant would achieve (up to a small constant factor).

3.2. Lazy Pairwise Conditional Gradient

In this section we provide a lazy variant (Algorithm 4) of the Pairwise Conditional Gradient algorithm from Garber and Meshi (2016), using separation instead of linear optimization. We make identical assumptions: the feasible region is a 0/1 polytope, i.e., all vertices of P have only 0/1 entries, and moreover it is given in the form P = {x ∈ R^n : 0 ≤ x ≤ 1, Ax ≤ b}, where 1 denotes the all-one vector.

Algorithm 4 Lazy Pairwise Conditional Gradient (LPCG)
Input: polytope P, smooth and S-strongly convex function f with curvature C, accuracy K ≥ 1, non-increasing step sizes ηt, eagerness parameters
Output: points xt
1: x1 ∈ P arbitrary and Φ0 ≥ f(x1) − f(x*)
2: for t = 1, ..., T do
3:     ∇̃f(xt)i := ∇f(xt)i if (xt)
