Learning-based Lyapunov Analysis For Nonlinear Control Systems


Ya-Chien Chang, Lijing Kuang, Yuet Fung

Abstract

The establishment of Lyapunov theory has provided fundamental methods for determining local stability using Lyapunov functions. How to successfully search for Lyapunov functions therefore becomes a critical problem in stabilizing control systems. In this work, we tackle the stability problem in nonlinear control systems using a novel learning-based architecture, which guarantees the satisfaction of the Lyapunov constraints in a strict sense. Experiments on various dynamical systems are performed to evaluate the effectiveness of the presented method.

Preprint. Under review.

1 Introduction

Evaluating regions of stability is one of the most fundamental problems in cyber-physical and safety-critical systems [1, 2], where continuous and unknown system dynamics are often involved. When dealing with continuous dynamics, the common practice is to discretize the space using dynamic programming. This approach, however, struggles to handle discontinuous dynamics and complex domains effectively [3]. Recently, we have seen promising progress in the control community toward using convex optimization to develop efficient algorithms that search for and compute Lyapunov functions directly, so as to stabilize nonlinear control systems. The presence of nonlinearity requires function approximation, where existing algorithms typically use sum-of-squares polynomials as Lyapunov functions via semidefinite programming (SDP). It has been shown that SDP facilitates the determination of stability in nonlinear control systems [4, 5], which can then be combined with motion planning algorithms for feedback control synthesis in complex dynamical systems [6]. Nevertheless, none of the existing methods provide provable guarantees of the Lyapunov stability constraints, which may lead to infeasible solution sets.

To tackle this issue, we take one step forward and solve the stability problem in nonlinear dynamical systems by resorting to learning-based optimization methods built on neural network architectures. The presented method provides provable guarantees when constructing Lyapunov functions for the given system dynamics, which in turn allows us to establish regions of stability. To summarize, the main contributions of this work are as follows:

- We formulate the search for safe regions of arbitrary nonlinear dynamical systems as a convex optimization problem over a scalar function of the states.
- A neural network learning framework is presented to construct Lyapunov function candidates for computing regions of stability, which are guaranteed to satisfy the Lyapunov stability conditions. Our method improves upon existing algorithms by providing guarantees of the satisfaction of the Lyapunov conditions, which is a key challenge in the control community.
- Several nonlinear dynamical systems are introduced for experimental evaluation. Our method is able to find policies that generate larger regions of attraction than existing methods.

The rest of this paper is organized as follows. In Section 2, we discuss prior work and highlight our contributions. We describe the problem statement in Section 3, where we introduce the primal problem, the dual problem, and the KKT conditions. We develop a learning-based architecture that uses neural networks to discover Lyapunov functions in Section 4. Section 5 presents our experimental results. Task assignments are given in Section 6. We conclude our study in Section 7.

2 Related Work

Richards et al. [7] proposed a Lyapunov framework that uses neural networks to learn safety certificates and showed that using neural networks is more effective. However, both the goal and the approach of this study differ from that prior work. Richards et al. focus on discrete-time polynomial systems, whereas we focus on learning the controller and the Lyapunov function together with a provable guarantee of stability over larger regions. In terms of approach, Richards et al. use neural networks to learn the region of attraction of a given controller, while our method is capable of handling non-polynomial continuous dynamical systems given only an initialization of the control function. Moreover, our architecture uses generic feed-forward network representations with no manual design, whereas the neural architecture in [7] requires special design practices to accommodate the relaxation. As a result, our architecture applies to more nonlinear systems and can find new control functions that enlarge the regions of attraction obtainable from standard control methods.

3 Problem Statement

Consider a dynamical system

    $\frac{dx}{dt} = f(x), \quad x(0) = x_0. \qquad (1)$

A point $x^*$ is an equilibrium point of the above system if $f(x^*) = 0$. The equilibrium $x^*$ is asymptotically stable if it is locally stable and, as $t \to \infty$, all solutions starting near $x^*$ tend to $x^*$ (i.e., all nearby trajectories converge to the equilibrium).

Meanwhile, consider a function $V$. $V$ is positive definite if $V(x) > 0$ for all $x \neq 0$, $V(0) = 0$, and $V(x) \to \infty$ as $\|x\| \to \infty$. Suppose $V(x)$ is a generalized energy function of the dynamical system; as the system approaches stability, its energy decreases with time. As a result, we have $\dot V(x) \le 0$.

Consider the Lyapunov global asymptotic stability theorem: suppose there is a function $V$ such that $V$ is positive definite, $\dot V(x) < 0$ for all $x \neq 0$, and $\dot V(0) = 0$; then every trajectory of $\frac{dx}{dt} = f(x)$ converges to zero as $t \to \infty$.

By translating the equilibrium to the origin ($x^* = 0$) and using the above facts, we can restate the stability problem as follows. Seek a positive definite function $V : D \to \mathbb{R}$ that satisfies the conditions

    $V(0) = 0, \quad \text{and, for all } x \in D \setminus \{0\}, \quad V(x) > 0 \text{ and } \dot V(x) < 0; \qquad (2)$

then the equilibrium is asymptotically stable and $V$ is called a Lyapunov function of the dynamical system.

3.1 Primal Problem

Consider the quadratic Lyapunov function candidate $V(x) = x^T P x$ for the linear dynamics $\frac{dx}{dt} = Ax$. We then have $\dot V(x) = x^T (A^T P + P A) x$, where $P \in \mathbb{R}^{n \times n}$ is a positive definite matrix and $A \in \mathbb{R}^{n \times n}$ is the system matrix. To satisfy the above Lyapunov conditions we need $A^T P + P A \prec 0$. Thus, the primal problem is to search for a positive definite matrix $P$ subject to $A^T P + P A \prec 0$; a small numerical illustration follows at the end of this subsection.
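For illustration, the sketch below searches for such a matrix $P$ using CVXPY; the solver choice and the example system matrix (the linearization of the pendulum dynamics used later in Section 5) are assumptions made for this sketch, not prescriptions of the method.

```python
# Minimal sketch of the primal Lyapunov SDP, assuming CVXPY as the modeling
# layer and the linearized pendulum from Section 5 as the system matrix.
import cvxpy as cp
import numpy as np

# Linearizing x1' = x2, x2' = -sin(x1) - x2 around the origin gives A below.
A = np.array([[0.0, 1.0],
              [-1.0, -1.0]])
n = A.shape[0]

P = cp.Variable((n, n), symmetric=True)
eps = 1e-6  # small margin to enforce strict definiteness numerically

constraints = [
    P >> eps * np.eye(n),                  # P positive definite
    A.T @ P + P @ A << -eps * np.eye(n),   # A^T P + P A negative definite
]
problem = cp.Problem(cp.Minimize(0), constraints)  # pure feasibility problem
problem.solve()

print(problem.status)  # 'optimal' means a feasible P was found
print(P.value)         # V(x) = x^T P x is then a quadratic Lyapunov function
```

For this example, any feasible $P$ certifies local asymptotic stability of the origin; the learning-based formulation in the following sections generalizes this search beyond quadratic candidates.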

3.2 Dual Problem

Assume a candidate Lyapunov function $V_\theta$ is a multilayer feed-forward network. We then formulate a cost function that measures the degree of violation of the Lyapunov conditions in (2). The learning process updates the parameters $\theta$ to improve the likelihood of satisfying the Lyapunov conditions. Simultaneously, we want the safe region to be as large as possible. Therefore, the cost function takes the form

    $\sup_{x \in D} \inf_{\theta} \; \max(0, -V_\theta(x)) + \max(0, \dot V_\theta(x)) + V_\theta(0)^2$,

    s.t. $x(i+1) = f(x(i)), \quad x(0) = x_0, \quad x(i) \in D, \quad i \in [0, N]$.

3.3 KKT Conditions

The corresponding first-order necessary (KKT) conditions can be stated as follows: at a local optimum $x^*$, under the defined Lyapunov conditions, there exist Lagrange multipliers $\lambda^*$ such that

    $L(x, \theta, \lambda) = V_\theta(x) + \lambda (A^T P + P A)$,
    $\nabla_\theta L(x^*, \theta^*, \lambda^*) = 0$,
    $A^T P + P A \preceq 0$,
    $\lambda^* \ge 0$,
    $\lambda^* (A^T P + P A) = 0$.

4 Methodologies

In this section, we introduce a learning-based architecture that seeks Lyapunov functions using neural networks and thereby certifies the stability of control systems. Our architecture contains two neural-network components: an actor that learns candidate Lyapunov functions, and an evaluator that finds states violating the Lyapunov conditions specified in (2). During the learning process, the actor network learns a parameterized Lyapunov function, taking the vectorized system state as input and outputting a scalar value. The objective of the actor network measures the degree of violation of the Lyapunov conditions and takes the following form:

    $L(\theta) = \mathbb{E}\big[\max(0, -V_\theta(x)) + \max(0, \dot V_\theta(x))\big] + V_\theta(0)^2$.

Accordingly, using Monte Carlo estimation, we obtain a surrogate empirical loss function by drawing samples:

    $L_e(\theta) = \frac{1}{N} \sum_{i=1}^{N} \big[\max(0, -V_\theta(x_i)) + \max(0, \dot V_\theta(x_i))\big] + V_\theta(0)^2$,

where the $x_i$ are state vectors of the system sampled in an i.i.d. manner. In each iteration, this objective is minimized using stochastic gradient descent (SGD); a minimal sketch of this training loop is given at the end of this section.

While it is generally difficult for existing algorithms to fully satisfy the Lyapunov conditions, our method provides guarantees over all states of the system by incorporating an evaluator network. The evaluator takes the learned Lyapunov function from the actor network and performs a global search for state vectors that fail to meet the constraints. Such undesired states are appended to the training set for subsequent training. Using the delta-complete constraint solving introduced in [8], the algorithm ensures that, when no violation is found, the Lyapunov conditions are satisfied across the examined domain. The learning process thus alternates between the actor network and the evaluator network.

By adjusting the cost functions in the framework, it is possible to obtain additional desired properties for the learned Lyapunov functions, such as tuning the region of attraction by adding a regularization term on the rate at which the Lyapunov function value increases with the radius of its level sets:

    $L(\theta) = \frac{1}{N} \sum_{i=1}^{N} \big(\|x_i\| - \alpha V_\theta(x_i)\big)$.

Compared to gradient-based methods that typically use the entire training set for each update, the adoption of SGD allows our algorithm to achieve much better per-iteration efficiency and, eventually, to converge to the desired solution much faster.
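The following is a minimal sketch of the actor's training loop for the empirical loss $L_e(\theta)$, written in PyTorch; the framework, network width, sampling domain, and use of the pendulum dynamics are assumptions made for illustration, and the evaluator's delta-complete counterexample search [8] is omitted.

```python
# Minimal sketch of the actor network and the empirical Lyapunov risk
# (framework, network width, and sampling domain are assumptions).
import torch
import torch.nn as nn

class LyapunovNet(nn.Module):
    """Candidate Lyapunov function V_theta : R^2 -> R."""
    def __init__(self, hidden=6):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(2, hidden), nn.Tanh(),
                                 nn.Linear(hidden, 1), nn.Tanh())

    def forward(self, x):
        return self.net(x).squeeze(-1)

def pendulum_f(x):
    """Normalized pendulum dynamics (3): x1' = x2, x2' = -sin(x1) - x2."""
    x1, x2 = x[:, 0], x[:, 1]
    return torch.stack([x2, -torch.sin(x1) - x2], dim=1)

def lyapunov_risk(V, x):
    """Empirical loss: mean[max(0, -V) + max(0, Vdot)] + V(0)^2."""
    x = x.requires_grad_(True)
    v = V(x)
    grad_v = torch.autograd.grad(v.sum(), x, create_graph=True)[0]
    v_dot = (grad_v * pendulum_f(x)).sum(dim=1)  # Lie derivative dV/dt
    v0 = V(torch.zeros(1, 2))
    return (torch.relu(-v) + torch.relu(v_dot)).mean() + (v0 ** 2).sum()

V = LyapunovNet()
optimizer = torch.optim.SGD(V.parameters(), lr=0.01)
for step in range(2000):
    x = 12.0 * torch.rand(64, 2) - 6.0  # i.i.d. samples from D = [-6, 6]^2
    loss = lyapunov_risk(V, x)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

In the full method, states returned by the evaluator as counterexamples would be appended to the sample set between training rounds.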

5 Experiments

To demonstrate the effectiveness of our method, we perform experimental evaluations on a nonlinear control problem. We demonstrate that our method successfully finds Lyapunov functions that fully satisfy the Lyapunov criteria. We also show that our framework outperforms existing algorithms in that it generates significantly larger regions of attraction (ROA) than the results presented in [9].

Normalized pendulum. The pendulum with normalized parameters is one of the most standard nonlinear benchmark problems. The two state variables of the system, $x_1$ and $x_2$, represent the angular position and angular velocity, respectively. The system dynamics can be described as

    $\dot x_1 = x_2$,
    $\dot x_2 = -\sin(x_1) - x_2. \qquad (3)$

Our learning procedure finds the following neural Lyapunov function: $V = \tanh(W_2 \tanh(W_1 x + B_1) + B_2)$, where $x = [x_1 \; x_2]^T$ and

    $W_1 = \begin{bmatrix} 0.3575 & 0.4428 & 0.2622 & 0.9552 & 0.3827 & 0.4264 \\ 0.2386 & 0.2765 & 0.9584 & 0.5441 & 0.5767 & 1.4503 \end{bmatrix}^T$,
    $W_2 = \begin{bmatrix} 1.1849 & 1.3468 & 1.7934 & 1.6629 & 1.4462 & 0.1228 \end{bmatrix}$,
    $B_1 = \begin{bmatrix} 1.4599 & 1.7899 & 1.3305 & 1.3885 & 1.4987 & 0.8466 \end{bmatrix}^T$,
    $B_2 = 1.8862$.

Figure 1: (left) Learned Lyapunov function for the normalized pendulum. Any trajectory starting inside the region of attraction (ROA) defined by the learned neural Lyapunov function approaches the equilibrium point as $t \to \infty$. (right) Comparison of the ROAs estimated from different Lyapunov functions. Our method enlarges the invariant set (red unit ball) obtained by the previous method in [9] to twice its size.
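As a rough sanity check on a learned candidate of this form, the sketch below samples the domain and reports the worst observed violations of conditions (2); the weight values are random placeholders standing in for trained parameters, and the finite-difference gradient is an implementation convenience, not the paper's verification procedure (which relies on the delta-complete solver [8]).

```python
# Sampling-based sanity check of a tanh neural Lyapunov candidate for the
# pendulum; weights are placeholders with the same layer sizes as reported.
import numpy as np

def V(x, W1, B1, W2, B2):
    """Neural Lyapunov candidate V(x) = tanh(W2 tanh(W1 x + B1) + B2)."""
    return np.tanh(W2 @ np.tanh(W1 @ x + B1) + B2)

def V_dot(x, W1, B1, W2, B2, h=1e-5):
    """Lie derivative grad V(x) . f(x), with grad V by central differences."""
    f = np.array([x[1], -np.sin(x[0]) - x[1]])  # pendulum dynamics (3)
    grad = np.array([(V(x + h * e, W1, B1, W2, B2)
                      - V(x - h * e, W1, B1, W2, B2)) / (2 * h)
                     for e in np.eye(2)])
    return grad @ f

rng = np.random.default_rng(0)
W1, B1 = rng.standard_normal((6, 2)), rng.standard_normal(6)
W2, B2 = rng.standard_normal(6), rng.standard_normal()

xs = rng.uniform(-6.0, 6.0, size=(10_000, 2))
vals = np.array([V(x, W1, B1, W2, B2) for x in xs])
dots = np.array([V_dot(x, W1, B1, W2, B2) for x in xs])
print("min V over samples:", vals.min())      # should be > 0 away from 0
print("max V_dot over samples:", dots.max())  # should be < 0 for a valid V
```

A trained network that passes such a sampling check would still be handed to the evaluator for complete verification over the examined domain.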

6 Task Assignments

The write-ups (outline and report) were completed as a team. Yuet Fung studied the prior work and formulated the constrained primal and dual problems. Lijing Kuang developed the learning-based architecture for approximating Lyapunov functions. Ya-Chien Chang conducted the experiments to demonstrate the effectiveness of the proposed framework.

7 Conclusions and Future Work

In this paper, we focus on the problem of stabilizing nonlinear control systems using convex optimization techniques. To address this fundamental issue in cyber-physical systems and to learn Lyapunov functions with provable guarantees, we introduce a neural network architecture that can be used to compute regions of stability for achieving equilibrium in nonlinear dynamical systems. As future work, it would be of interest to incorporate state-of-the-art reinforcement learning methods, such as trust region policy optimization (TRPO) and soft actor-critic (SAC), to search for optimal control policies in continuous domains.

References

[1] D. S. Bernstein and W. M. Haddad, "Robust stability and performance analysis for state-space systems via quadratic Lyapunov bounds," SIAM Journal on Matrix Analysis and Applications, vol. 11, no. 2, p. 239, 1990.
[2] R. Wang, L. Hou, G. Zong, S. Fei, and D. Yang, "Stability and stabilization of continuous-time switched systems: a multiple discontinuous convex Lyapunov function approach," International Journal of Robust and Nonlinear Control, vol. 29, no. 5, pp. 1499–1514, 2019.
[3] K. Byl and R. Tedrake, "Approximate optimal control of the compass gait on rough terrain," in 2008 IEEE International Conference on Robotics and Automation. IEEE, 2008, pp. 1258–1263.
[4] J. Anderson and A. Papachristodoulou, "Advances in computational Lyapunov analysis using sum-of-squares programming," Discrete and Continuous Dynamical Systems-B, vol. 20, no. 8, p. 2361, 2015.
[5] Y. Zhu, D. Zhao, X. Yang, and Q. Zhang, "Policy iteration for H-infinity optimal control of polynomial nonlinear systems via sum of squares programming," IEEE Transactions on Cybernetics, vol. 48, no. 2, pp. 500–509, 2017.
[6] R. Tedrake, "LQR-trees: Feedback motion planning on sparse randomized trees," 2009.
[7] S. M. Richards, F. Berkenkamp, and A. Krause, "The Lyapunov neural network: Adaptive stability certification for safe learning of dynamic systems," in CoRL, 2018.
[8] S. Gao, J. Avigad, and E. M. Clarke, "δ-complete decision procedures for satisfiability over the reals," in International Joint Conference on Automated Reasoning. Springer, 2012, pp. 286–300.
[9] J. Kapinski, J. V. Deshmukh, S. Sankaranarayanan, and N. Arechiga, "Simulation-guided Lyapunov analysis for hybrid dynamical systems," in Proceedings of the 17th International Conference on Hybrid Systems: Computation and Control, ser. HSCC '14. New York, NY, USA: Association for Computing Machinery, 2014, pp. 133–142. [Online]. Available: https://doi.org/10.1145/2562059.2562139
