Nonlinear Network Structures For Optimal Control

2y ago

10 Views

2 Downloads

539.63 KB

63 Pages

Last View : 2m ago

Last Download : 3m ago

Upload by : Ronnie Bonney

Report this link

Download PDF

Transcription

Nonlinear Network Structures forOptimal ControlCheng Tao & Frank L. LewisAdvanced Controls & Sensors GroupAutomation & Robotics Research Institute (ARRI)The University of Texas at ArlingtonSlide 1

Neural Network Solution for Fixed-Final TimeOptimal Control of Nonlinear SystemsCheng TaoFrank L. LewisAutomation & Robotics Research Institute (ARRI)The University of Texas at ArlingtonECC 07 Kos

Neural Network Robot ControllerUniversal Approximation PropertyFeedback linearization.qdNonlinear Inner LoopFeedforward Loopqd f(x)e[Λ I]rKvτqRobot SystemRobust Control v(t)TermPD Tracking LoopProblem- Nonlinear in the NN weights sothat standard proof techniques do not workEasy to implement with a few more lines of codeLearning feature allows for on-line updates to NN memory as dynamics changeHandles unmodelled dynamics, disturbances, actuator problems such as frictionNN universal basis property means no regression matrix is neededNonlinear controller allows faster & moreprecise motionSlide 3

Sponsored byNSF- Paul WerbosARO- Randy Zachery4 US PatentsSlide 4

Problem- Nonlinear in the NN weights sothat standard proof techniques do not workNew book by Jay Farrell and Marios PolycarpouAdaptive Approximation Based ControlSlide 5

Optimality in Biological SystemsCell HomeostasisThe individual cell is a complexfeedback control system. It pumpsions across the cell membrane tomaintain homeostatis, and has onlylimited energy to do so.Permeability control of the cell membraneCellular /index.htmlSlide 6

ARRI Research Roadmap in Neural Networks3. Approximate Dynamic Programming – 2006Nearly Optimal ControlBased on recursive equation for the optimal valueUsually Known system dynamics (except Q learning)The Goal – unknown dynamicsExtend adaptive control toyield OPTIMAL controllers.On-line tuningNo canonical form needed.Optimal Adaptive Control2. Neural Network Solution of Optimal Design Equations – 2002-2006Nearly Optimal ControlBased on HJ Optimal Design EquationsKnown system dynamicsPreliminary Off-line tuningNearly optimal solution ofcontrols design equations.No canonical form needed.1. Neural Networks for Feedback Control – 1995-2002Extended adaptive controlBased on FB Control ApproachUnknown system dynamicsto NLIP systemsOn-line tuningNo regression matrixSlide 7NN- FB lin., sing. pert., backstepping,force control, dynamic inversion, etc.

Objective and Significance Provide a tool to solve finite-horizon continuous-time optimal controlproblems for nonlinear systems. Continuous time finite horizon optimal control problems appear applicationsin which people use model predictive control (receding horizon control).Slide 8

Outline:1. Fixed-Final Time Optimal Control of Nonlinear Systems Using NeuralNetwork HJB Approach2. Neural Network Solution for Finite-Final Time H-InfinityFeedback ControlState3. Neural Network Solution for Fixed-Final time Constrained OptimalControl This research was supported by NSF grant ECS-0140490 andARO grant DAAD 19-02-1-0366.Slide 9

Review of Related work and MotivationApproximate HJB solutionsMunos et. al [65]Constrained-input optimizationSussmann, Sontag and yang [84](Gradient descent approaches)Kim, Lewis and Dawson [47](NNs)Huang and Lin [44](Taylor series expansion)NN applications to an optimalcontrolMiller [63](NNs for control)Bernstein [15]Dolphus [33]Abu-Khalaf, M [1](Infinity horizon)Unconstrained policy iteration withfinite-time horizonBeard[11]Parisini and Zoppoli [70](Infinite horizon)Slide 10

Background on Fixed-Final-Time HJB Optimal ControlNonlinear dynamical systemx f ( x ) g ( x )u (t )(1).wheren mx ℜ , f ( x) ℜ , g ( x) ℜand the input u (t ) R mnnIt is desired to find the control u that minimizes a generalized nonquadratic functionalV ( x(t 0 ), t 0 ) φ ( x(t f ), t f ) [Q( x) W (u )]dtttf0with Q (x) , W (u ) positive definite on ΩSlide 11(2)

Background on Fixed-Final-Time HJB Optimal ControlAn infinitesimal equivalent to (2) is V ( x, t ) V (x, t ) L ( f ( x) g( x)u(t ))x t T(3)where L Q(x) W (u) . This is a time-varying partial differential equation with V (x, t )the cost function for any given u(t ) and is solved backwards in time from t t f .By setting t 0 t fin (2) its boundary condition isV (x(t f ), t f ) φ (x(t f ), t f )Slide 12(4)

Background on Fixed-Final-Time HJB Optimal ControlAccording to Bellman’s optimality principle, the optimal cost is given by* T V ( x, t ) V ( x, t ) ( f ( x ) g ( x )u ( x )) min L u (t ) t x which yields the optimal control*1 1T V ( x, t )*u (x ) R g (x )2 x*(5)(6)where V * ( x, t ) is the optimal value function.Substituting (6) into (5) yields the well-known time-varyingHamilton-Jacobi-Bellman (HJB) equation V ( x, t ) V ( x, t )1 V ( x, t )f (x ) Q(x ) x t x4***TSlide 13g ( x )R g ( x ) 1T V ( x, t ) 0 (7) x*

Background on Fixed-Final-Time HJB Optimal ControlThen (5) becomes(HJB V ( x, t ) ) V ( x, t ) V ( x, t )f (x ) Q(x ) t x**1 V ( x, t )T V ( x, t ) 0g ( x )R 1 g ( x )4 x x*T *(8)If this HJB equation can be solved for the value function V (x, t ) , then the optimalcontrol is1T V ( x, t )u (x ) R 1 g (x ) x2**Slide 14

Nonlinear Fixed-Final-Time HJB Solution by NN Least-Squares ApproximationNN Approximation of the Cost Function V (x, t )In Sandberg [78], it is shown that NNs with time-varyingweights can be used touniformly approximate continuous time-varying functions.[]Using the following equation to approximate V (x, t ) for t t 0 , t f on a compactset Ω ℜ nLV L ( x, t ) w j (t )σ j ( x ) wLT (t )σ L ( x )j 1The NN weights are w j (t ) and L is the number of hidden-layer neurons.σ L ( x ) [σ 1 ( x )σ 2 ( x ).σ L ( x )] is the vector of activation function.Tw L (t ) [w1 (t )w 2 (t ).w L (t )] is the vector of NN weights.TSlide 15(9)

Nonlinear Fixed-Final-Time HJB Solution by NN Least-Squares ApproximationNote:The set σ j ( x ) is selected to be independent. Then without loss of generality, they canbe assumed to be orthonormal, i.e. select equivalent basis functions to σ j ( x ) that are also orthonormal. The orthonormality of the set {σ j ( x )}1 on Ω implies that if afunction ψ ( x, t ) L2 (Ω ) then ψ (x, t ) ψ ( x, t ),σ j (x ) Ω σ j (x )j 1wheref,gΩ f gdxΩis inner product.Slide 16

Nonlinear Fixed-Final-Time HJB Solution by NN Least-Squares ApproximationNote that V L ( x, t ) σ TL (x )w L (t ) σ TL (x )w L (t ) x x(10)where σ L ( x ) is the Jacobian σ L ( x ) x, and that V L ( x, t ) TL (t )σ L ( x ) w t(11)Therefore approximating V ( x, t ) by V L ( x, t ) uniformly in in the HJB equation (8)results in TL (t )σ L ( x ) w TL (t ) σ L ( x ) f ( x ) w1 w TL (t )σ L ( x )g ( x )R 1 g T ( x )σ TL ( x )w L (t )4 Q ( x ) eL ( x , t )Slide 17(12)

Nonlinear Fixed-Final-Time HJB Solution by NN Least-Squares ApproximationorL HJB V L ( x, t ) w j (t )σ j (x ) e L (x, t )j 1 (13)where e L (x, t ) is a residual equation error. The corresponding optimal control input is1T V ( x, t )u (x ) R 1 g (x ) x2** R 1 g T σ TL ( x )w L (t )(14)To find the least-squares solution for w L (t ) , the method of weighted residuals is used e L ( x, t ), e L ( x, t ) L (t ) w 0ΩSlide 18

Nonlinear Fixed-Final-Time HJB Solution by NN Least-Squares Approximation L (t ) w σ L ( x), σ L ( x) 1Ω σ L ( x) f ( x), σ L ptimal ControlWhen (22) is used, (5) becomes**T u()Vxt V ( x, t ), T ( f ( x, t ) g ( x )u ( x )) min Q( x ) 2 φ (v )Rdv 0u (t ) t x Minimizing the Hamiltonian of the optimal control problem with regard to u gives V ( x, t )g (x ) 2φ 1 (u * ) 0 x*Tso* 1 1T V ( x, t ) u ( x ) φ R g ( x ) x 2*Slide 53u U ℜ m(23)

Neural Network Solution for Fixed-Final time Constrained Optimal ControlHJB equation(HJB V (x, t )*) V ( x, t ) V (x, t ) t x 2 φ T (v )Rdv u0*T* V ( x, t ) x*Tf (x ) 1T V ( x, t ) g ( x ) φ R 1 g (x ) x 2* Q(x ) 0 (24)If this HJB equation can be solved for the value function V (x, t ) , then (24) gives theoptimal constrained control.Slide 54

Neural Network Solution for Fixed-Final time Constrained Optimal ControlSo that L (t ) σ L ( x ), σ L ( x )w σ L ( x ), σ L ( x ) σ L ( x ), σ L ( x ) σ L ( x ), σ L ( x ) 1ΩΩ σ L ( x ) f ( x ), σ L ( x )2 φ T (v )Rdv, σ L ( x )Ω 1Ω w L (t )u0 1Ω 1Ω 1 w (t ) σ L ( x ) g ( x ) φ R 1 g T ( x ) σ TL (x )w L (t ) , σ L (x ) 2 (25)TL Q( x ), σ L ( x )ΩSlide 55Ω

Neural Network Solution for Fixed-Final time Constrained Optimal ControlOptimal Algorithm Based on NN Approximation(12) can be converted to L (t ) A T Bw L (t ) A T C A T Dw L (t ) A T E 0 A T Aw(26)then L (t ) (AT A) AT Bw L (t ) (AT A) ATw 1 1 (A A) A Dw L (t ) (A A) A E 1TTT 1(27)TThis is a nonlinear ODE that can easily be integrated backwards using finalconditionw L (t f)to find the least-squares optimal NN weights.Slide 56

Neural Network Solution for Fixed-Final time Constrained Optimal ControlNumerical Examplesa) Linear Systemx 1 2 x1 x 2 x3x 2 x1 x 2 u 2x 3 x3 u1u1 5u 2 20(28)To find a nearly optimal time-varying controller, the following smooth functionis used to approximate the value function of the systemV ( x1 , x 2 ) w1 x12 w2 x 22 w3 x 32 w4 x1 x 2 w5 x1 x 3 w6 x 2 x 3Slide 57(29)

State 220-3100051015Time2025-430Fig. 15 Constrained Linear System Weights051020u1u2151050-5-100515Time202530Fig. 16 State Trajectory of Linear System with BoundsOptimal Control with BoundsControl InputW0401015Time202530Fig. 17 Optimal NN Control Law with BoundsSlide 58

Neural Network Solution for Fixed-Final time Constrained Optimal Controlb) Nonlinear Chained Systemx 1 u1x 2 u 2u1 1x 3 x1u 2u2 2(30)Selecting the smooth approximating functionV ( x1 , x 2 , x3 ) w1 x12 w2 x 22 w3 x32 w4 x1 x 2 w5 x1 x3 w6 x 2 x3 w7 x14 w8 x 24 w9 x34 w10 x12 x 22 w11 x12 x32 w12 x 22 x32 w13 x12 x 2 x3 w x x x3 w15 x1 x 2 x w x x 2 w x x w x x w x x214 1 223316 1317 1 3 w20 x 2 x33 w21 x 23 x3Slide 59318 1 2319 1 3(31)

Neural Network Solution for Fixed-Final time Constrained Optimal ControlNN WeightsState 015Time2025300510Optimal Control with Constrains0.5u1u20-0.5-1-1.5-2015Time2025Fig. 19 State Trajectory of Nonlinear SystemFig. 18 Nonlinear System WeightsControl Input-4x1x2x351015Time2025Fig. 20 Optimal NN Constrained Control LawSlide 60

C) Simulation-BenchmarkProblemNearly Optimal Controller State TrajectoriesNearly Optimal Controller State Trajectories1.53rtheta2rdotthetadot110.50-1x 2,x 4x 1,x 30-2-0.5-3-1-4-1.5-5-60102030405060Time in seconds708090-2100r θ State TrajectoriesFig. 0Fig. 23405060Time in seconds70u (t ) Control Input405060Time in seconds708090100r θ State Trajectories4310305-0.2020Nearly Optimal Controller Cost with Constrains0.5-0.510Fig. 22Nearly Optimal Controller with Constrainscontrol080901000102030405060Time in seconds7080Fig. 24 Disturbance AttenuationSlide 6190100

Overview of the Method Neural networks are used to approximately solve the finite-horizon optimalstate feedback control problem The method is based on solving a related Hamilton-Jacobi equation of thecorresponding finite-horizon problem Transform the problem into solving an ODE equation backwards in time. Neural network approximation converges uniformly to the function and theresulting controller provides closed-loop stability. The result is a nearly exact feedback controller with time-varyingcoefficients. No policy iteration needed.Slide 62

Slide 63

1. Fixed-Final Time Optimal Control of Nonlinear Systems Using Neural Network HJB Approach 2. Neural Network Solution for Finite-Final Time H-Infinity State Feedback Control 3. Neural Network Solution for Fixed-Final time Constrained Optimal Control This research was supported

Related Documents:

Bruksanvisning för bilstereo Bruksanvisning for bilstereo ... - Jula

Bruksanvisning för bilstereo . Bruksanvisning for bilstereo . Instrukcja obsługi samochodowego odtwarzacza stereo . Operating Instructions for Car Stereo . 610-104 . SV . Bruksanvisning i original

376 Views

1y ago

10 tips och tricks för att lyckas med ert sap-projekt

10 tips och tricks för att lyckas med ert sap-projekt 20 SAPSANYTT 2/2015 De flesta projektledare känner säkert till Cobb’s paradox. Martin Cobb verkade som CIO för sekretariatet för Treasury Board of Canada 1995 då han ställde frågan

737 Views

2y ago

Nordens 25 största medieföretag efter omsättning

service i Norge och Finland drivs inom ramen för ett enskilt företag (NRK. 1 och Yleisradio), fin ns det i Sverige tre: Ett för tv (Sveriges Television , SVT ), ett för radio (Sveriges Radio , SR ) och ett för utbildnings program (Sveriges Utbildningsradio, UR, vilket till följd av sin begränsade storlek inte återfinns bland de 25 största

339 Views

1y ago

SS 02 52 68 Ljudklassning av utrymmen i byggnader - byggtjanst.se

Hotell För hotell anges de tre klasserna A/B, C och D. Det betyder att den "normala" standarden C är acceptabel men att motiven för en högre standard är starka. Ljudklass C motsvarar de tidigare normkraven för hotell, ljudklass A/B motsvarar kraven för moderna hotell med hög standard och ljudklass D kan användas vid

358 Views

1y ago

Apple Developer Program License Agreement (Swedish)

LÄS NOGGRANT FÖLJANDE VILLKOR FÖR APPLE DEVELOPER PROGRAM LICENCE . Apple Developer Program License Agreement Syfte Du vill använda Apple-mjukvara (enligt definitionen nedan) för att utveckla en eller flera Applikationer (enligt definitionen nedan) för Apple-märkta produkter. . Applikationer som utvecklas för iOS-produkter, Apple .

345 Views

1y ago

CHAP 2 Nonlinear Finite Element Analysis Procedures

Nonlinear Finite Element Analysis Procedures Nam-Ho Kim Goals What is a nonlinear problem? How is a nonlinear problem different from a linear one? What types of nonlinearity exist? How to understand stresses and strains How to formulate nonlinear problems How to solve nonlinear problems

62 Views

3y ago

19. Nonlinear Optics19. Nonlinear Optics

Third-order nonlinear effectThird-order nonlinear effect In media possessing centrosymmetry, the second-order nonlinear term is absent since the polarization must reverse exactly when the electric field is reversed. The dominant nonlinearity is then of third order, 3 PE 303 εχ The third-order nonlinear material is called a Kerr medium. P 3 E

19 Views

1y ago

TO THE TEACHER - Alfred Music

in Prep Course Lesson Book A of ALFRED'S BASIC PIANO LIBRARY. It gives the teacher considerable flexibility and is intended in no way to restrict the lesson procedures. FORM OF GUIDE The Guide is presented basically in outline form. The relative importance of each activity is reflected in the words used to introduce each portion of the outline, such as EMPHASIZE, SUGGESTION, IMPORTANT .

65 Views

3y ago

Recent Views

Pegasus - University of Exeter Blogs

'Pegasus', Dept. of Classics and Ancient History, Amory Building, Rennes Drive, University of Exeter, Exeter EX4 4RJ E-mail: pegasus@exeter.ac.uk . Pegasus - 2 - Issue 52 (2009) s in . The major event this last year was the announcement of the outcome of the Research Assessment Exercise 2008 in .

11m ago

72 Views

MANAGERIAL FINANCE - GBV

of Managerial Finance page 2 Introduction to Managerial Finance 1 Starbucks—A Taste for Growth page 3 1.1 Finance and Business What Is Finance? 4 Major Areas and Opportunities in Finance 4 Legal Forms of Business Organization 5 Why Study Managerial Finance? Review Questions 9 1.2 The Managerial Finance Function 9 Organization of the Finance

3y ago

6.8K Views

Chapter 1 The roles of finance function in organisations

The roles of the finance function in organisations 4. The role of ethics in the role of the finance function Ethics is the system of moral principles that examines the concept of right and wrong. Ethics underpins an organisation’s sustained value creation. The roles that the finance function performs should be carried out in an .File Size: 888KBPage Count: 10Explore furtherRole of the Finance Function in the Financial Management .www.managementstudyguide.c Roles and Responsibilities of a Finance Department in a .www.pharmapproach.comRoles and Responsibilities of a Finance Department .www.smythecpa.comTop 10 – Functions of Business Finance in an om23 Functions and Duties of Accounting and Finance nded to you b

2y ago

335 Views

Mathematics 2 Problem Sets - Exeter

Mathematics 2 Mathematics Department Phillips Exeter Academy Exeter, NH August 2019. To the Student Contents: Members of the PEA Mathematics Department have written the material in this book. As you work through it, you will discover that algebra, geometry, and trigonometry have been integr

2y ago

119 Views

The Seafarer RL 4 The Wanderer The Wife’s Lament

survive in the Exeter Book, a manuscript of Anglo-Saxon poems produced by a single scribe around a.d. 950. In addition to these and other secular poems, the Exeter Book contains religious verse, nearly 100 riddles, and a heroic narrative. It is the largest collection of Old English poetry in existence. Neglected Treasure Originally, the Exeter

2y ago

139 Views

UPPER SCHOOL - Exeter

Club, Exeter Soccer Club, or Exeter Volleyball Club. UPPER SCHOOL students may also partici-pate in musical or choral groups. DESIGNING YOUR OWN CURRICULUM As an UPPER SCHOOL student, you have the free-dom to design your own academic curriculum. You may enroll in any three of the more than

2y ago

341 Views

Exeter a gentleman in Moscow weekend

“A Gentleman in Moscow” Itinerary at a Glance: Day 1 Arrive in Moscow Day 2 Backstage Tour of the Bolshoi Day 3 Kremlin Tour Day 4 Depart Moscow . Why Exeter International? Our Knowledge & Experience . At Exeter International we have been creating mem

2y ago

328 Views

Phillips Exeter Academy Courses of Instruction 2022-23

Academic Excellence Academic excellence is a signature strength of Phillips Exeter Academy. In every discipline and . — academic, artistic, athletic and extracurricular — . and passions and the agency needed to carry these forward. Non Sibi Non Sibi, or Not For Oneself, inscribed on Exeter’s

1y ago

114 Views

Exeter College Association

club in Takoradi, Ghana. Katrina Hancock read Earth Sciences at Exeter between 1998 and 2002. She joined the Development Office in 2004 and has been Director of Development since 2006. Mark Houghton-Berry,Honorary Fellow, read Literae Humaniores at Exeter between 1976 and 1980. He is CEO of Tudor Capital LP, the European arm of a US based hedge .

1y ago

95 Views

Exeter Plan: Outline Draft Plan Consultation Sustainability Appraisal .

1. Draft Non-Technical Summary of the SA Report for the Exeter Plan (Outline Draft Consultation) M. Andrew B. Miller S. Temple K. Nicholls K. Nicholls 10.08.2022 2. Final Non-Technical Summary of the SA Report for the Exeter Plan (Outline Draft Consultation) M. Andrew B. Miller S. Temple K. Nicholls K. Nicholls 09.09.2022 3.

1y ago

108 Views

2017-2018 GRANDE ÉCOLE MSc in MANAGEMENT

Descriptif des cours Course Outlines 10 Catalogue des cours/ Course Catalog 2017-2018 FIN: Finance/Finance A : Actuariat/Actuarial, Insurance E : Finance d’entreprise/Corporate Finance The course liste tables and the course outlines G : Finance générale/General Finance M : Finance de marché/Market Finance S : Synthèse/Synthesis IDS: Systèmes d’Information, Sciences de la Décision et .

3y ago

312 Views

Behavioral Finance and Wealth L Management

Introduction to Behavioral Finance CHAPTER1 What Is Behavioral Finance? Behavioral Finance: The Big Picture Standard Finance versus Behavioral Finance The Role of Behavioral Finance with Private Clients How Practical Application of Behavioral Finance Can Create a Successful Advisory Rel

2y ago

377 Views

Catalogue des Cours Course Catalog - ESSEC Business School

10 Catalogue des cours/Course Catalog 2021-2022 FIN: Finance/Finance E : Finance d'entreprise/Corporate Finance G : Finance générale/General Finance M : Finance de marché/Market Finance S : Synthèse/Synthesis IDS: Systèmes d'Information, Sciences de la Décision et Statistiques/ Information Systems, Decision Sciences and Statistics

1y ago

222 Views

SINGAPORE - Kelly Services

FINANCE Chief Financial Officer Degree/Master 15 20,000 25,000 Finance Assistant Diploma 1-3 2,800 3,400 Finance Controller Degree 10-15 10,000 18,000 Finance Director Degree 15 15,000 20,000 Finance Executive/ Senior Finance Executive Degree 2-5 3,000 6,000 Finance Manager/ Assistan

2y ago

527 Views

Ministries of Finance and Nationally Determined Contributions

Rodrigo Rojo, IDB Sr. Consultant and advisor to Ministry of Finance of Chile. Colombia German Romero Otalora and Laura Marcela Ruiz Daza — Office of the Vice-Minister — Ministry of Finance. Ireland Paul Ryan — International Finance Division — Ministry of Finance Sean Judge — Department of Finance — Ministry of Finance

1y ago

232 Views

Nonlinear Network Structures For Optimal Control

It looks like you're using an ad-blocker