Optimal Control With Distorted Probability Distributions

2y ago

32 Views

3 Downloads

262.15 KB

15 Pages

Last View : 17d ago

Last Download : 3m ago

Upload by : Mya Leung

Report this link

Download PDF

Transcription

Optimal Control with Distorted ProbabilityDistributionsKerem UğurluTuesday 11th February, 2020AbstractWe study a robust optimal control of discrete time Markov chains with finite terminal T and bounded costs using probability distortion. The time inconsistency of theseoperators and hence its lack of dynamic programming are discussed. Due to that,dynamic versions of these operators are introduced, and its availability for dynamicprogramming is demonstrated. Based on dynamic programming algorithm, existenceof the optimal policy is justified and an application of the theory to portfolio optimization is also presented.Keywords: Decision Science; Probability Distortion; Controlled Markov Chains; Risk Management; Mathematical Finance1IntroductionDynamic programming [1] is one of the fundamental areas in operations research. Initially,dynamic programming models have used expected value as the performance criteria, butsince in many real life scenarios, the expected value as the performance measure is notappropriate, models with risk aversion are represented via different approaches. The firstapproach is using concave utility functions modelling risk-aversion (see e.g. [2, 3, 4, 5, 6] andthe reference therein). Another approach has been to use the so-called coherent/convex riskmeasures. Starting from the seminal work of Artzner et al. [7] dynamic coherent/convexrisk measures have seen huge developments since then (see [8, 20, 10, 11, 12, 13, 14]).The third approach that we will follow in this paper is so called probability distortion.Probability distortion is an approach that is used frequently in behaviour finance ( see e.g.1

2[15, 16, 17]). It has been motivated by empirical studies in behavioural finance and aims tomodel the human tendency to exaggerate small probabilities of extreme events with respectto the underlying probability measure such as catastrophic ruin or the chance of winninglottery. This is characterized by an operator on random outcomes, where the underlyingprobability distribution is distorted by a weight function w satisfying some properties. Theability to capture human decision dynamics under uncertainty has strong empirical support([18]).Although, modelling random outcomes representing gains/losses using probability distortion goes back to at least 1970’s ([15]), its axiomatic incorporation into multiperiod settingsis still absent in the literature. There are few recents works in this direction. [19] studiesa portfolio optimization problem in continuous time using probability distortion. [21] studies a discrete time controlled Markov chain in infinite horizon. [22] extends the probabilitydistortion operator to multitemporal setting under some monotonicity assumptions of thecost/gain functions.The reason for scarcity of using probability distortions in multitemporal settings is it doesnot satisy “Dynamic Programming Principle” (DPP) or “Bellman Optimality Principle”.Namely, a sequence of optimization problems with the corresponding optimal controls iscalled time-consistent, if the optimal strategies obtained when solving the optimal controlproblem at time s stays optimal when the optimal control problem is solved at time t s.We refer the reader to [26] for an extensive study on time-consistency.In this paper, we introduce a dynamic version of probability distortion that does notsuffer from time-inconsistency. Hence, DPP can be applied readily in our framework undercontrolled Markov chains, and additionally DPP gives the existence of the optimal policyunder some reasonable assumptions on the model.The rest of the paper is as follows. In Section 2, we describe probability distortion onrandom variables in static one period case first. Next, we introduce the concept of dynamicprobability distortion on stochastic processes in multi-temporal discrete time setting. InSection 3, we introduce the controlled Markov chain framework that we are going to workon. In Section 4, we state and solve our optimal control problem by characterizing theoptimal policy. In Section 5, we apply our results to a portfolio optimization problem andconclude the paper.

32Probability Distortion2.1Probability Distortion on Random VariablesLet (Ω, F, P) be a probability space and denote by L (Ω, F, P) the set of non-negativeessentially bounded random variables on (Ω, F).Definition 2.1. A mapping w : [0, 1] [0, 1] is called a distortion function, if it iscontinuous, strictly increasing, and satisfies w(0) 0 and w(1) 1. For any ξ L (Ω, F, P), the operator with respect to the distortion function w isdefined byZ w(P(ξ z))dz(2.1)ρ(ξ) ,0Lemma 2.1. (i) Let x, y, α [0, 1], ξ L (Ω, F, P) and w : [0, 1] [0, 1] be a distortionfunction that satisfiesw(αx (1 α)y) αw(x) (1 α)w(y).(2.2)Then, ρ(ξ) E[ξ]. Namely, for ξ representing the nonnegative bounded random losses,ρ(·) evaluates a bigger risk for ξ than E[·] does.(ii) Conversely, suppose w satisfies for α [0, 1]w(αx (1 α)y) αw(x) (1 α)w(y),(2.3)then ρ(ξ) E[ξ]. Namely, for ξ representing the nonnegative bounded random gains,ρ(·) evaluates a smaller gain for ξ than E[·] does.Proof. We will only prove the first part. By w(0) 0 and (2.2), we have w(αx) αw(x)for any α [0, 1]. In particular, for x 1, we get w(α) α for any α [0, 1]. Thus,w(P(ξ z)) P(ξ z) for any z R. By taking integrals on both sides, we conclude theresult. Remark 2.1. Lemma 2.1 implies that (2.2), respectively (2.3), is an appropriate property ofthe distortion function w for modelling risk averse behaviour towards random costs, respectively risk seeking behaviour, towards random profits.Lemma 2.2. Let ρ : L (Ω, F, P) R be the distortion operator as in (2.1). Then

4(i) ρ is positively translation invariant, i.e., ρ(ξ c) ρ(ξ) c for c 0. In particular,ρ(c) c for any c 0.(ii) ρ is positively homogeneous, i.e. ρ(λξ) λρ(ξ) for λ 0.(iii) ρ is monotone, i.e. ρ(ξ1 ) ρ(ξ2 ) for ξ1 , ξ2 L (Ω, F, P) and ξ1 ξ2 .Proof.(i)Z w(P(ξ c z))dzρ(ξ c) 0Z w(P(ξ z c))dzZ 0Z w(P(ξ z))dz 0 c w(P(ξ z))dz0 c ρ(ξ)Moreover, we have Zw(P (0 z))dzρ(0) Z0w(P (0 z))dz {0}Zw(1)dz 0 {0}Hence, by the first equality above, we have ρ(0 c) c for c 0.(ii) Zw(P(λξ z))dzρ(λξ) 0Z w(P(ξ λ0zz))dλ λ λρ(ξ)(iii) Since ξ1 ξ2 and w is monotone, we have for any z 0P(ξ1 z) P(ξ2 z)w(P(ξ1 z)) w(P(ξ2 z))Thus, we have ρ(ξ1 ) ρ(ξ2 ).

52.2Dynamic Probability Distortion on Stochastic ProcessesThe main issue occurs when one tries to extend (2.1) to the multiperiod setting. In particular,it is not clear what the “conditional version” of distortion operator is. Hence, we firstconstruct the corresponding operator for multitemporal dynamic setting.Fix T N and denote T , [0, 1, . . . , T ] and Te , [0, 1, . . . , T 1]. Let Ω be the samplespace with F0 F1 . . . FT being the filtration and P being the probability measure onΩ such that (Ω, (Ft )t T , P) is the stochastic basis. Let ξ (ξt )t T be a discrete time nonnegative stochastic process that is adapted to the filtration (Ft )t T and uniformly boundedthat is supt T ess sup(ξt ) . We denote in that case ξ L (Ω, (Ft )t T , P). Then, if wedefine for t TZ w(P(ξT z Ft ))dz,ρt (ξT ) ,0we do not necessarily haveρ(ξ) ρ(ρt (ξ))In particular, the “tower property” of expectation operator fails in distortion operators. Inthe context of stochastic optimization, this implies that the optimization problem becomes“time-inconsistent”, i.e. the “Dynamic Programming Principle” (DPP) does not hold. Onthe other hand, for w(x) x, the distortion operator (2.1) reduces to expectation operator,where for Et [ξT ] , E[ξT Ft ], we have E[ξT ] E[Et [ξT ]] with the towering property and DPPholds (see Example 2.1 below.)Thus, we define first dynamic distortion mappings on a filtered probability space (Ω, (Ft )t T )analogous to Definition 2.1 in multitemporal setting as follows.Definition 2.2. Let t Te and ξt 1 L (Ω, Ft 1 , P). w(·) is the distortion function as inDefinition 2.1. A one-step dynamic distortion mapping %t 1 t : L (Ω, Ft 1 , P) L (Ω, Ft , P) is defined asZ %t 1 t (ξt 1 ) ,w(P(ξt 1 zt 1 Ft ))dzt 10 A mapping %t : L (Ω, FT , P) L (Ω, Ft , P) is called a dynamic distortion mapping,if it is composition of one step dynamic distortion mappings of the form%t , %t 1 t . . . %T T 1Remark 2.2. Definition 2.2 is well defined. Indeed, let ξT L (Ω, FT , P), going backwardsiterative, by properties of w and construction of %t , uniform boundedness and Fs measurability

6at each s [t, . . . , T ] are preserved, such that %t (ξT ) maps ξT to L (Ω, Ft , P). Furthermore,by construction %s (·) %s (%t (·)) for 0 s t T . In particular, it is a time-consistentoperator. Lemma 2.3. For t T , let %t : L (Ω, FT , P) L (Ω, Ft , P) be the dynamic distortionoperator as in Definition 2.2 and ξ, ξ1 , ξ2 L (Ω, FT , P). Then(i) %t is positively translation invariant, i.e., %t (ξ c) %t (ξ) c P-a.s., if c is nonnegativeand Ft measurable.(ii) %t is positively homogeneous, i.e. %t (λξ) λ%t (ξ) P-a.s. for any scalar λ 0.(iii) %t is monotone, i.e. %t (ξ1 ) %t (ξ2 ) P-a.s. for ξ1 ξ2 , P-a.s.Proof. The proof is a simple modification of Lemma 2.2. Next, we illustrate the failure of towering property that causes time inconsistency via thefollowing example.Example 2.1. Let X and Y be two i.i.d. random variables on some probability space(Ω, F, P) with P(X 1) P(Y 2) 12 and w(x) x1/2 . Let ξ1 X and ξ2 X Y ,with F1 σ(X) and F2 σ(X, Y ). Thenρ2 1 (ξ2 ) X ρ1 0 (Y )1 X w(1) w( ),2where we use Lemma 2.3 i) in the first equality, above. Similarly, we have1ρ1 0 ρ2 1 (ξ2 ) ρ(X) w(1) w( )21 2w(1) 2w( )21 1/2 2 2( ) .2

7On the other hand, we haveZ w(P (ξ2 z))dzρ(ξ2 ) 0ZZZ w(P (ξ2 z))dz w(P (ξ2 z))dz w(P (ξ2 z))dz[0,2](2,3](3,4]ZZw(3/4)dz w(1/4)dz 2w(1) (2,3](3,4] 2 w(3/4) w(1/4)31 2 ( )1/2 ( )1/244Hence, ρ1 0 ρ2 1 (ξ2 ) ρ(ξ2 ) by strict concavity of w. We further note that the two expressionswould be equal to each other, if w(x) x.3Controlled Markov Chain FrameworkIn this section, we are going to introduce the necessary background on controlled Markovchain (see e.g. [25]) that we are going to work on with dynamic probability distortion. Wetake the control model M {Mt , t T }, where we haveMt : (St Xt , At , Kt , Qt , Ft , rt )with the following components: St Xt and At denote the state and action (or control) space, respectively, whichare assumed to be complete seperable metric spaces with their corresponding Borelσ-algebras B(St Xt ) and B(At ). We emphasize here that the state space is composedof two spaces, St and Xt , where both are subsets of Polish spaces. For each (s, x) St Xt , let At (s, x) At be the set of all admissible controls in thestate (s, x). We assume that At (s, x) is compact for t T and denoteKt : {(s, x, a) : (s, x) St Xt , a At (s, x)}as the set of feasible state-action pairs. We define the system function asFt (st , xt , at , ηt ) , (st 1 , xt 1 ),(3.1)

8for all t Te with xt Xt and at At with i.i.d. random variables (ηt )t Te on aprobability space (Y, B(Y ), P η ) with values in Y that are complete separable Borelspaces. We assume that the mapping (s, x, a) F (s, x, a, y) in (3.1) is continuous onSt Xt At for every y Y at every t Te . LetΩ , Tt 0 (St Xt )and for t T , andFt σ(S0 , X0 , A0 , . . . , St 1 , Xt 1 , At 1 , St , Xt )be the filtration of increasing σ-algebras. Let Ft be the family of measurable functions and πt Ft with πt : St Xt At fort Te . A sequence (πt )t Te of functions πt Ft is called a control policy (or simplya policy), and the function πt (·, ·) is called the decision rule or control at time t. Wedenote by Π the set of all control policies. We denote by P(At (st , xt )) as the set ofprobability measures on At (st , xt ) for each time t Te . A randomized Markovian policyπ (πt )t Te is a sequence of measurable functions such that πt (st , xt ) P(At (st , xt ))for all (st , xt ) St Xt , i.e. πt (st , xt ) is a probability measure on At (st , xt ). (πt )t Te iscalled a deterministic policy, if πt (st , xt ) at with at At (st , xt ). Let rt (st , xt , at ) : St Xt At R for t Te and rT : ST XT R be thenon-negative real-valued reward-per-stage and terminal reward function, respectively.For (πt )t Te Π, we writert (st , xt , πt ) , rt (st , xt , πt (st , xt )), rt (st , xt , at ). For a fixed π Π and given x0 X0 with πt (xt , st ) at , we aggregate the cumulativereward at time t [1, . . . , T 1] asxt , x0 i t 1Xri (si , xi , ai )i 0 Let π Π and x0 X0 be given. Then, there exists a unique probability measure Pπon (Ω, F) such that given (st , xt ) St Xt , a measurable set Bt 1 St 1 Xt 1 and(st , xt , at ) Kt , for any t Te , we haveQt 1 (Bt 1 st , xt , at ) , Pπt 1 (xt 1 Bt 1 st , xt , at , . . . , x0 ).

9Here, Qt 1 (Bt 1 st , xt , at ) is the stochastic kernel (see e.g. [23]). Namely, for each pair(st , xt , at ) Kt , Qt 1 (· st , xt , at ) is a probability measure on St 1 Xt 1 , and for eachBt 1 Bt 1 (St 1 Xt 1 ), Qt 1 (Bt 1 ·, ·, ·) is a measurable function on Kt . We remarkthat at each t T , the stochastic kernel depends only on (st , xt , at ) rather than thewhole history (x0 , a0 , x1 , a1 , s1 , . . . , st , at , xt ). By (3.1), we haveZQt 1 (Bt 1 st , xt , at ) IBt 1 [F (s, x, a, y)]dP η (y), Bt 1 B(St 1 Xt 1 ),Ywhere IBt 1 denotes the indicator function of Bt 1 .Assumption 3.1. The reward functions rt (st , xt , at ) for t Te and rT (sT , xT ) are nonnegative, continuous in their arguments and uniformly bounded i.e. 0 rt (st , xt , at ) and 0 rT (sT , xT ) . The multi-function (also known as a correspondence or point-to-set function) (st , xt ) mAt (st , xt ) is upper semi-continuous (u.s.c.). That is, if {smt , xt } St Xt andmmmmm{amt } At (st , xt ) are sequences such that (st , xt ) (s t , x̄t ), and at āt , thenāt At (s t , x̄t ) for t Te . For every state (s, x) St Xt , the admissible action set At (s, x) is compact for t Te .44.1Optimal Control ProblemMain ResultFor every t Te , (st , xt ) St Xt and π Π, letT 1XVt (st , xt , π) , %t (ri (si , xi , πi ) rT (sT , xT ))i tbe the performance evaluation from time t Te onwards using the policy π Π given theinitial condition (s, x) St Xt . The corresponding optimal (i.e. maximal) value is thenVt (st , xt ) , sup Vt (st , xt , π)(4.1)π ΠA control policy π (πt )t Te is said to be optimal, if it attains the maximum in (4.1), thatisVt (s, x) Vt (s, x, π ) for all (s, x) St Xt and for t Te .(4.2)Thus, the optimal control problem is to find an optimal policy and the associated optimalvalue (4.2) for all t T . We now present the main result of the paper.

10Theorem 4.1. The optimization problem (4.1) obeys dynamic programming principle andhas an admissible policy π Π. Furthermore, Vt (st , xt ) is continuous in its arguments.4.2Proof of Theorem 4.1To prove Theorem 4.1, we need the following key lemma.Lemma 4.1. Let K be defined asK : {(s, x, a) : (s, x) S X, a A},where S X and A are complete separable metric Borel spaces. Suppose further that A iscompact. Let V : K R be a nonnegative continuous function. For (s, x) S X, defineV (s, x) , sup V (s, x, a),a Athen for any (s, x) S X, there exists a B(S X) measurable mapping π : S X Asuch thatV (s, x) V (s, x, π (s, x))(4.3)and V : S X R is continuous.Proof. By [24], we have that there exists B(S X) measurable mapping π : S X Asuch that (4.3) holds and V (s, x) is upper semi-continuous. But, since V (·, ·, ·) is continuous, supa A V (s, x, a) is lower semi-continuous in (s, x) S X, as well. Hence, V (·, ·) iscontinuous in its arguments. Lemma 4.2. Suppose Assumption 3.1 holds true. Then, supremum is attained at (4.1) forsome B(St Xt ) measurable mapping πt (st , xt ) a t for t Te . Furthermore, each Vt iscontinuous in its arguments.Proof. We will show only the case for t T 1. The others follow going backwards iterativedown to t 0. We first show thatZ (sT 1 , xT 1 , aT 1 ) w(P η (xT 1 rT (F (xT 1 , aT 1 , ηT 1 )) zT sT 1 , xT 1 )dzT0mmis continuous in its arguments. Let (smT 1 , xT 1 , aT 1 ) (sT 1 , xT 1 , aT 1 ) as m .

11Then, we havemmlim VT 1 (smT 1 , xT 1 , aT 1 )Z mmmmmw(P η (xm limT 1 rT (F (sT 1 , xT 1 , aT 1 , ηT 1 )) zT (sT 1 , xT 1 ))dzTm 0Z mmmmm lim w(P η (xmT 1 rT (F (sT 1 , xT 1 , aT 1 , ηT 1 )) zT (sT 1 , xT 1 ))dzTm Z0 mmmmm w( lim P η (xmT 1 rT (F (sT 1 , xT 1 , aT 1 , ηT 1 )) zT (sT 1 , xT 1 ))dzTm 0Z mmmmmw(P η ( lim xm T 1 rT (F (sT 1 , xT 1 , aT 1 , ηT 1 )) zT (sT 1 , xT 1 ))dzTm Z0 w(P η (xT 1 rT (F (sT 1 , xT 1 , aT 1 , ηT 1 )) zT (xT 1 , xT 1 ))dzT m 0The second equality follows by boundedness of rT (), w(·) and Lebesgue dominated convergence theorem. The third equality follows by continuity of w(·), the fourth equality followsby continuity of probability measure, and the fifth equality follows by continuity of transitionF (·, ·, ·) as in (3.1). Hence, VT 1 (·, ·, ·) is continuous in its arguments. The result follows byLemma 4.1. Now, we are ready to prove Theorem 4.1.Proof of Theorem 4.1. We haveZ w(P η (xT 1 rT (F (xT 1 , πT 1 (sT 1 , xT 1 ), ηT 1 )) zT sT 1 , xT 1 )dzTVT 1 (sT 1 , xT 1 ) supπ ΠZ 0 w(P η (xT 1 rT (F (xT 1 , πT 1 (sT 1 , xT 1 ), ηT 1 )) zT sT 1 , xT 1 )dzT .0By Lemma (4.2), there exists a B(ST 1 XT 1 ) measurable mapping π Π such thatπT 1 (sT 1 , xT 1 ) a T 1 , and VT 1 (sT 1 , xT 1 ) is continuous in (sT 1 , xT 1 ). Hence, fort T 2, we have Z supw(P η (VT 1 (xT 2VT 2 (sT 2 , xT 2 ) aT 2 AT 2 (xT 2 ,xT 2 )aT 1 AT 1 (sT 1 ,xT 1 )0 rT 1 (F (xT 2 , aT 2 , ηT 2 ), aT 1 ), F (xT 2 , aT 2 , ηT 2 )) zT 1 sT 2 , xT 2 )dzT 1

12Plugging πT 1 (xT 1 , sT 1 ) in VT 2 (sT 2 , xT 2 ), we have Z w(P η (VT 1 (xT 2VT 2 (sT 2 , xT 2 ) supaT 2 AT 2 (sT 2 ,xT 2 ) 0rT 1 (F (sT 2 , xT 2 , aT 2 , ξT 2 ), a T 1 ), F (sT 2 , xT 2 , aT 2 , ηT 2 )) zT 1 sT 2 , xT 2 )dzT 1 Z supaT 2 AT 2 (sT 2 ,xT 2 ) w(P η (VT 1 (xT 20 rT 1 (F (sT 2 , xT 2 , aT 2 , ηT 2 ), a T 1 ), F (sT 2 , xT 2 , aT 2 , ηT 2 )) zT 1 sT 2 , xT 2 )dzT 1By Lemma (4.2) again, it admits an optimal policy a T 2 AT 2 such that Z VT 2 (sT 2 , xT 2 ) w(P (VT 1 (xT 20 rT 1 (F (sT 2 , xT 2 , a T 2 , ηT 2 ), a T 1 ), F (sT 2 , xT 2 , a T 2 , ηT 2 )) zT 1 sT 2 , xT 2 )dzT 1Going backwards iterative, we conclude that dynamic programming holds, (4.1) admits anoptimal policy π Π attaining supremum that depends only on on st and xt at each t Te .Furthermore, Vt (·, ·) is continuous again by Lemma (4.2). Hence, we conclude the proof. 5An Application to Portfolio OptimizationSuppose an investor has a portfolio of n stocks. The price of n stocks at t T are denotedbySt , (St1 , . . . , Stn ).The price of stock i [1, 2, , . . . , n] at time t Te , denoted by Sti , has dynamicsi (1 δ i )Sti with probability pi ,St 1(5.1)where 1 δ i 1 is the proportional decrement/increment of price of ith stock Sti . LetP η (·) denote the joint probability mass function of St for t T . Let π (πt )t T be the

13policy of the investor that stands for the number of shares of n stocks investor is holding attime t Te withπt : St Xt Rn ,where πt is B(St Xt ) measurable. Here, Xt is the total wealth of the portfolio at t with x0 0being the initial wealth. We assume that the investor has a capacity to be in the long or shortposition. Namely, we take that kπt (s, x)k C for some C 0 for all (s, x) St Xt andt Te . We denote all those strategies (πt )t Te that are B(Xt , St ) measurable and uniformlybounded by C by Π. We take that the market is self-financing in the senseXt 1 πt St 1 for t Te ,with St 1 being the n-dimensional vector as defined in (5.1) such that denoting x 1 , x0 , Xt , Xt xt 1 ,is the difference of the total wealth between time t and t 1 for t T . Hence, the rewardfunction at t T reads asrt (st , xt , πt ) XtrT (sT , xT ) XTXt xt 1 rt (st , xt , πt ),Let w(x) x2 for x [0, 1] be the distortion of the probability function such that for a fixedπT 1 given XT 1 xT 1 and ST 1 sT 1 , the performance measure is defined by 2Z η%T 1 (XT ) ,P (xT 1 πT 1 (ST sT 1 ) zT xT 1 , sT 1 ) dzT0, VT 1 (sT 1 , xT 1 , π)such thatVT 1 (sT 1 , xT 1 ) max VT 1 (sT 1 , xT 1 , π).π ΠHence, going backwards iterative, we have at each time t Te 2Z ηVt (st , xt , π) P (xt Vt 1 (St 1 , Xt 1 , π) zt 1 xt , st ) dzt 10Vt (st , xt ) max Vt (st , xt , π).π Π(5.2)Then, by Theorem 4.1, the dynamic programming yields an optimal strategy (πt )t Te Πattaining (5.2) along with the ptimal value Vt at each time t Te . An important observationis that our scheme dictates that the investor should take both his current wealth xt and thecurrent stock price st into consideration while making his decision πt on investing into theassets at t Te .

14References[1] Bellman, R. (1952). On the theory of dynamic programming, . Proc. Natl. Acad. Sci38, 716[2] Chung,K., Sobel,M.J. (1987). Discounted MDP’s: Distribution functions and exponential utility maximization. SIAM J. Control Optim. 25, 49-62.[3] Fleming, W.H., Sheu, S.J. (1999). Optimal long term growth rate of expected utilityof wealth. Ann. Appl. Probab. 9, 871-903.[4] Fleming, W.H., Sheu, S.J. (2000). Risk-sensitive control and an optimal investmentmodel. Math. Finance 10, 197-213.[5] Jaquette, S.C.(1973).Markov decision processes with a new optimality criterion: Discrete time (1976). Ann. Stat. 1, 496-505[6] erionforMarkovdecisionprocesses.[7] Artzner, P., Delbaen, F., Eber, J.M., Heath, D. (1999). Coherent measures ofrisk, Math. Finance 9, 203-228.[8] Artzner, P., Delbaen, F., Eber, J.M., Heath, D., Ku, H. (2007). Coherentmultiperiod risk adjusted values and Bellmans principle. Ann. Oper. Res. 152, 5-22.[9] Ruszczynski, A. (2010). Risk-averse dynamic programming for Markov decision processes, Math. Program. B 125 (2010) 235-261.[10] Cheridito, P., Delbaen, F., Kupper, M (2006). Dynamic monetary risk measuresfor bounded discrete-time processes. Electron. J. Probab. 11, 57-106.[11] Eichhorn, A. Romisch, W. (2005). Polyhedral risk measures in stochastic programming. SIAM J. Optim. 16, 69-95.[12] Follmer,H.,Penner,I. (2006). Convex risk measures and the dynamics of theirpenalty functions. Stat.Decis. 24, 6196[13] Fritelli,M.,Rosazza orderinriskmeasures.

15[14] Frittelli,M., Rosazza Gianin,E. (2005). Dynamic convex risk measures (2005).Risk Measures for the 21st Century, 227-248. Wiley, Chichester[15] Kahneman, D. and Tversky, A. (1979), Prospect Theory: An Analysis of DecisionUnder Risk Econometrica, 47, 263-292.[16] Kahneman, D. and Tversky, A. (1992), Advances in prospect theory: Cumulativerepresentation of uncertainty. Journal of Risk and Uncertainty, 5, 297-323.[17] Zhou, X. (2010), Mathematicalising Behavioural Finance, (2010). Proceedings of theInternational Congress of Mathematicians Hyderabad, India.[18] Wakker, P. (2010), Prospect theory: For risk and ambiguity, Cambdridge UniversityPress.[19] He, X. D., Zhou, X. Y. (2011), Portfolio choice via quantiles. Mathematical Finance,21(2), 203-231.[20] Ruszczynski, A. (2010), Risk-averse dynamic programming for Markov decision processes, Math. Program. B 125 (2010), 235-261.[21] Kun, L., Cheng J., Marcus, S. I (2018), Probabilistically distorted risk-sensitive infinitehorizon dynamic programming, Automatica (97), 1-6.[22] Ma, J., Wong T., Zhang, J. Time-consistent Conditional Expectation under Probability Distortion, preprint.[23] Hernandez-Lerma, O.(1989), Adaptive Markov Control Processes, Springer-Verlag.New York.[24] Rieder, U. (1978). Measurable Selection Theorems for Optimisation Problems,Manuscripta Mathematica, 24, 115-131.[25] Hernandez-Lerma, O., Lasserre, J.B. (1996). Discrete-time Markov control processes, in: Basic Optimality Criteria, Springer, New York.[26] Bjork,T., Khapko, M., Murgoci, A. (2017). On Time-inconsistent Stochastic control in Continuous Time, Finance and Stochastics, 21, 331-360.

not satisy \Dynamic Programming Principle" (DPP) or \Bellman Optimality Principle". Namely, a sequence of optimization problems with the corresponding optimal controls is called time-consistent, if the optimal strategies obtained when solving the optimal control problem at time sstays optimal when the o

Related Documents:

Probability Cheatsheet v2.0 Thinking Conditionally Law of ...

Joint Probability P(A\B) or P(A;B) { Probability of Aand B. Marginal (Unconditional) Probability P( A) { Probability of . Conditional Probability P (Aj B) A;B) P ) { Probability of A, given that Boccurred. Conditional Probability is Probability P(AjB) is a probability function for any xed B. Any

92 Views

2y ago

Decision Analysis: an Overview - Veterans Affairs

Pros and cons Option A: - 80% probability of cure - 2% probability of serious adverse event . Option B: - 90% probability of cure - 5% probability of serious adverse event . Option C: - 98% probability of cure - 1% probability of treatment-related death - 1% probability of minor adverse event . 5

9 Views

4m ago

Adaptive Optimal Control of Partially-unknown Constrained ...

II. Optimal Control of Constrained-input Systems A. Constrained optimal control and policy iteration In this section, the optimal control problem for affine-in-the-input nonlinear systems with input constraints is formulated and an offline PI algorithm is given for solving the related optimal control problem.

34 Views

3y ago

Chance and Probability

Probability measures how likely something is to happen. An event that is certain to happen has a probability of 1. An event that is impossible has a probability of 0. An event that has an even or equal chance of occurring has a probability of 1 2 or 50%. Chance and probability – ordering events impossible unlikely

96 Views

2y ago

7.1 Sample space, events, probability

probability or theoretical probability. If you rolled two dice a great number of times, in the long run the proportion of times a sum of seven came up would be approximately one-sixth. The theoretical probability uses mathematical principles to calculate this probability without doing an experiment. The theoretical probability of an event

69 Views

2y ago

Engineering Formula Sheet - cusd80.com

Engineering Formula Sheet Probability Conditional Probability Binomial Probability (order doesn’t matter) P k ( binomial probability of k successes in n trials p probability of a success –p probability of failure k number of successes n number of trials Independent Events P (A and B and C) P A P B P C

58 Views

2y ago

AP Statistics Syllabus - Math

Chapter 4: Probability and Counting Rules 4.1 – Sample Spaces and Probability Classical Probability Complementary events Empirical probability Law of large numbers Subjective probability 4.2 – The Addition Rules of Probability 4.3 – The Multiplication Rules and Conditional P

29 Views

2y ago

Artificial Intelligence: How to get it right

Artificial Intelligence (AI) has the potential to make a significant difference to health and care. A broad range of techniques can be used to create Artificially Intelligent Systems (AIS) to carry out or augment health and care tasks that have until now been completed by humans, or have not been possible previously; these techniques include inductive logic programming, robotic process .

62 Views

3y ago

Recent Views

Saint Robert Bellarmine - WordPress

Aug 08, 2018 · Sister Laura Gorman Sister Anna Frances Portisch Sister Mary Edward Haren Sister Dolores Priske (Helen Julie) Sister Scholastica Healy Sister olette Marie Quinn Sister lara . S. Heidelman Sister Alice Mary Reilly Sister Genevieve Henneberry (Fidelis) Sister Genevieve Rigney

2y ago

160 Views

Sunday, September 12, 2021 10:00 a.m.

Sep 12, 2021 · On our 154th Church Anniversary, We salute the members of Mount Pleasant Baptist Church who have served for 50 years or more. Sister Brenda Bradley Sister Mary Lockett Sister Aaronita Brown Sister June Marshall Deacon Carlton Brown Sister Barbara Moore Sister Gwendolyn Brown Sister Frances Robinson Deaconess Josephine Byrd Sister Frances Ross

2y ago

344 Views

MRS Title 21-A. ELECTIONS - Maine Legislature

stepgrandchild, stepsister, stepbrother, mother-in-law, father-in-law, brother-in-law, sister-in-law, son-in-law, daughter-in-law, guardian, former guardian, domestic partner, the half-brother or half-sister of a person's spouse, or the spouse of a person's half-brother or half-sister. [PL 2009, c. 253, §2 (AMD).] 21. Incoming voting list.

1y ago

118 Views

12 PUBLIC LAW AND PRIVATE LAW - Home: The National .

INTRODUCTION TO LAW MODULE - 3 Public Law and Private Law Classification of Law 164 Notes z define Criminal Law; z list the differences between Public and Private Law; and z discuss the role of Judges in shaping Law 12.1 MEANING AND NATURE OF PUBLIC LAW Public Law is that part of law, which governs relationship between the State

3y ago

745 Views

Dr. Ram Manohar Lohiya National Law University, Lucknow

2. Health and Medicine Law 3. Int. Commercial Arbitration 4. Law and Agriculture IXth SEMESTER 1. Consumer Protection Law 2. Law, Science and Technology 3. Women and Law 4. Land Law (UP) Xth SEMESTER 1. Real Estate Law 2. Law and Economics 3. Sports Law 4. Law and Education **Seminar Courses Xth SEMESTER (i) Law and Morality (ii) Legislative .

3y ago

496 Views

Companies Law - Cayman Islands dollar

Law 1 of 1971-15th December, 1970 Law 7 of 2000- 20th July, 2000 Law 7 of 1973-28th June, 1973 Law 5 of 2001-20th April, 2001 Law 24 of 1974-22nd November, 1974 Law 10 of 2001-25th May, 2001 Law 25 of 1975-9th December, 1975 Law 29 of 2001-26th September, 2001 Law 19 of 1977-10th November, 1977 Law 46 of 2001-14th January, 2002

3y ago

454 Views

It’s the Law!

ciples stated in Boyle’s Law, Charles’ Law, Gay-Lussac’s Law, Henry’s Law, and Dalton’s Law. Students will be able to explain the application of Boyle’s Law, Charles’ Law, Gay-Lussac’s Law, Henry’s Law, and Dalton’s Law to observations or events related to SCUBA diving. MateriaLs None audio/visuaL MateriaLs None teachinG tiMe

2y ago

378 Views

WHAT LAW IS ? An Introduction to Law

common law system civil law system!! sources of law in civil law !! a1. primary: statutes (written law) enacted by legislative power are the principal source of law. ! a2. two subsidiary sources of law: ! a2.1 administrative regulations a.2.2 customs!! ! sources of law in common law !!! b1. two primary sources of

2y ago

385 Views

Immaculata, Pennsylvania 19345-0200 Catholic Schools

Fall, 2012 Cover Sister Monica Therese Sicilia, I.H.M. IHM Best Practices Sister Margaret Rose Adams, I.H.M For Teachers: Sister Adrienne Saybolt, I.H.M. “Helping K-2 Students Struggling with Reading and Writing” Prime Times Sister Rita James Murphy, I.H.M. Good Writer

2y ago

117 Views

Winter 2012 - IHM EDUCATIONAL RESOURCES - Home

IHM Best Practices Sister Margaret Rose Adams, IHM For Teachers: Sister Adrienne Saybolt, IHM “Helping K-2 Students Struggling with Reading and Writing” Prime Times Sister Elaine deChantal Brookes, IHM Sister

2y ago

138 Views

Tributes in Honor of: SISTER JANET AHLER, CSA CSA SISTERS .

Everett & Jeannine Solon SISTER CORINNE HEIMANN, CSA St Mary's Hospital Board of Directors Teresa Hebble John & Mary Sterba SISTER MARY VERONICA HEIMANN, CSA Sybil Teehan Teresa Hebble Rebecca & Gary Tirevold MR EDWARD HELSTOSKY Bonnie Young Barbara Britz SISTER JOELLEN FLYNN, CSA RAY HINZ Susan Flynn Carol Hinz Fran Frigo JEAN W HOFF

2y ago

341 Views

How to Use These “Snippets” and Poems

For Sale By Shel Silverstein One sister for sale! One sister for sale! One crying and spying young sister for sale! I’m really not kidding, So who’ll start the bidding? Do I hear the dollar? A nickel? A penny? Oh, isn’t there, isn’t there, isn’t there any One kid that will buy this old sister for sale,

2y ago

367 Views

CODIS2006 - Mixture Interpretation - Butler FINAL

“Things we do not do: Calculate mixture ratios for casework – Calculation used for this study: Find loci with 4 alleles (2 sets of sister alleles). Make sure sister alleles fall within 70%, then take the ratio of one allele from one sister set to one allele of the second sister set, figure ratios for all combinations and average.

2y ago

315 Views

CONSECRATA

Salesian Sisters of St. John Bosco Sister Marie Amata D’Amico, C.K. School Sisters of Christ the King Sister Mary Stephany Rose, O.S.H.J. Oblate Sisters of the Sacred Heart of Jesus Sister Brigid Mary Meeks, R.S.M. Religious Sisters of Mercy of Alma Sister Hae-Jin Lim, F.M.A. Salesian Sisters of

2y ago

111 Views

Sister Makes House Calls During the Pandemic

Sister Patricia Deckert, RSM. As an elementary school teacher, Sister Patricia (Pat) taught in the Trenton, Metuchen and Camden dioceses in New Jersey, serving eight years at Cathedral School in Trenton, and seven years at St. James School in Red Bank. Attending nursing school at the age of 50, Sister Pat first ministered at McAuley Hall

11m ago

86 Views

Optimal Control With Distorted Probability Distributions

It looks like you're using an ad-blocker