Ackrawledgement:This work is being conducted under the fuzzv rules such as those for fuzzy control and fuzzy
fellowship granted by the Turkish Scientific and Technical optimization [6,7,8].
Research Council (TuBlTAK) ;Grant No:BYG312(91)/6352. In this paper we introduce a fuzzy dynamic programming
technique with fuzzy state variables and decision variables.
Abstract:% paper developes a fuzzy dynamic programming Section 2 presents our methodology which is applied to a
technique with fuzzy state variables and decision variables. problem of ballistic missile tracking problem (section 3).
Fuzzy discretization is initially applied to the state and conml Results are discussed in section 4.
spaces generating fuzzy cellular spaces. State transition is
achieved by a fuzzy cell to cell mapping. The target state cell
membership is subsequently computed by premiss
matching.The paths that lead to a target cell which is the image
of many different state cell combmations have different costs Fuay dynamic programming is developed as a process
with different memberships. A cost functional is representing accepting preprocessed inputs and having outputs that are
the time spent and the fuzzy distance of the present fuzzy state further defuzzified for actual applications. Processes that are
cell to a fuzzy desired state. Paths having minimum costs are generally coupled to our approach are given in Fig.1.
thereafter selected. The developed methodology is applied to a As in classical dynamic programming, discretization is fist
ballistic &sile tracking problem and results are discussed applied to a continuous problem in order to obtain the crisp
from the efficiency of the new fuzzy optimal control technique states and crisp controls in a constrained region (time interval).
point of view. Possibilistic uncertainty is then intraduced by fuzzifying these
variables. There. the state and control variables are fuzzified
generating a fuzzy cell space (section 2.1). This is done by
1.INTRODUCTION defining fuzzy sets on the respective value sets of the variables
and associating appropriate memkship functions to these
Solving a problem, within a dynamic programming fuzzy sets. The defuzzification module restores the uncertainty
!kamework implies the imbedding of the problem in a class of in the outputs of the FDP for further low level processing.
similar problems, the solutions of which are both logically
related and scaled in degree of difficulty. Multistage decision
processes are characteristic of all dynamic programming 2.1. Formation of the fuzzy cell space
problems where the variables involved are state and decision
variables. In classical approaches state variables are taken In our fuzzy dynamicprogramming approach the state space
basically as point valued (crisp) measures of conditions that and control space are divided into n cells where each cell is
exist at the beginning of any phase of the solution. On the other represented by a fuzzy set that overlaps with the neighboring
hand, decision variables also referred to as control variables ar ones. (Fig. 2)
policy variables, are directly specified in order to obtain an
optimal solution at each stage of a dynamic programming
solution [l].Uncertainties in both state values and decision
variables render them fuzzy in nature. Our earlier works
[2,3.45]focused on modeling uncertainties in a state variable
as a state cell and generated the state dynamic changes as a
fuzzy cell to cell mapping. On the other hand, control laws and
decision rules are generally ill-defined incorporating cell k-1 dllk
imprecision. incompleteness and inconsistencies when F2g.2 Fuzzy cells
problems to be solved are drawn f"unstructured or ill- (Vertical axis represents memberships of discretized states in a
structwed environments. Approximate decision rules fuzzy cell.)
necessitates uncertainty models. One such representation are
723
ii. After finding the image cell of each support point in a
given state cell we compute the overall image cells of the state
cell considend.
Let B be the image e l l of one discrete point in state cell A.
This image gcnerally intemxts with multiple different cells Ci
in the state space Fig.(4).
>
x1 x2 * "p
Fig.3: F u z y cell discretuation f
E.ch fiazy M is haher subdivided into papal lmgth imavdr
Fig.3). rmch that t
k s u p n t of a hzzy MA (rupp A) is uperred m:
nrppA=x 13x2)............
SbEX (1)
where X is the universal set representing the state space or the Fig. 4 Premiss matching with Fig.S:Fuzzy cell D with
control space. the image cell and the fuzzy cells. a support of fup;y cells.
(In Fig.4 and Fig.5 MD represents memberships of the cells C;
724
Finally the path that has minimum cost with maximum t t
membership is chosen according to the expression:
min(Coa(xi)/( 1Op(xi))) (3)
This selection is performed for each next state cell
combinations Di. The procedure continues until all time states
in the fuzzy time cell are considered or the objective is reached.
f m
PROBLEM
3. APPIJCA~ON
(3)
v
Fig.6:Discretized cells of states and control.
(4)
Here the path of the aircraft x l state is cellularized into 3
For illustrative purposes we use a simplified model that fuzzy cells representing the near region to the path, the medium
neglects gravitational and aerodynamical forces and considers region and the far region of the path. The near region and the
the aircraft as a point mass.The state variables for the aircraft far region are funher discretized into 11 levels and the medium
are region is discretized into 21 levels. The aircraft's velocity which
is the x2 state is cellularized into 3 cells representing the small
velocity: d/dt(xl(t))=x2(t) velocity. the medium velocity and the large velocity
acceleration:d/dt(x2(t))=u( t) respectively. Then small and large velocities are further
time: d/dt(x3)=l discretized into 9 levels and the medium velocity is discretized
into 17 levels. The x3 state which is the fuzzy time is
Interception is solved by minimizing the objective cellularized into 3 cells representing the near time .the medium
functional that is formalized by the weighted sum of minimum time and the far time. Then near and far time cells are
time and the "pursuit for interception" objectives as: discretized into 11 levels and the medium time is discretized
into 21 levels. The control (acceleration, thrust) is cellularized
T T into 3 cells representing the negative thrust (discretized into 9
levels), the thrust near 0 (discretized into 17 levels) and the
positive thrust (discretized into 9 levels). At the end Ax3
Here the state vector is increased by a third component (change in time) is represented by only one cell discretized into
representing time in order to render the system model 11 levels. In any other application, these cells and levels may
autonomous. Normalizing the mass to unity, we find the motion be changed frcely by the user.
of the aircraft in x direction as described by the state equations: For each of these discretized levels, the image cells are
computed based on the max, min operations given in the
expressions below:
kilN="@xl min(clx2*pAx3)) (13)
k2N'max@x2,m;l(y1,pAx3)) (14)
ki3N'max@x3.~Ax3) (15)
The thrust U@) is constrained by the relationship -1 .&U@)
11.0 .The discrete cost functional takes the form of:
4.REsu~~s
AND DJSCUSSI~N
N-1 N-1
i
FDP calculations show that image dell memberships are
highly dependent on initial conditions while Cost operations
are made independent of initial conditions and memberships.
Integrated selection is performed. for compensation. by
Fig. 6 gives the cell space discretization levels for each choosing the optimum path using an expression that involves
state and control considered in our problem. both cost and membership of a given cell. This allows a
possibilistic choice of the locally optimal path weighted by
memberships.
725
The efficiency of the FDP increases if a "best case-worst [8]Chuen Chien Lee,"Fuzzy Logic in Control Systems:Fuzzy
case" window is c a n s i d e d when compiling the memberships Logic Conmller-PartlLPart2rt2,"IEEE Trans. on Systems, Man
of the image cells. For finding the resultant membership of an and Sybernetics. V01.20,"0.2. March-April 1990, pp.404435.
image cell Di under several membership assignments, a best
case is computed with max(p@i)) and a worse case with min(p
P i ) ) . This window is propagated through. generating to
optimal paths, one worse case optimal and the second , best
case optimal path. Discretization levels of fuzzy states in FDP
provide a large flexibility in the solution of the problem by
offering possibilities of variable grid structure.In safe regions of
state space the number of discretization levels itre made low for
coarse processing, while being increased in strategic regions. In
the illustrative example of this paper, when the missile is out of
range, level numbers are taken low, thus increasing the interval
between discretization steps. Reaching the range of the missile
by a coarse path, level numbers are subsequently increased for
fine optimization. This approach is equivalent to a variable grid
method with a single pass. Still comparable with classical
Dynamic Programming methods such as variable grid multipass
dynamic p r o g " h g in their computational burden. the FDP
offers flexibility in the solution technique with high
adaptability
The main problem of classical dynamic programming
resides in the existence of subintervals: after that the next
states are found, these image states fall in between discretized
state levels such that interpolation has to be used. For problems
having many states interpolation is difficult and causes the
solutions to be approximate.
In FDP, this difficulty is eliminated by cellularization of the
states and controls and by intersecting cells.
726