com
Abstract
In this paper, both full state and output feedback adaptive neural network (NN) controllers are presented for a class of strict-feedback
discrete-time nonlinear systems. Firstly, Lyapunov-based full-state adaptive NN control is presented via backstepping, which avoids the
possible controller singularity problem in adaptive nonlinear control and solves the noncausal problem in the discrete-time backstepping
design procedure. After the strict-feedback form is transformed into a cascade form, another relatively simple Lyapunov-based direct output
feedback control is developed. The closed-loop systems for both control schemes are proven to be semi-globally uniformly ultimately
bounded.
? 2003 Published by Elsevier Science Ltd.
error converges to a small neighborhood of the origin. The disturbance bounded by a known constant d10 ¿ 0, i.e.,
main contributions of this paper are as follows: |d1 (k)| 6 d10 ; fi (Qi (k)) and gi (Qi (k)); i = 1; 2; : : : ; n are
unknown smooth functions.
(i) By transforming the original one-step ahead descriptor
description into an equivalent n-step ahead predictor Assumption 1. The signs of gi (Qi (k)); i = 1; 2; : : : ; n are
description, an elegant approximation-based controller known and there exist two constants gi ; gQi ¿ 0 such that
can be constructed without the causality contradic- gi 6 |gi (Qi (k))| 6 gQi ∀Qn (k) ∈ ⊂ Rn .
tion problem which has been regarded as the main
obstacle encountered in controller design for general
Without losing generality, we shall assume that gi (Qi (k))
strict-feedback nonlinear system through backstepping.
and gn (Qn (k)) are positive in this paper. The objective is to
(ii) By choosing Lyapunov functions consisting of plant
design control u(k) to make the system output yk follow a
states and NN parameter deviation from the ideal val-
known and bounded trajectory yd (k).
ues W ∗ , the proposed controllers are free from the
controller singularity problem usually encountered in
Assumption 2. The desired trajectory yd (k) ∈ y ; ∀k ¿ 0 is
adaptive control using feedback linearization, and at
smooth and known, where y := { | = 1 }.
the same time avoids the problem of indirect adaptive
control, where one relies on independently showing
Denition 2.1. The solution of (1) is semi-globally uni-
that the NN modeling converges to a small neighbor-
formly ultimately bounded (SGUUB), if for any , a com-
hood and then appealing to stability arguments.
pact subset of Rn and all Qn (k0 ) ∈ , there exist an ¿ 0
(iii) Through a diAeomorphism transformation, the
and a number N (; Qn (k0 )) such that Qn (k) ¡ for all
strict-feedback form is 9rst transformed into a cascade
k ¿ k0 + N (Lin & Saberi, 1995).
form, Lyapunov-based direct output feedback control
is then developed, which is relatively easy to imple-
Denition 2.2. The sequence S(k) is said to be persistently
ment as it only requires the measurement of inputs and
exciting (PE) if there are Q ¿ 0 and integer L ¿ 0 such
outputs, and the controller structure is less complex
that
than that of the proposed backstepping design—this is
k +L−1
especially true for high-order systems. 0
n (k + 1) = fn (Qn (k)) + gn (Qn (k))u(k) + d1 (k); Under certain conditions, it has been proven that sev-
yk = 1 (k); (1) eral approximation methods, such as polynomials, splines,
NNs, fuzzy systems, have function approximation abilities,
where Qi (k) = [1 (k); 2 (k); : : : ; i (k)]T ∈ Ri ; i = 1; 2; : : : ; n, and have been frequently used as function approximators.
u(k) ∈ R, yk ∈ R are the state variables, system in- There are several types of NNs that have been frequently
put and output respectively; d1 (k) denotes the external used, which include linearly parametrized and nonlinearly
S.S. Ge et al. / Automatica 39 (2003) 807 – 819 809
parametrized networks (Ge et al., 2001). For clarity and sim- used in the proof of closed-loop system stability. Let
plicity, consider HONNs (Ge et al., 2001; Kosmatopoulos,
Polycarpou, Christodoulou, & Ioannou, 1995) S(z(k)) = [s1 (z(k)); s2 (z(k)); : : : ; sl (z(k))]T ;
(W; z) = W T S(z); W and S(z) ∈ Rl ; then S(k) has the following properties:
S(z) = [s1 (z); s2 (z); : : : ; sl (z)]T ; (4) max [S(k)S T (k)] ¡ 1; S T (k)S(k) ¡ l: (8)
si (z) = [s(zj )]dj (i) ; i = 1; 2; : : : ; l; (5) 4. Adaptive state feedback NN control
j∈Ii
where fn;c i (Qi+1 (k)) = fi (Qi (k)) + gi (Qi (k))i+1 (k), Continuing the procedure as above recursively, after (n − 2)
i = 1; 2; : : : ; n − 1. Accordingly, we have steps, the 9rst two equations in (1) become
c
1 (k + 1) fn; 1 (Q2 (k)) 1 (k + n − 1) = f2;c 1 (Qn (k));
.. ..
Qi (k + 1) = . = . ;
2 (k + n − 1) = F2 (Qn (k))
i (k + 1) c Q
fn; i (i+1 (k))
+ G2 (Qn (k))3 (k + n − 2); (16)
i = 1; 2; : : : ; n − 1; (11)
where f2;c 1 (Qn (k)) = f1 (F3;c 1 (Qn−1 (k))) + g1 (F3;c 1 (Qn−1 (k)))
which is a vectored function of Qi+1 (k) denoted by
×f3;c 2 (Qn (k)), F2 (Qn (k))=f2 (F3;c 2 (Qn (k))) and G2 (Qn (k))=
Qi (k + 1) = Fn;c i (Qi+1 (k)) ∈ Ri ; i = 1; 2; : : : ; n − 1: (12) g2 (F3;c 2 (Qn (k))).
After still another further step, the 9rst equation in (1)
It should be noted that fn;c i (Qi+1 (k)) is a scalar, but
becomes
Fn;c i (Qi+1 (k)) is a vector of dimension i. When i = 1,
Fn;c 1 (Q2 (k)) degrades into scalar fn;c 1 (Q2 (k)). 1 (k + n) = F1 (Qn (k)) + G1 (Qn (k))2 (k + n − 1); (17)
After one more step, the 9rst (n − 1) equations in (1)
become where F1 (Qn (k)) = f1 (f2;c 1 (Qn (k))) and G1 (Qn (k)) =
g1 (f2;c 1 (Qn (k))).
i (k + 2) = fi (Qi (k + 1)) + gi (Qi (k + 1))i+1 (k + 1); Since all the equations from (13) to (17) are derived from
i = 1; 2; : : : ; n − 2; the original system, the original strict-feedback form (1) is
equivalent to
n−1 (k + 2) = fn−1 (Qn−1 (k + 1)) 1 (k + n) = F1 (Qn (k)) + G1 (Qn (k))2 (k + n − 1);
+ gn−1 (Qn−1 (k + 1))n (k + 1): (13) ..
.
Substituting (10) and (12) into (13), we obtain
c Q n−1 (k + 2) = Fn−1 (Qn (k))
i (k + 2) = fn−1; i (i+2 (k));
+ Gn−1 (Qn (k))n (k + 1);
i = 1; 2; : : : ; n − 2;
n (k + 1) = fn (Qn (k)) + gn (Qn (k))u(k) + d1 (k);
n−1 (k + 2) = Fn−1 (Qn (k))
yk = 1 (k): (18)
+ Gn−1 (Qn (k))n (k + 1); (14)
It should be noted that functions Fi (Qn (k)), i=1; 2; : : : ; n−1,
where become highly nonlinear. In fact, as i decrease, Fi (Qn (k))
c
fn−1; Q c Q and Gi (Qn (k)) become more entangled and complex, be-
i (i+2 (k)) = fi (Fn; i (i+1 (k)))
cause Fn−1 (Qn (k)) and Gn−1 (Qn (k)) are derived by one-step
+ gi (Fn;c i (Qi+1 (k)))fn;c i+1 (Qi+2 (k)); substitution, while F1 (Qn (k)) and G1 (Qn (k)) are derived by
(n − 1)-step substitution. As mentioned in Section 3, an
i = 1; 2; : : : ; n − 2; HONN can emulate a given nonlinear function to a small
error tolerance, therefore it is a good choice to construct our
Fn−1 (Qn (k)) = fn−1 (Fn;c n−1 (Qn (k))); controller using HONN without knowing the exact struc-
tures of Fi (Qn (k)) and Gi (Qn (k)). In the following discus-
Gn−1 (Qn (k)) = gn−1 (Fn;c n−1 (Qn (k))): sion, we will show how to construct our HONN controller
Following the same procedure, the 9rst (n − 2) equations in by backstepping.
(14) can be described by From the de9nition of Gi (Qn (k)) in each step, it is
c clear that the value of Gi (Qn (k)); i = 1; 2; : : : ; n − 1,
1 (k + 2) fn−1; 1 (Q3 (k)) is the same as gi (Qi (k)), therefore Gi (Qn (k)) satisfy
.. .. gi 6 Gi (Qn (k)) 6 gQi ; ∀Qn (k) ∈ under Assumption 1.
Qi (k + 2) = . = . ;
Now we can construct control for (18) via backstepping
i (k + 2) c
fn−1; Q method without the problem of causality contradiction. The
i (i+2 (k))
design procedure is recursive. At the ith step, the ith-order
i = 1; 2; : : : ; n − 2; subsystem is stabilized with respect to a Lyapunov function,
which is a function of Qi+2 (k) and denoted as Vi , by the design of adaptive NN function Ŵ i Si (zi (k)) as a
9ctitious control. The feedback control u(k) is designed in
Qi (k + 2) = Fn−1;
c Q
i (i+2 (k)); i = 1; 2; : : : ; n − 2: (15) the 9nal step. For convenience, let Wi∗ ; Ŵ i ; W̃ i ; i=1; 2; : : : ; n,
S.S. Ge et al. / Automatica 39 (2003) 807 – 819 811
be the ideal value, estimate, and estimation error of the NN order to pursue an easy-to-implement controller for systems
weights at the ith design step. After n recursions, we can with a relatively high order, and with only output measur-
construct the 9ctitious controls jf (k), real control u(k), and able, the strict-feedback form is transformed into a cascade
the NN weight updating law as follows: form 9rst. For the convenience of analysis and less techni-
cal complexity, we assume that there is no disturbance, i.e
jf (k) = Ŵ Tj−1 (k)Sj−1 (zj−1 (k)); j = 2; 3; : : : ; n;
d1 (k) = 0, in this case.
u(k) = Ŵ Tn (k)Sn (zn (k)); (19) For convenience of analysis, for i = 1; 2; : : : ; n − 1, let
Fi (k) = Fi (Qn (k)); Gi (k) = Gi (Qn (k));
Ŵ i (k + 1) = Ŵ i (ki ) − +i [Si (zi (ki )),i (k + 1); +-i Ŵ i (ki )];
fn (k) = fn (Qn (k));gn (k) = gn (Qn (k)):
ki = k − n + i; i = 1; 2; : : : ; n; (20) They are functions of system state Qn (k) at the kth step.
where diagonal gain matrix +i = +iT ¿ 0 with max (+i ) = Substituting 2 (k + n − 1) in equation (16) into equation
.Qi , constant -i ¿ 0, z1 (k) = [QTn (k); yd (k + n)]T ∈ z1 ⊂ (17), we obtain
Rn+1 , zj (k) = [QTn (k); jf (k)]T ∈ zj ⊂ Rn+1 , j = 2; 3; : : : ; n, 1 (k + n) = F1 (k) + F2 (k)G1 (k)
Ŵ i ∈ Rli and Si (zi (k)) ∈ Rli with li denotes the neurons used.
+ G1 (k)G2 (k)3 (k + n − 2): (22)
The error vector ,(k) = [,1 (k); ,2 (k); : : : ; ,n (k)]T is de9ned
as ,1 (k) = 1 (k) − yd (k); ,2 (k) = 2 (k) − 2f (k − n + Continuing the substitution procedure until u(k) appears, we
1); : : : ; ,n (k) = n (k) − nf (k − 1). obtain
In comparison with the standard parameter adaptation al- 1 (k + n) = F(Qn (k)) + G(Qn (k))u(k); (23)
gorithms, it should be noted that parameter adaptation algo-
where
rithm (20) is of k + 1 − ki th order (i = 1; 2; : : : ; n) in order
to solve the control problem for the ith equation. In fact, F(Qn (k)) = F1 (k) + F2 (k)G1 (k) + · · ·
the current weight estimate of the ith equation, Ŵ i (k), is n−2 n−1
updated from the estimate, Ŵ i (ki − 1), of n + 1 − i steps + Fn−1 (k) Gi (k) + fn (k) Gi (k); (24)
earlier rather than that of the previous step for systems in i=1 i=1
n−1
a one-step ahead predictor form. From this observation, we
deduce that high-order controllers are useful and eAective G(Qn (k)) = gn (k) Gi (k): (25)
i=1
in handling high-order nonlinear systems.
The stability results of the closed-loop system are sum- De9ne the new system states x(k)=[x1 (k); x2 (k); : : : ; x n (k)]T
marized in Theorem 1. as
x1 (k) = 1 (k);
Theorem 1. The closed-loop adaptive system consisting of
plant (1), controller (19) and update law (20) is SGUUB x2 (k) = 1 (k + 1) = fn;c 1 (Q2 (k));
and has an equilibrium at ,=[,1 ; ,2 ; : : : ; ,n ]T =0, if Qn (0) is
..
initialized in . This guarantees that all the signals include .
the states Qn = [1 ; 2 ; : : : ; n ], the control u and NN weight
estimates Ŵ i ; i = 1; 2; : : : ; n are SGUUB, subsequently, x n (k) = 1 (k + n − 1) = f2;c 1 (Qn (k)); (26)
lim |yk − yd (k)| 6 ; (21) which can be written as x(k) = T (Qn (k)), where T (Qn (k))
k→∞ is a nonlinear coordinate transformation of the form
where is a small positive number. T (Qn (k)) = [1 (k); fc (Q2 (k)); : : : ; fc (Qn (k))]T : (27)
n; 1 2; 1
In Section 4, we have proposed an adaptive NN Lemma 2. Let U be an open subset of Rn and let ’ =
controller for strict-feedback nonlinear system via back- (’1 ; : : : ; ’n ) : U → Rn be a smooth map. If the Jacobian
stepping. This method is relatively easy to implement for Matrix
relatively lower-order systems, but it becomes too compli- 9’1 9’1
···
cated for high-order systems because it needs to update too 9x1
9x n
many parameters. In addition, too many NNs used in the d’
= ... ..
.
..
.
controller not only consume much computing time but are dx
also hard to tune. Furthermore, the control scheme becomes 9’ 9’n
n
infeasible when some system states are unmeasurable. In ···
9x1 9x n
812 S.S. Ge et al. / Automatica 39 (2003) 807 – 819
is nonsingular at some point p ∈ U , or equivalently, According to the de9nition of the new states, we know
Rank(d’=d x) = n at some point p ∈ U , then there exists that y(k) = [x1 (k − n + 1); : : : ; x1 (k − 1); x1 (k)]T . From Eq.
a neighborhood V ⊂ U of p such that ’ : V → ’(V ) is a (28), we obtain
di:eomorphism (Marino & Tomei, 1995).
yk+1 = x2 (k) = x3 (k − 1) = · · ·
Lemma 3. For system (1), the nonlinear coordinate trans- = x n (k − n + 2) = f(y(k)) + g(y(k))uk−n+1 : (30)
formation T (Qn (k)) in (27), T : → x ⊂ Rn , is a
smooth map and a di:eomorphism, namely, there exists It means that x2 (k) is a function of y(k) and uk−n+1 . It should
a unique transformation function T −1 (x(k)) such that be noted that although the right-hand side of (30) does not
Qn (k) = T −1 (x(k)), and both T and T −1 are one-to-one. contain all the elements of z(k), for convenience of analysis,
we can denote (30) as follows without any ambiguity:
Proof. See Appendix B.
yk+1 = x2 (k) = 2 (z(k)): (31)
Based on the above analysis, we can obtain the cascade Similarly, we obtain the following equation from (28):
system description, which is equivalent to the original sys-
tem (1), as follows: yk+2 = x3 (k) = f(y(k + 1)) + g(y(k + 1))uk−n+2 : (32)
uQ∗k = uQ ∗ (z(k));
Q 1.5
T
Q = [z T (k); yd (k + n)] ∈ zQ ⊂ R2n with compact
where z(k) 1
∗
As mentioned in Section 3, there exist an integer l and an -0.5
ideal constant weight vector W ∗ , such that for all l ¿ l∗ ,
-1
uQ ∗ (z)
Q = W ∗T S(z)
Q + zQ; ∀zQ ∈ zQ; (38) 0 500 1000 1500
where zQ is the NN estimation error satisfying |zQ| ¡ 0 . Fig. 1. Tracking performance of state feedback.
If we choose the control law and the weight updating law
as
uk = Ŵ (k)S(z(k));
Q (39)
||W1||
2
||W2||
1
Ŵ (k + 1) = Ŵ (k1 ) + +[S(z(k
Q 1 ))(yk+1 − yd (k + 1)) 0 u
X 2f
+ -Ŵ (k)]; -1
-2
k1 = k − n + 1; (40) -3
Theorem 2. Consider the closed-loop system consisting of Fig. 2. Boundedness of x2f (k), u(k), Ŵ 1 (k) and Ŵ 2 (k).
system (1), controller (39) and adaptation law (40). There
exist compact sets y0 ⊂ y , w0 ⊂ w and positive con-
stants l∗ ; .∗ ; and -∗ such that if 1 (k)
f2 (Q2 (k)) = 2
;
1+ 1 (k) + 22 (k)
(i) assumptions 1 – 3 being satis<ed, the initial condition
d(k) = 0:1 cos(0:05k) cos(1 (k)):
y(0) ∈ y0 , W̃ (0) ∈ w0 , and
(ii) the design parameters are suitably chosen such that Suppose that there is no a priori knowledge of the system
l ¿ l∗ ; - ¡ -∗ and .Q ¡ .∗ with .Q being the largest nonlinearities. It can be checked that Assumption 1 and 2
eigenvalue of +, are satis9ed. The tracking objective is to make the output
yk follow a desired reference signal yd (k) = 12 sin(k4=20) +
then the closed-loop system is SGUUB. The tracking error 1
2 sin(k4=10).
can be made arbitrarily small by increasing the approxi-
mation accuracy of the NNs.
6.1. State feedback control via backstepping
Proof. See Appendix C.
The initial condition for system states is x(0)=[0 0]T , and
those for NNs are Ŵ 1 (0) = 0; Ŵ 2 (0) = 0. Other controller
6. Simulation study parameters are chosen as l1 = 22; l2 = 22; +1 = 0:08I; +2 =
0:08I . The simulation results are presented in Figs. 1 and 2.
To demonstrate the eAectiveness of the proposed schemes, Fig. 1 shows the output yk and the reference signal yd (k).
consider a nonlinear discrete-time SISO plant described by Fig. 2 illustrates the boundedness of the 9ctitious control
x2f (k), real control u(k), and the norms of NN weights
1 (k + 1) = f1 (1 (k)) + 0:32 (k); Ŵ 1 (k) and Ŵ 2 (k).
2 (k + 1) = f2 (Q2 (k)) + u(k) + d(k);
6.2. Direct output feedback control
yk = x1 (k);
The initial condition for system states is x(0) = [0 0]T ,
where and that for NNs is Ŵ (0) = 0. Other controller parameters
1:421 (k) are chosen as l1 = 29; + = 0:15I . The simulation results are
f1 (1 (k)) = ;
1 + 21 (k) presented in Figs. 3 and 4. Fig. 3 shows the output yk and the
814 S.S. Ge et al. / Automatica 39 (2003) 807 – 819
0.8
controller parameters, stability of the closed-loop adaptive
system can be guaranteed.
y
0.6
Output y and Reference signal yd
0.4
0.2 Acknowledgements
0
The authors are very grateful to the many positive and
-0.2
y
d constructive comments from the Editor and anonymous re-
-0.4 viewers. Without these comments, the paper would not be
-0.6 improved to its present quality.
0 500 1000 1500
1.5
i = 1; 2; : : : ; n − 1, let
Control input u and norm of NN weight
1 ||W||
• It should be pointed out that HONNs used in this pa- − yd (k + n): (A.1)
per may be replaced by any other linear approximator Considering 2 (k + n − 1) as a 9ctitious control for (A.1),
such as radial basis function networks (Sanner & Slotine, it is obvious that ,1 (k + n) = 0 if we choose
1992), spline functions (Nurnberger, 1989) or fuzzy sys-
tems (Spooner & Passino, 1996), which have the similar 2 (k + n − 1) = ∗2d (k)
properties as described in (8), while the stability and per- 1
formance properties of the adaptive system are still valid. =− [F1 (k) − yd (k + n)]: (A.2)
G1 (k)
• Since normally S T (z)S(z)l, we can choose a little larger
+ for fast learning rate without destroying the stability of Since F1 (k) and G1 (k) are unknown, they are not available
the closed-loop system. for constructing a 9ctitious control ∗2d (k). However, F1 (k)
and G1 (k) are function of system state Qn (k), therefore we
can use HONN to approximate ∗2d (k) as follows:
7. Conclusion
∗2d (k) = W1∗T S1 (z1 (k)) + z1 (z1 (k));
In this paper, for a class of nonlinear unknown z1 (k) = [QTn (k); yd (k + n)]T ∈ z1 ⊂ Rn+1 : (A.3)
discrete-time systems in strict-feedback form, an adaptive
full state feedback NN controller has been presented via Letting Ŵ 1 be the estimate of W1∗ , consider the direct adap-
backstepping, which avoids the possible controller singu- tive 9ctitious control
larity problem in adaptive nonlinear control and solves the 2 (k + n − 1) = 2f (k) = Ŵ T1 (k)S1 (z1 (k)) (A.4)
noncausal problem in the discrete-time backstepping design
procedure. After the system description form is transformed and the robust updating algorithm for NN weights as
into the cascade form, another adaptive direct output feed- Ŵ 1 (k + 1) = Ŵ 1 (k1 ) − +1 [S1 (z1 (k1 )),1 (k + 1)
back control scheme has also been presented using neural
networks. It has been shown that for appropriately chosen + -1 Ŵ 1 (k1 )]: (A.5)
S.S. Ge et al. / Automatica 39 (2003) 807 – 819 815
Substituting 9ctitious control (A.4) into (A.1), the error Using the facts that
equation (A.1) is re-written as
S1T (z1 (k1 ))S1 (z1 (k1 )) ¡ l1 ;
,1 (k + n) = F1 (k) − yd (k + n)
+ G1 (k)Ŵ T1 (k)S1 (z1 (k)): (A.6) S1T (z1 (k1 ))+1 S1 (z1 (k1 ))
Adding and subtracting G1 (k)∗2d (k) on the right-hand side 6 .Q1 S1T (z1 (k1 ))S1 (z1 (k1 )) 6 .Q1 l1 ;
of (A.6) and noting (A.3), we have
.Q1 ,21 (k + 1) gQ1 z21
,1 (k + n) = F1 (k) − yd (k + n) + G1 (k)[Ŵ T1 (k)S1 (z1 (k)) 2z1 ,1 (k + 1) 6 + ;
gQ1 .Q1
− W1∗T S1 (z1 (k)) − z1 (z1 (k))]
2-1 Ŵ T1 (k1 )+1 S1 (z1 (k1 )),1 (k + 1)
+ G1 (k)∗2d (k); ∀z1 (k) ∈ z1 : (A.7)
.Q1 l1 ,21 (k + 1)
Substituting (A.2) into (A.7) leads to 6 + gQ1 -12 .Q1 Ŵ 1 2 ;
gQ1
,1 (k + n) = G1 (k)[W̃ T1 (k)S1 (z1 (k)) − z1 ]: (A.8)
2W̃ T1 (k1 )Ŵ 1 (k1 ) = W̃ 1 (k1 )2 + Ŵ 1 (k1 )2 − W1∗ 2 ;
Choose the Lyapunov function candidate
we obtain
n−1
1 2 51 2 1
V1 (k) = ,1 (k) + W̃ T1 (k1 + j)+1−1 W̃ 1 (k1 + j); (A.9) VV1 6 − ,1 (k + 1) − ,21 (k) + 61
gQ1 gQ1 gQ1
j=0
Step 2: As de9ned before, ,2 (k) = 2 (k) − 2f (k1 ). Its As explained in Step 1, V2 (k) is bounded for all k ¿ 0,
(n − 1)th diAerence is given by and the tracking errors ,1 (k) and ,2 (k) are also bounded
and will asymptotically converge to the compact set denoted
,2 (k + n − 1) = 2 (k + n − 1) − 2f (k) by 2⊂ R2 , where 2 := {| = [1 ; 2 ]T ; 1 6 gQ1 62 ,
= F2 (k) + G2 (k)3 (k + n − 2) 2 6 gQ2 62 }. The boundedness of Ŵ 2 (k), or equivalently
of W̃ 2 (k), can be proved as in Step 1.
− 2f (k): (A.12) Step i: Following the same procedure as in Step 2, for
,i (k) = i (k) − if (ki−1 ), we have the direct adaptive 9c-
Similarly, consider 3 (k + n − 2) as a 9ctitious control for
titious controller and the robust updating algorithm for NN
(A.12). It is obvious that ,2 (k + n − 1) = 0 is true when we
weights as follows:
choose
1 i+1 (k + n − i) = i+1; f (k) = Ŵ Ti (k)Si (zi (k)); (A.20)
3 (k + n − 2) = ∗3d (k) = − [F2 (k) − 2f (k)]:
G2 (k)
(A.13) Ŵ i (k + 1) = Ŵ i (ki ) − +i [Si (zi (ki )),i (k + 1)
∗3d (k) = W2∗T S2 (z2 (k)) + z2 (z2 (k)); zi (k) = [QTn (k); if (k)]T ∈ zi ⊂ Rn+1 : (A.21)
z2 (k) = [QTn (k); 2f (k)]T ∈ z2 ⊂ Rn+1 : (A.14) Accordingly, we obtain the ith error equation
Consider the direct adaptive 9ctitious controller as ,i (k + n − i + 1) = Gi (k)[W̃ Ti (k)Si (zi (k)) − zi ]: (A.22)
3 (k + n − 2) = 3f (k) = Ŵ T2 (k)S2 (z2 (k)) (A.15) Choose the Lyapunov function candidate
i−1
and the robust updating algorithm for NN weights as 1 2
Vi (k) = Vj (k) + , (k)
gQi i
Ŵ 2 (k + 1) = Ŵ 2 (k2 ) − +2 [S2 (z2 (k2 )),2 (k + 1) j=1
n−i
+ -2 Ŵ 2 (k2 )]: (A.16)
+ W̃ Ti (ki + j)+i−1 W̃ i (ki + j); (A.23)
Following the same procedure and methods as in Step 1, j=0
we obtain the second step error equation
where ki = k − n + i. The 9rst diAerence of (A.23) along
,2 (k + n − 1) = G2 (k)[W̃ T2 (k)S2 (z2 (k)) − z2 ]: (A.17) (A.21) and (A.22) is given as
i i
Choose the Lyapunov function candidate 5j 2 1 2
VVi 6 − ,j (k + 1) − , (k)
1 2 gQj gQj j
j=1 j=1
V2 (k) = V1 (k) + , (k)
gQ2 2
+ 6i − -i (1 − -i .Qi − gQi -i .Qi )Ŵ i (ki )2 ;
n−2
+ W̃ T2 (k2 + j)+2−1 W̃ 2 (k2 + j); (A.18) where 5j ; j=1; 2; : : : ; i−1, are de9ned as the previous (i−1)
j=0 steps, 5i = 1 − .Qi − .Qi li − gQi .Qi li and 6i = 6i−1 + gQi zi2 = .Qi +
-i Wi∗ 2 . If we choose the design parameters as follows:
where k2 = k − n + 2. The 9rst diAerence of (A.18) along
1 1
(A.16) and (A.17) is given by .Qi ¡ ; -i ¡ ; (A.24)
1 + li + gQi li (1 + gQi ).Qi
51 2 1
VV2 6 − , (k + 1) − ,21 (k) then VVi 6 0 once any one of the i errors satis9es
gQ1 1 gQ1
|,j (k)| ¿ gQj 6i j = 1; 2; : : : ; i. This demonstrates that
52 2 1 the tracking error ,1 (k); ,2 (k); : : : ; ,i (k) are bounded for
− , (k + 1) − ,22 (k)
gQ2 2 gQ2 all k ¿ 0, and will asymptotically converge to the com-
pact set denoted byi ⊂ Ri , where i := {| =
+ 62 − -2 (1 − -2 .Q2 − gQ2 -2 .Q2 )Ŵ 2 (k2 )2 ; [1 ; 2 ; : : : ; i ]T ; j 6 gQj 6i j = 1; 2; : : : ; i}. The bounded-
where 51 is de9ned as in Step 1, and 52 =1−.Q2 −.Q2 l2 −gQ2 .Q2 l2 , ness of Ŵ i (k), or equivalently of W̃ i (k), can be proved as
2
62 = 61 + gQ2 z2 = .Q2 + -2 W2∗ 2 . in Step 1.
If we choose the design parameters as follows: Step n: For ,n (k) = n (k) − nf (k − 1), its 9rst diAerence
is given by
1 1
.Q2 ¡ ; -2 ¡ ; (A.19)
1 + l2 + gQ2 l2 (1 + gQ2 ).Q2 ,n (k + 1) = n (k + 1) − nf (k)
then VV2 6 0 once |,1 (k)| ¿ gQ1 62 and/or |,2 (k)| ¿ gQ2 62 . = fn (k) + gn (k)u(k) + d1 (k) − nf (k): (A.25)
S.S. Ge et al. / Automatica 39 (2003) 807 – 819 817
Continue the process recursively until we 9nd the 9nal equa- Using the same techniques as in Appendix A, the 9rst dif-
tion expressed as ference of (C.4) along (C.3) and (40) is given as
n (k) = tn (x1 (k); : : : ; x n (k)) ≡ tn (k); (B.8) 5 1
VV 6 − ey2 (k + 1) − ey2 (k) + 6
gQ gQ
9tn (k) 1
= : (B.9)
9x n (k) g1 (x n (k)) · · · gn ([x1 (k); : : : ; tn (k)]T ) − -(1 − -.Q − g- Q Ŵ (k1 )2 ;
Q .)
Now, we can summarize that the unique transformation where 5 = 1 − .Q − .l
Q − gQ.l,
Q and 6 = gQ z2Q = .Q + -W ∗ 2 .
T −1 (x(k)) as T −1 (x(k)) = [x1 (k); t2 (k); : : : ; tn (k)]T and its If we choose the design parameters as follows:
Jacobian Matrix takes the triangular form of
1 1
dT −1 (x(k)) .Q ¡ ; -¡ (C.5)
1 + l + gl
Q (1 + g)
Q .Q
d x(k)
then VV 6 0 once the tracking error ey (k) is larger than
1 0 ··· 0
(gQ + 820 )6. This implies the boundedness of V (k) for all
1
∗
··· 0
k ¿ 0, which leads to the boundedness of ey (k). Further-
g1 (x1 (k))
more, the tracking error ey (k) will asymptotically
converge
= . .. .. :
. .. to the compact set denoted by 6 g6 Q .
. . . .
Due to negativity of VV , we can conclude that yk+1 ∈ y
1
∗ ∗ ··· if all past outputs yk−j ∈ y ; j = 0; : : : ; n − 1, and compact
g1 (x n (k)) · · · gn ([x1 (k); : : : ; tn (k)]T )
set is small enough.
(B.10) We have proved that yk+1 ∈ y , we can use the same
Based on Assumption 1, Trace (dT −1 (x(k))=d x(k)) = 0, techniques as in Appendix A to show that the NN weight
therefore T −1 is a diAeomorphism according to Lemma 2. error stays in a small compact set we , and the control signal
Equivalently, T is a diAeomorphism. uk ∈ L∞ . Finally, if we initialize state y0 ∈ y0 , W̃ (0) ∈ w0 ,
and we choose suitable parameters .Q , - according to (C.5)
to make small enough, there exists a constant k ∗ such that
Appendix C. Proof of Theorem 2 all tracking errors asymptotically converge to , and NN
weight error asymptotically converges to we for all k ¿ k ∗ .
Proof. Substituting controller (39) into (36), Eq. (36) can This implies that the closed-loop system is SGUUB. Then
be re-written as yk ∈ y and Ŵ (k) ∈ L∞ will hold for all k ¿ 0.
ey (k + n) = −yd (k + n) + f0 (z(k))
References
+ g0 (z(k))Ŵ T (k)S(z(k)):
Q (C.1)
Adding and subtracting g0 (z(k))uQ ∗ (z(k))
Q on the right-hand Anderson, B. D. O., Bitmead, R. R., Johnson, C. R. Jr., Kokotovic,
P. V., Kosut, R. L., Mareels, I. M. Y., Praly, L., & Riedle, B. D.
side of (C.1) and noting (38), we have (1986). Stability of adaptive systems: Passivity and averaging
analysis. Cambridge, MA: MIT Press.
ey (k + n) = −yd (k + n) + f0 (z(k)) + g0 (z(k))uQ ∗ (z(k))
Q
Ge, S. S., Hang, C. C., Lee, T. H., & Zhang, T. (2001). Stable adaptive
neural network control. Norwell, USA: Kluwer Academic.
+ g0 (z(k))[Ŵ T (k)S(z(k))
Q
Ge, S. S., Lee, T. H., & Harris, C. J. (1998). Adaptive neural network
control of robotic manipulators. London: World Scienti9c.
− W ∗T S(z(k))
Q − zQ]: (C.2)
Ge, S. S., Lee, T. H., Li, G. Y., & Zhang, J. (2003). Adaptive nn control
Substituting (37) into (C.2) leads to for a class of discrete-time nonlinear systems. International Journal
of Control, to appear.
ey (k + n) = g0 (z(k))[W̃ T (k)S(z(k))
Q − zQ]; (C.3) Girosi, F., & Poggio, T. (1989). Networks and the best approximation
property. Arti<cial Intelligence Laboratory Memorandum No. 1164,
where W̃ (k) = Ŵ (k) − W ∗ . MIT, Cambridge.
S.S. Ge et al. / Automatica 39 (2003) 807 – 819 819
Gupta, M. M., & Rao, D. H. (1994). Neuro-control systems: Theory and Narendra, K. S., & Parthasarathy, K. (1990). Identi9cation and control
applications. New York:IEEE Press. of dynamic systems using neural networks. IEEE Transactions on
Hunt, K. J., Sbarbaro, D., Zbikowski, R., & Gawthrop, P. J. (1992). Neural Networks, 1(1), 4–27.
Neural networks for control system-a survey. Automatica, 28(6), Nurnberger, G. (1989). Approximation by spline functions. New York:
1083–1112. Springer.
Kanellakopoulos, I., Kokotovic, P. V., & Morse, A. S. (1991). Systematic Paretto, P., & Niez, J. J. (1986). Long term memory storage capacity of
design of adaptive controller for feedback linearizable systems. IEEE multiconnected neural networks. Biological Cybernetics, 54, 53–63.
Transactions on Automatic Control, 36(11), 1241–1253. Polycarpou, M. M. (1996). Stable adaptive neural control scheme for
Kosmatopoulos, E. B., Polycarpou, M. M., Christodoulou, M. A., & nonlinear systems. IEEE Transactions on Automatic Control, 41(3),
Ioannou, P. A. (1995). High-order neural network structures for 447–451.
identi9cation of dynamical systems. IEEE Transactions on Neural Sadegh, N. (1993). A perception network for functional identi9cation and
Networks, 6(2), 422–431. control of nonlinear systems. IEEE Transactions on Neural Networks,
Kosut, R. L., & Friedlander, B. (1985). Robust adaptive control: 4(6), 982–988.
Conditions for global stability. IEEE Transactions on Automatic Sanner, R. M., & Slotine, J. E. (1992). Gaussian networks for direct
Control, 30(7), 610–624. adaptive control. IEEE Transactions on Neural Networks, 3(6),
Krstic, M., Kanellakopoulos, I., & Kokotovic, P. V. (1995). Nonlinear 837–863.
and adaptive control design. New York: Wiley. Seto, D., Annaswamy, A. M., & Baillieul, J. (1994). Adaptive control of
Levin, A. U., & Narendra, K. S. (1996). Control of nonlinear dynamical nonlinear systems with a triangular structure. IEEE Transactions on
systems using neural networks—Part II: Observability, identi9cation, Automatic Control, 39, 1411–1428.
and control. IEEE Transactions on Neural Networks, 7(1), Spooner, J. T., & Passino, K. M. (1996). Stable adaptive control using
30–42. fuzzy systems and neural networks. IEEE Transactions on Fuzzy
Lewis, F. L., Yesildirek, A., & Liu, K. (1996). Multilayer neural-net robot Systems, 4(3), 339–359.
controller with guaranteed tracking performance. IEEE Transactions Yeh, P. C., & Kokotovic, P. V. (1995). Adaptive control of a class
on Neural Networks, 7(2), 388–398. of nonlinear discrete-time systems. International Journal of Control,
Lin, Z., & Saberi, A. (1995). Robust semi-global stabilization of 62(2), 303–324.
minimum-phase input-output linearizable systems via partial state and Yesidirek, A., & Lewis, F. L. (1995). Feedback linearization using neural
output feedback. IEEE Transactions on Automatic Control, 40(6), networks. Automatica, 31(11), 1659–1664.
1029–1041. Zhang, Y., Wen, C. Y., & Soh, Y. C. (2000). Discrete-time robust
Marino, R., & Tomei, P. (1995). Nonlinear control design. London: backstepping adaptive control for nonlinear time-varying systems.
Prentice-Hall. IEEE Transactions on Automatic Control, 45(9), 1749–1755.