1, JANUARY/FEBRUARY 1993
204
Correspondence
Dynamic System Identification Using Neural Networks
Yd
-
I. INTRODUCTION
205
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS, VOL. 23, NO. 1, JANUARYffEBRUARY 1993
Plant
Position
Actual Position
'I'ocess
Trajectory
Neural
Network
Time
Fig. 6 . Concept of learning type
- Y
Plant
I *
Neural
Network
i
Fig. 4. Scheme of inverse transfer function identifier.
Position
,Actual Position
/Desired
Trajectory
Time
Fig. 5. Concept of adaptive type.
206
The teaching signal Y(k)(the plant output) is divided into both the
unknown vector a of the plant and the signal vector I(k) as (8)
Y(k)= a T I ( k )
T = aTI.
(1)
Nout= W T I .
Thus, error
given by
(2)
= (T
- Nout)
= (aT - W T ) I .
(3)
IV. CONTROLLER
DESIGN
METHOD
This section describes general design methods for both the direct and inverse transfer function identifiers. The object plant is a
predominantly linear SISO plant system. The leaming is performed
every sampling time. Since we assume that the dominant part of the
plant is linear, the neural network identifiers are designed for a linear
system. The basic neural network structure is a three layer, linear
PDP type. As to the nonlinear effect, the sigmoid function is added
to the designed linear neural network. The transfer function of the
object plant is assumed to be expressed as
A(2-l) =
+
*=1
m
..
B(z-') = 1
+ 1b , C i
+ +
YN(k) w T ( k ) W ( k ) r ( k )
(4)
(9)
= Y ( k )- YN(k)
= a T I ( k ) - wT(k)W(k)I(k)
= {a* - W T ( k ) W ( k ) } I ( k ) .
(10)
W(k
w(k
A(t-')Y(k)
=~-~Gol?(z-~)U(k)
(8)
+ 1) = W ( k )+ qAW(k)
+ 1) = w ( k ) + qAw(k)
A W ( k )= - d E / d W ( k )
Aw(k) = - d E / d w ( k )
E = E2(k)
(11)
(12)
(13)
(14)
i=l
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS, VOL. 23, NO. 1, JANUARYFEBRUARY 1993
207
r----------- 1
I
- - - - - - - - - - --- - 1
111
U(k-d) =(l/Go)~(k)+Ca,l'(k-i)-GoCb,I-k-/-d.
2=1
,=1
(15)
Both the neural network input vector I ( k ) and the unknown
parameter vector cy are defined in the same way as the direct transfer
function identifier:
IT(k)=
(16)
(17)
The teaching signal L ' ( k ) (the plant input) is divided into both
unknown vector a and the signal vector I ( k ) . ( [ - ( k - d )= n ' l ( k ) ) .
On the other hand, when the neural network is linear, its output
U N ( ~is) obtained by
l - , v ( k )= d J T ( k ) W ( k ) I ( k ) .
(18)
&(IC) = U ( k - d - r v ( k )
= c r T I ( k )- w T ( k ) W ( k ) l ( k )
= {a' - J ( k ) W ( k ) } 1 ( k ) .
(19)
IEEE TRANSACTIONS ON SYSTEMS, MAN,AND CYBERNETICS, VOL. 23, NO. 1, JANUARYFEBRUARY 1993
208
- Netwbrk
c,
xg=20
4-
output
a
P
c,
a
c,
*g
*s
0
E
g=o.1
- Network output
---Plant output
c,
0
c,
(A
q=0.3
0
100
200
h
0
200
100
Sampling number
Sampling number
Fig. 11. Learning process of direct transfer function identifier.
' Fig. 13. Learning process of direct transfer function identifier applied
to nonlinear plant.
10
q=o.1
h
w-
+.
w
z o
a
output
---- Network
Plant output
0
100
-10
-10
10
Input x
Fig. 14. Input-output relationship of nonlinear function.
200
Sampling number
Fig. 12. Learning process of direct transfer function identifier using different
initial value.
100
200
Sampling number
Fig. 15. Nonlinear simulator ushg results of direct transfer function
identifier.
shows the relation between the true plant output and the nonlinear
simulator output using the identified result from Fig. 13. The feedback
loop is used to stabilize the system with h'p = 0.2 in this simulator.
Although the nonlinear simulator output exhibits similar behavior to
the plant output, a relatively small error between the nonlinear output
and the plant output remains. This is because both the nonlinear
and parasite terms prevent perfect identification. Fig. 16 shows
the relation between the nonlinear simulator output and the true
plant output where the weight matrixes are the same as those in
Fig. 15 with a different feedback gain K p = 0.4. Both outputs
209
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS, VOL. 23, NO. I , JANUARYFEBRUARY 1993
- Plant output
I
+-
x9=20
Neural
P
c
3
-1
+E
m
n
U
c
m
Fig. 18. Direct controller using results of inverse transfer function identifier.
-cm
iij
100
200
-Plant output
K,=O
Sampling number
Fig. 16. Nonlinear simulator using results of direct transfer function
identifier.
q=0.75
- Network output
- _ - _Plant input
0
200
400
Sampling number
Fig. 19. Simulation result of direct controller.
100
2 10
Sampling number
Fig. 17. Process of inverse transfer function of identifier.
+-
P
c
3
.
0
+r
200
400
Sampling number
Fig. 20. Simulation result of direct controller.
the telation between the plant output and the desired value where
the feedback gain l i p = 0.3. The error in Fig. 20 is larger than
that in Fig. 19. Fig. 21 shows the relation between the feedback
loop output and the neural network output under the same conditions
as those of Fig. 20. Since the sign of the feedback loop output is
the same as that of the neural network output, the feedback loop
increases the error. This result shows that there is a possibility of
increasing the output error by adding the direct controller with the
feedback loop in order to compensate for the identification error.
This phenomenon is caused by using past plant output as the neural
network input.
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS, VOL. 23, NO. 1, JANUARY/FEBRUARY 1993
210
xg=7
tl =0.03
El
1.0
Y
LL
'b--
0.51
Plant output
0
1000
0
25
50
1100
Sampling nuniber
Sampling number
x 10-3
1.5894
b
1.5892
Sensor
Amp.
xg
VI. EXPERIMENT
The realization of identifiers in an actual system is confirmed
experimentally for a one freedom force control system. Fig. 22 shows
the experimental setup. In this control system, the contact force can
not be accurately controlled using the motor current because of the
friction of the harmonic drive. Thus, it is necessary to estimate the
stiffness of the contact object using adaptive contlol or leaming
control. We can approximately express the transfer function G ( z )
of the one freedom force control system as follows.
xg=7
El
1.0
Q)
2 0.51
LL
o
bp,
v
Y
0
1000
1100
Sampling number
where T, is the sampling period, h-,,,
is the gain of the system, T, is
the time constant of the system and K e is the stiffness of the contact
object. Fig. 23 shows the experimental result for the direct transfer
function identifier. As shown in Fig. 23, the neural network output
converges with the actual plant output. This result confirms that the
proposed direct transfer function identifier is very useful in an actual
system. Fig. 24 shows the effect of the lionlinear function of (21).
The vertical axis of Fig. 24 is the root mean squared error of 500
sampling errors after learning. As shown in Fig. 24, this is minimum
at about Xg = 7. Since the actual control system has a nonlinear
effect, the result in Fig. 24 indicates the existence of an optimum
nonlinear sigmoid function shape. Fig. 25 shows the experimental
result for the nonlinear simulator using the results from Fig. 24. As
shown in Fig. 25, the simulator behaves similarly to an actual plant.
This result confirms that the proposed nonlinear simulator is very
useful in an actual system.
VII. DISCUSSION
As shown in Section V, the direct transfer function identifier has
a local minimum because the 5 rule is used as the leaming laws.
This local minimum causes the weight matrix u T w to converge
with a value different form that of the unknown parameter vector
211
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS, VOL. 23, NO. 1. JANUARYIFEBRUARY 1993
REFERENCES
D. Psaltis, A. Siders and A. Yamamura, Neural controller, in Proc.
I987 IEEE Int. Con$ Neural Networks, vol. IV, San Diego, 1987, pp.
551-558.
A. Guez, J. Eilbert and M. Kam, Neuromophic architecture adaptive
control: A preliminary analysis, in Proc. 1987 IEEE Int. Con/: Neural
Networks, vol. IV, San Diego, 1987, pp. 567-572.
C. A. Niznik, W. Hoss and C. Watts, Modeling robot cerebellar neural
network pathway information measures, in Proc. 1987 IEEE Int. Con$
Neural Networks, vol. IV, San Diego, 1987, pp. 583-591.
M. Kawato, Y. Uno, M. Isobe and R. Suzuki, A hierarchical model for
voluntary movement and its application to robotics, Proc. 1987 IEEE
Int. Conf: Network, vol. IV, 1987, pp. 573-582.
T. Tokita, T. Fukuda, T. Mitsuoka, T. Kurihara, Force control of
robot manipulator by neural network (control of one degree-of-freedom
manipulator), J. Robotics SOC.Japan, vol. 7, no. 1, pp. 47-51, 1989.
T. Yabuta, T. Tsujimura, T. Yamada and T. Yasuno, On the characteristics of the manipulator controller using neural network, in Proc. Int.
Workshop on Machine Intell. and Esion, Apr. 1989, pp. 76-81.
I. INTRODUCTION
Information-processing in most large scale systems is effected
by a network of geographically or functionally distributed decision
makers (DMs) with limited communications. These problems arise
naturally in a wide variety of contexts, including weather forecasting, military command and control (C2), air-traffic control, electric
power networks, surveillance systems, oil prospecting, emergency
management, product distribution and supply, and medical diagnosis.
Each DM (variously referred to as a node, a sensor, a commander, etc.
depending on the problem context) has access to only a portion of the
information available to the network of DMs. Moreover, even the total
information set is generally incomplete and is characterized by a high
level of uncertainty due to lax updating, missed detection of events,
Manuscript received March 30, 1991; revised April 10, 1992. This work was
supported in part by NSF Grant #IRI-8902755 and in part by ONR Contract
#N00014-90-3-1753.
2.-B. Tang was with the Department of Electrical and Systems Engineering,
University of Connecticut, Storrs, CT 06269-3157 and is now with the
Division of Applied Sciences, Harvard University, Cambridge, MA 02138.
K. R. Pattipati and D. L. Kleinman are with the Department of Electrical
and Systems Engineering, University of Connecticut, Storrs, CT 06269-3157.
IEEE Log Number 9202124.