Anda di halaman 1dari 5

Preprint, 11th IFAC Symposium on Dynamics and Control of Process Systems,

including Biosystems
June 6-8, 2016. NTNU, Trondheim, Norway

Design of a Data-Driven Controller for a


Spiral Heat Exchanger
Shin Wakitani ∗ Mingcong Deng ∗∗ Toru Yamamoto ∗∗∗

Tokyo University of Agriculture and Technology, Tokyo, 184-8588
Japan (e-mail: wakitani@cc.tuat.ac.jp).
∗∗
Tokyo University of Agriculture and Technology, Tokyo, 184-8588
Japan (e-mail: deng@cc.tuat.ac.jp)
∗∗∗
Hiroshima University, Hiroshima, 739-8527, Japan (e-mail:
yama@cc.tuat.ac.jp)

Abstract: A data-driven proportional-integral-derivative (DD-PID) controller has been pro-


posed as an effective controller for nonlinear systems. The DD-PID controller can tune the
PID parameters adaptively at each equilibrium point. In order to train the PID parameters
in a database, an offline learning algorithm based on a fictitious reference iterative tuning
(FRIT) method was established. This method can compute the PID parameters by using a set
of operating data. However, the FRIT method is a control parameter tuning method that is
only based on the minimization of the system output in its criterion; therefore, the criterion
is insufficient for systems in which the stability of a closed-loop system is important such as
chemical process systems because sometimes the sensitivity of an obtained controller becomes
high. In order to solve this problem, an extended FRIT (E-FRIT) method that penalizes the
input variation in its criterion has been proposed. In this method, the PID parameters that are
taken into stability can be calculated. The effectiveness of the proposed method is evaluated by
an experimental result of a spiral heat exchanger.

Keywords: Process control, PID control, self-tuning control, data-driven control, extended
fictitious reference iterative tuning (E-FRIT)

1. INTRODUCTION which is one of the offline learning methods. This method


is known as the DD-FRIT method (see Wakitani et al.
(2013)). According to the method, the time required for
Proportional-integral-derivative (PID) controllers (Ziegler learning a database is significantly reduced because this
and Nichols (1942); Chien et al. (1952); Vilanova and Visi- method can update the database using one-shot experi-
oli (2012)) have been mainly used in many process systems mental I/O data.
because of its simple structure. The control performance of
a PID controller is strongly affected by the combination of In the FRIT method, a fictitious reference signal includ-
three parameters: the proportional gain, the integral gain ing obtained experimental I/O data and adjustable con-
and the derivative gain. If a control system has nonlinear- trol parameters is generated firstly. After that, a desired
ity, a fixed PID controller may not maintain the desired controller is realized by adjusting the control parameters
control performance when the equilibrium point of the sys- in order to minimize the error of the system output in
tem output is changed by altering the set points. A data- its criterion. The FRIT method does not require any
driven PID (DD-PID) controller that uses a database for mathematical models of the controlled object to tune the
tuning PID parameters has been proposed by Yamamoto control parameters. However, control parameters tuned
et al. (2009) as an effective controller for such nonlinear by the FRIT method may become high gain because the
systems. FRIT method is only based on the minimization of the
control output. In order to solve this problem, an extended
The DD-PID controller is one of a just-in-time con- FRIT (E-FRIT) method that has a penalty to compensate
troller (Stenman et al. (1996); Zheng and Kimura (2001); for the differential input is proposed (see Masuda et al.
Nakpong and Yamamoto (2012)). The DD-PID controller (2010), Kano et al. (2011)). The E-FRIT method can help
adaptively tunes its PID parameters at each equilibrium suppress the sensitivity of a controller by setting a penalty
point by using trained PID parameters that are stored in a factor appropriately.
database. Learning algorithms of the DD-PID controllers
are divided into two methods: one is an online learning In this paper, a new offline learning method of the DD-
method that trains PID parameters while under control, PID controller based on the E-FRIT method is proposed.
and the other is an offline learning method that trains In this method, the PID gain updating rule based on the
the parameters in advance by using obtained experimental E-FRIT is derived. A more stable DD-PID controller can
data. There is a method based on a fictitious reference be obtained compared to the previous learning method
iterative tuning (FRIT) method (Kaneko et al. (2005)), based on the FRIT method. In this study, the proposed

Copyright © 2016 IFAC 342


IFAC DYCOPS-CAB, 2016
June 6-8, 2016. NTNU, Trondheim, Norway

DD-PID controller is applied to a spiral heat exchanger


and its effectiveness is evaluated. Database

2. DESIGN OF DATA-DRIVEN CONTROLLER


[Step1] & [Step2]
Local Controller
2.1 System Description Design Method

It is assumed that the controlled object is described as the PID Controller System
following equation.
y(t) = f (φ(t − 1)), (1)
where, y(t) is the system output and f (·) indicates a non- Fig. 1. Block diagram of the data-driven proportional-
linear function whose output is determined by a historical integral-derivative (DD-PID) control system.
data vector φ(t − 1) (See Yamamoto et al. (2009)). The
historical data φ(t − 1) denotes as follows. PID gains at each step t are calculated by the following
φ(t − 1) := [y(t − 1), . . . , y(t − ny ), [STEP 2] and [STEP 3].
u(t − 1), . . . , u(t − nu )]. (2) [STEP 2] Calculate Distance and Select Neighbor Data
In (2), u(t) is the system input, ny and nu are orders of
y(t) and u(t), respectively. The distance between the query (which is the information
vector that indicates current system state) φ̄(t) and an
2.2 PID Control Law information vector φ̄(tj ) in the database is calculated by
the following L 1 norm with some weights.
When a PID controller is applied to process systems, ny +nu +1  
  φ̄l (t) − φ̄l (tj ) 
sometimes derivative kick depending on set value change d(φ̄(t), φ̄(tj )) =  
 max φ̄ (m) − min φ̄ (m)  ,
causes problems for stability of a process. In order to avoid l=1 l l
the derivative kick, this paper introduces the following PID (8)
control law. This control law is known as I-PD control law. j = 1, . . . , N.
Δu(t) = KI (t)e(t) − KP (t)Δy(t) − KD (t)Δ2 y(t),
(3) In (8), φ̄l (t) expresses the l-th element in the j-th dataset,
and φ̄l (t) expresses the l-th element in the query at t.
where, e(t) is the control error between the set value r(t) Moreover, max φ̄l (m) and min φ̄l (m) indicate the maxi-
and the system output y(t), and is defined as mum value and minimum value of the l-th element of all
e(t) := r(t) − y(t). (4) datasets in the database. In this method, the datasets in
In (3), KP (t), KI (t) and KD (t) express the proportional the database are sorted in ascending order of their dis-
gain, the integral gain and the derivative gain, respectively. tance, and k-pieces of datasets with the smallest distances
Moreover, Δ denotes the differencing operator given by among them are chosen as neighbor datasets. Where, k is
Δ := 1 − z −1 , and z −1 is the backward operator which set by the user at will.
implies z −1 y(t) = y(t − 1). In the DD-PID method, these
[STEP 3] Compute PID gains
PID gains at each step t are adaptively tuned using a
database. From the selected k-pieces of neighbor datasets, a suitable
set of PID gains at t steps are computed by the following
2.3 Data-driven PID controller equation.
k  k
This section explains the working principle of the DD-
K(t) = wi K(i), wi = 1, (9)
PID controller. In the DD-PID controller, an initial
i=1 i=1
database has to be created because the controller requires
a database for its actions. Thus, if a database does not where
exist, an initial database is created by the following pro- exp(−di )
wi = k . (10)
cedure. 
exp(−di )
[STEP 1] Generate Initial Database
i=1
Initial operating data r0 (t), u0 (t), y0 (t) are obtained by
using an I-PD controller with fixed PID gains. Datasets The block diagram of the DD-PID controller is shown in
at each step are generated by obtained operating data, Fig. 1. By executing [STEP 2] and [STEP 3] every time,
and are sequentially stored in the database. The dataset the PID gains are adaptively tuned if the PID gains in
is defined by the following equation. the database are suitably tuned in advance. However, if
a result obtained by a fixed PID controller is applied to
Φ(j) = [φ̄(tj ), θ P ID (tj )], j = 1, 2, . . . , N. (5) create a database, then all PID gains included in the initial
Where, N indicates the total number of datasets. φ̄(tj ) information vectors may be equal. Expressed numerically,
and θ P ID (tj ) are expresses as follows. that is
φ̄(tj ) := [r0 (tj + 1), r0 (tj ), θ P ID (1) = θ P ID (2) = · · · = θ P ID (N ). (11)
y0 (tj ), . . . , y0 (tj − ny + 1),
In this case, the PID gains in the database have to be tuned
u0 (tj − 1), . . . , u0 (tj − nu + 1)] (6)
in an offline manner or online manner. The online learning
θ P ID (tj ) = [KP (tj ), KI (tj ), KD (tj )]. (7) method requires many experiments to get optimal PID

343
IFAC DYCOPS-CAB, 2016
June 6-8, 2016. NTNU, Trondheim, Norway

the extended FRIT (E-FRIT). The criterion of the E-FRIT


System is shown as follows.
θ ∗ = arg min JE-FRIT (θ) (17)
θ
N
1 
JE-FRIT (θ) = (y0 (θ, t) − yr (θ, t))2
N t=1
Fig. 2. Block diagram of the FRIT method. 
+λfs Δũ(θ, t)2 (18)
gains. Therefore, it is an unsuitable way from the point Δũ(θ, t) = ũ(θ, t) − ũ(θ, t − 1) (19)
of view of practical use. Thus the next section considers ũ(θ, t) = C(θ)(r̃(θ, t) − yr (θ, t)) (20)
about the offline learning algorithm for designing an initial 
database by using one shot experimental I/O data. Var(yr (θ, t) − y0 (t))
fs = (21)
Var(Δũ(θ, t))
3. OFFLINE LEARNING METHOD BASED ON
E-FRIT Where, fs is a scaling parameter. Thanks to this parame-
ter, the weight coefficient λ can always be set as 1.0 and
can be adjusted based on this value. This paper considers
3.1 Extended FRIT (E-FRIT) method
about an offline learning method based on the E-FRIT.
The FRIT method can calculate an optimal control pa-
rameters vector θ ∗ without a mathematical model of 3.2 Offline learning rule based on E-FRIT
the controlled object. This method introduces a fictitious
reference signal r̃(θ, t) to tune the control parameters. First, initial operating data r0 (θ̄ P ID , t), y0 (θ̄ P ID , t),
The fictitious reference signal is calculated by experimen- u0 (θ̄ P ID , t) are obtained by using a fixed PID controller,
tal I/O data u0 (θ, t), y0 (θ, t) and an initial controller and an initial database is created following the [STEP 1]
C(θ, z −1 ) including an initial control parameter vector in Section 2.3. Next, the query φ¯0 (t) is created along to
θ = [c0 , c1 , ..., cn ]. temporal sequence of the operating data, and PID gains
at t steps θ P ID (t) are calculated following [STEP 2] and
The block diagram of the FRIT is shown in Fig.2. In [STEP 3]. However, a information vector Φ(1) in the
the figure, C(θ, z −1 )/Δ expresses a controller with an sorted database and the query φ¯0 (t) are coincident at all of
integrator. Moreover, C(θ, z −1 ) is given as follows. the steps. Thus the weight factor to calculate the local PID
C(θ, z −1 ) = c0 + c1 z −1 + · · · + cn z −n . (12) gains for Φ(1) becomes bigger than weight factors of other
Where, n indicates the order of the controller. From Fig.2, datasets, and the offine learning can not perform effec-
I/O relationship of the controller C(θ, z −1 )/Δ can be tively. In order to avoid the above problem, in this learning
described as follows. algorithm, datasets from Φ(2) to Φ(k) in a database are
C(θ, z −1 ) chosen as neighbor data, and local PID gains are calculated
u0 (θ, t) = {r̃(θ, t) − y0 (θ, t)} (13) by these data according to the following equation (22). To
Δ
simplify further descriptions, r0 (θ̄ P ID , t), y0 (θ̄ P ID , t) and
From (13), the fictitious reference signal r̃(t) can be cal- u0 (θ̄ P ID , t) are expressed as r0 (t), y0 (t), u0 (t). Moreover,
culated as follows by the controller and the experimental r̃(θ P ID , t), yr (θ P ID , t) and Δũ(θ P ID , t) are expressed as
I/O data. r̃(t), yr (t) and Δũ(t), respectively.
r̃(θ, t) = C −1 (θ, z −1 )Δu0 (θ, t) + y0 (θ, t) (14) k k
 
−1 θ P ID (t) = wi θ P ID (i), wi = 1. (22)
A reference model Gm (z ) including desired properties is
designed by the user, and the model output corresponding i=2 i=2
to the fictitious reference signal is given as yr (t). In The calculated PID gains θ P ID (t) = [KP (t), KI (t), KD (t)]
the FRIT method, it adjusts the control parameters in are updated by the following steepest descent method.
C(θ, z −1 ) so that the controller achieves a desired closed-
∂J(t + 1)
loop property. In particular, the FRIT method solves θ new
P ID (t) = θ P ID (t) − η , (23)
the following optimization problem and derives optimal ∂θ P ID (t)
control parameters θ ∗ . where,
θ ∗ = arg min JFRIT (θ), (15) 1 
θ J(t) = (y0 (t) − yr (t))2 + λfs Δũ(t − 1)2 . (24)
2
N
1  2 η = [ηP , ηI , ηD ] are learning coefficients. yr (t) is calculated
JFRIT (θ) = {y0 (θ, t) − yr (θ, t)} . (16) as follows by using the fictitious reference signal r̃(t) and
N t=1
a reference model Gm (z −1 ).
The FRIT method tunes the control parameters based
on minimization of the controlled response. However, in yr (t) = Gm (z −1 )r̃(t), (25)
process systems that are emphasized on stability such r̃(t) = {Δu0 (t) + (KP (t) + KI (t) + KD (t))y0 (t)
as chemical process systems, the criterion is insufficient − (KP (t) + 2KD (t))y0 (t − 1),
because the obtained controller is usually sensitive. In + KD (t)y0 (t − 2)}/KI (t). (26)
order to solve this problem, Masuda et al. (2010) were
proposed a new criterion that adds a penalty to the In this paper, the reference model is designed as the
variation of the control input Δũ(t). This method is called following second order system.

344
IFAC DYCOPS-CAB, 2016
June 6-8, 2016. NTNU, Trondheim, Norway

z −1 P (1)
Gm (z −1 ) = , (27)
P (z −1 )
P (z −1 ) = 1 + p1 z −1 + p2 z −2 , (28)
  √ ⎫
ρ 4μ − 1
p1 = −2 exp − cos ρ ⎪ ⎪

  2μ 2μ ⎪


ρ
p2 = exp − . (29)
μ ⎪



ρ := Ts /σ ⎪

μ := 0.25(1 − δ) + 0.51δ
In (29), Ts indicates the sampling time. σ and δ are
defined as the rise time and the attenuation parameter,
respectively, and these parameters are designed by the
user at will. Δũ(t) is calculated based on the relationship
between (3) and (20).
Δũ(t) = KI (t){r̃(t) − yr (t)}
− KP (t)Δyr (t) − KD (t)Δ2 yr (t). (30)
From the above relationship, the second term of the right
Fig. 3. Appearance of the heat exchange system.
side (23) is expanded as follows.

∂J(t + 1) ∂J(t + 1) ∂yr (t + 1) ∂ r̃(t) ⎪

∂yr (t + 1) ∂ r̃(t) ∂KP (t) ⎪
= ⎪
∂KP (t) ⎪

2 ⎪

∂Δũ(t) ∂Δũ(t) ⎪

+λfs ⎪

∂Δũ(t) ∂KP (t) ⎪

∂J(t + 1) ∂J(t + 1) ∂yr (t + 1) ∂ r̃(t) ⎪ ⎪


= ⎬
∂KI (t) ∂yr (t + 1) ∂ r̃(t) ∂KI (t) . (31)
∂Δũ(t)2 ∂Δũ(t) ⎪

+λfs ⎪

∂Δũ(t) ∂KI (t) ⎪


∂J(t + 1) ∂J(t + 1) ∂yr (t + 1) ∂ r̃(t) ⎪ ⎪

= ⎪

∂KD (t) ∂yr (t + 1) ∂ r̃(t) ∂KD (t) ⎪ ⎪


2
∂Δũ(t) ∂Δũ(t) ⎪

+λfs ⎭
∂Δũ(t) ∂KD (t)
The PID gains in the database are updated based on the
PID gains θ new
P ID (t) corrected by (23) as follows.
Φ(1) ← [φ̄(t1 ), θ new
P ID (t)],
Φ(j) ← [φ̄(tj ), ωj θ new
P ID (t)], j = 2, . . . , k. (32) Fig. 4. Appearance of the spiral heat exchanger.
It can execute offline learning to iterate the above proce-
dure until J(t) becomes small. Step7 Iterating from Step1 to Step 5 until (24) at each
step becomes small.
3.3 Algorithm
4. APPLICATION TO SPIRAL HEAT EXCHANGER
The proposed algorithm is summed up as follows:
In this paper, the proposed DD-PID controller is applied
Step1 Creating a query φ̄0 (t) from operating data, and to a heat exchange system shown in Fig.3. This system
calculating distance between φ̄0 (t) and all of the φ̄(j) by has two tanks for storing hot water and cold water,
(8). respectively, and conducts heat exchange by flowing these
waters into a spiral heat exchanger shown in Fig.4. In this
Step2 Sorting the database in ascending order of their experiment, the flow of cold water is fixed at 2.0 L/min,
distance, choosing out data from Φ(2) to Φ(k) as neighbor the temperature of hot water in the tank is kept at 50 ◦ C.
data. Then, the temperature of the cold water is controlled by
adjusting the flow of hot water. Moreover, the sampling
Step3 Calculating local PID gains θ P ID (t) using the time is set as Ts = 5 s, and reference value is set as follows.
neighbor data by (22).
33 ◦ C (0 ≤ t < 200s)
r(t) = . (33)
Step4 yr (t), r̃(t) and Δũ(t) are calculated by (25), (26) 40 ◦ C (200 ≤ t ≤ 400s)
and (30). First, the control result with the fixed PID controller is
shown in Fig.5. PID gains are determined as follows by
Step5 Calculating correction terms by (31) and calcu- the CHR method.
lating θ new
P ID (t) by (23).
KP = 3.62, KI = 0.65, KD = 3.75. (34)
Step6 Updating PID gains in the database by (32). Next, an initial database is designed by the above I/O

345
IFAC DYCOPS-CAB, 2016
June 6-8, 2016. NTNU, Trondheim, Norway

45
Temp. HEX[deg.]
this study, the updating rule of the PID parameters is
40
derived based on the criterion of the E-FRIT method;
35
it is shown that the database can be updated in an
30
offline manner using a set of experimental I/O data. The
25
effectiveness of the proposed method is experimentally
20
0 50 100 150 200 250 300 350 400 evaluated by applying it to a spiral heat exchanger. The
100
results show that the tracking property of the closed
loop system is improved by only a set of experimental
Valve aperture[%]

80

60
data. However, in this method, the calculated control
40
parameters may converge to a local solution because the
20
method obtains optimum parameters using the steepest
0
descent method. In the future, more suitable optimization
0 50 100 150 200 250 300 350 400
time [s] calculations of the PID gains should be considered.

Fig. 5. Control result using the fixed PID controller. ACKNOWLEDGEMENTS


45

In the experiment of the heat exchange system, we have


Temp. HEX[deg.]

40

35 had the support and encouragement of Akira Inoue who


30 is a professor in Okayama university. We would like to
25 express our gratitude to him.
20
0 50 100 150 200 250 300 350 400
REFERENCES
100
Chien, K.L., Hrones, J.A., and Reswick, J.B. (1952). On
Valve aperture[%]

80

60
the automatic control of generalized passice systems.
40
Trans. ASME, 74, 175–185.
20
Kaneko, O., Souma, S., and Fujii, T. (2005). A fictitious
0 reference iterative tuning (frit) in the two-degree of
0 50 100 150 200 250 300 350 400
time [s] freedom control scheme and its application to closed
loop system identification. Proc. of 16th IFAC World
Fig. 6. Control result using the proposed method. congress (CD-ROM).
Kano, M., Tasaka, K., Ogawa, M., Takinami, A., Taka-
4 hashi, S., and Yoshii, S. (2011). Extended fictitious
P

reference iterative tuning and its application to chemical


K

0
processes. Proc. of the 2011 4th International Sympo-
0 50 100 150 200 250 300 350 400

2
sium on Advanced Control of Industrial Processes (AD-
1.5
CONIP), 379–384.
Masuda, S., Kano, M., and Yasuda, Y. (2010). A fictitious
KI

0.5
reference iterative tuning method with simultaneous
0 50 100 150 200 250 300 350 400
delay parameter tuning of the reference model. Inter-
5
4
national Journal of Innovative Computing, Information
and Control, 6(4), 1–13.
D

3
K

2
Nakpong, N. and Yamamoto, S. (2012). Just-in-time
1
0 50 100 150 200
time [s]
250 300 350 400 predictive control for a two-wheeled robot. Proc. ICT-
KE2012, 95–98.
Fig. 7. Trajectories of PID gains corresponding to Fig.6. Stenman, A., Gustafsson, F., and Ljung, L. (1996). Just
in time models for dynamical systems. 35th IEEE
data and is learned by the proposed method. Where, Conference on Decision and Control, 1115–1120.
design parameters for the learning are set as k = 3, Vilanova, R. and Visioli, A. (2012). PID control in the
λ = 1.0, and the learning coefficients are set as follows. Third Millennium : lessons learned and new approaches.
η = [0.1 0.01 0.1]. (35) Springer.
In addition, the design polynomial is designed as follows. Wakitani, S., Nishida, K., Nakamoto, M., and Yamamoto,
T. (2013). Design of a data-driven pid controller using
P (z −1 ) = 1 − 1.03z −1 + 0.264z −2 . (36) operating data. Proc. 11th IFAC International Work-
The control result by the proposed DD-PID controller is shop on Adaptation and Learning in Control and Signal
as shown in Fig.6. Also, the trajectories of PID gains are Processing, 587–592.
shown in Fig.7. From the results, the tracking property of Yamamoto, T., Takao, K., and Yamada, T. (2009). Design
the closed loop system is improved by adjusting PID gains of a data-driven pid controller. IEEE Trans. Control
adaptively. Systems Technology, 17(1), 29–39.
Zheng, Q. and Kimura, H. (2001). Just-in-time pid control.
5. CONCLUSIONS The 44th Japan Joint Automatic Control Conference,
336–339.
In this paper, a new offline learning method of a DD-PID Ziegler, J.G. and Nichols, N.B. (1942). Optimum settings
controller based on the E-FRIT method is proposed. In for automatic controllers. Trans. ASME, 64(8), 759–768.

346

Anda mungkin juga menyukai