_ _
ABSTRACT
Recently, Support Vector Regression (SVR) has~beena p
plied to financial time series prediction. Typical characteristics of financial time series are non-stationary and noisy
in nature. The volatility, usually time-varying, of the time
series is therefore some valuable information about the series. Previously. we had proposed to use the volatility to
adaptively change the width of the margin of SVR. We have
noticed that upside margin and downside margin do not necessary be the same, and we have observed that their choice
would affect the upside risk, downside risk and as well as
the overall prediction result. In this paper, we introduce
a novel approach to adapt the asymmetrical margins using
momentum. We applied and compared this method to predict the Hang Seng Index and Dow Jones Industrial Average.
1. INTRODUCTION
SVR is a recently indoduced approach to regression problems [51. It is avariationofSupponVectorMacbine(SVM),
which was developed by Vapnik and his co-workers [8,9].
Nowadays, SVR has been successfully applied to time series prediction [3,2] and financial forecasting [7,6].
In general, SVR uses the E-insensitive loss function to
measure the empirical risk and minimizes the regression error based on the Structural Risk Minimization (SRM)principle [SI. The E-insensitive loss function contains a fixed
and symmetrical margin and it is insensitive and non-adaptive
to the input data which may result in less-than-optimalperformance in the testing data
For financial data, due to the embedded noise, one must
set a suitable margin in order to improve the accuracy of
prediction. In [lo], we have extended the standard SVR
with adaptive margin and classified it as four cases: Fixed
and Symmetrical Margin (FASM), Fixed and Asymmehical Margin (FAAM), Non-tixed and Symmetrical Margin
1398
/'
\
/
/
_.
C(a;
f(z)= (w
$(.)I + b,
(1)
subject to
N
C(a;- a f )= 0, a;,
(
I
:
r(.)
0,
Iy - f(z)l - E ,
if IY - fWl < 6
otherwise
(3)
f(z)=
'
as,
0,
(7)
if
(4)
C(U;
- 4 ) ( $.$(z;))
( ~ ) + b.
i=l
In[10], wehaveintroducedageneraltypeofc-insensitive
loss function, r'(f(z;)- y;), i = 1,... ,N, which is given
Y. - f(G)
-U(.;)>
fb)- Y; - d(zi),
(6)
r ( m- Y) =
E [O, q,
i=l
+ c; - y; +
(I:(~(+
z ;c;) + y; (I~(z~(z;)
. $ ( x i ) ) + b) = 0 (8)
( W . $(z;))- b) = 0
(W
and
(C-aJC;
= 0
(C-orf)~: = 0
1399
Since ai . a: = 0 and
be computed as follows:
b=
y, - ( W . d(z,))-u(z,),
yi (v C$(Zi)) d(z;),
f o r a i E (0,C)
for a;E (0, C)
2.2. Momentum
~.
= XI .u(z,) p . A(z,)i = 1 , . .. ,N ,
U(.,)
d(zi)
where ~(2;)
is the standard deviation of input z i which determines the width of margin and A(z,) is the momentum
at point xi.X1, Xz are two positive constants called the coefficients of the margin width. p is a non-negative constant
called the coefficient of momentum. Thus, when p = 0,
it becomes the NASM case. When p # 0 and A(z) > 0,
the up margin is larger than the down margin and we use
them to under-predict the stock price. While p # 0 and
A(z) < 0, the up margin is smaller than the down margin
and we use them to over-predict the stock price.
As a matter of fact, there are many ways to calculate the
momentum. For example, it may be set as a constant. In
this paper, we will consider using the Exponential Moving
Average (EMA), which is time-varying and can reflect the
up trend and down tendency of the financial time series data
An n-days EMA is calculated by
EMAi
= EMA;-1 x
( 1 - r ) + y; x r,
We use the following statistical metrics to evaluate the prediction performance in Table 1, including Mean Absolute
HSI
DlTA
02/01/1998 - 04/07/w)
02/01/1998 - 29/06/00
16WO
so00
T
2-22
1400
type
NASM
NAAM
30
50
100
IO
11
MAE
UMAE
216.78
222.43
218.18
217.93
21650
104.58
115.64
114.04
113.38
113.04
..
I 100 II 84.80 I
~~
DMAE
112.20
106.79
104.14
104.55
I 103.46
42.41 I 42.39
O.*[
"
7-
1
Figure 3: DJIA and 30 days' EMA
dataset
HSI
DJIA
n
100
182.28
30
79.95
A
20.80
15.64
ratio
0.114
0.196
1401
5. ACKNOWLEDGEMENT
The work described in this paper was partially supported by
a grant from the Research Grants Council of the Hong Kong
Special Administration Region, China
6. REFERENCES
[l]Chih-Chung Chang and Chih-Jen Lin. Libsvm: a
library for support vector machines (version 2.31),
2001.
121 S . Mukhejee, E. Osuna, and E G i s i . Nonlinear prediction of chaotic time series using support vector machines. In J. Principe, L. Giles, N. Morgan, and E. Wilson, editors, IEEE Workshop on Neural Networks for
Signal Processing VII, page 51 I. IEEE Press, 1997.
. .
#hidden
MAE
3
5
7
9 .
88.31
98.44
90.53
87.23
UMAE
44.60
48.46
46.22
44.09
DMAE
43.71
49.98
44.31
43.14
[4] Ian T.Nabney. Netlab: Algoriihmsfor Panem-Recognition. Springer, London; New York, 2002.
[5] A. Smola and B. Schokopf. A tutorial on support vector regression. Technical Report NC2-TR-1998-030,
NewoCOLT2,1998.
[a] E. H. Tay and L. J. Cao. Application of support vector
machines to financial time series forecasting. Omega,
29309-317,2001.
[SI V. N. Vapnik. The Narure of Sratisrical Leaming Theory. Springer, New York, 1995.
[9] V. N. Vapnik. Staristical Learning Theory. Wiley, New
..
York, 1998.
2. The SVR algorithm using NASM-andNAAM outper- forms the AR model with same order.
~~
1402
port vector machine regression for volatile stock market prediction. In Third Inremarional Conference on
Intelligent Data Engineering and Auromnred Leaming(lDEAL.'OZJ,pages 391-396.Springe, 2002.