Applied Soft Computing 7 (2007) 1197–1208
www.elsevier.com/locate/asoc
Automatic extraction and identification of chart patterns
towards financial forecast
James N.K. Liu *, Raymond W.M. Kwong
Department of Computing, The Hong Kong Polytechnic University, Hong Kong
Available online 20 March 2006
Abstract
Technical analysis of stocks mainly focuses on the study of irregularities, which is a non-trivial task. Because one time scale alone cannot be
applied to all analytical processes, the identification of typical patterns on a stock requires considerable knowledge and experience of the stock
market. It is also important for predicting stock market trends and turns. The last two decades have seen attempts to solve such non-linear financial
forecasting problems using AI technologies such as neural networks, fuzzy logic, genetic algorithms and expert systems, but these, although
promising, lack explanatory power or are dependent on domain experts. This paper presents an algorithm, PXtract, to automate the recognition
process of possible irregularities underlying the time series of stock data. It makes dynamic use of different time windows, and exploits the potential
of wavelet multi-resolution analysis and radial basis function neural networks for the matching and identification of these irregularities. The study
provides room for case establishment and interpretation, which are both important in investment decision making.
© 2006 Elsevier B.V. All rights reserved.
Keywords: Forecasting; Wavelet analysis; Neural networks; Radial basis function network; Chart pattern extraction; Stock forecasting; CBR
1. Introduction
According to the efficient market theory, it is practically
impossible to infer a fixed long-term global forecasting model
from historical stock market information. It is said that if the
market presents some irregularities, someone will take advantage
of them and this will cause the irregularities to disappear. But this
does not exclude the possibility that hidden short-term local conditional
irregularities exist; we can still take
advantage of the market if we have a system which can identify
the hidden underlying short-term irregularities when they occur.
The behavior of these irregularities is mostly non-linear amid
many uncertainties inherent in the real world. In general, the
response of most investors to those irregularities follows the golden rule,
“buy low, sell high”. If one foresees that the
stock prices will have a certain degree of upward movement, one
will buy the stocks. In contrast, if one foresees that a certain
degree of drop will happen, one will sell the stocks on hand. This
gives rise to the problems of which irregularities we should focus on,
which forecasting techniques we can deploy, which effective indicators we
can assemble, and which data and features we can select to
facilitate modeling and sound investment decisions.
* Corresponding author.
E-mail addresses: csnkliu@comp.polyu.edu.hk (James N.K. Liu),
cskwong@comp.polyu.edu.hk (Raymond W.M. Kwong).
1568-4946/$ – see front matter © 2006 Elsevier B.V. All rights reserved.
doi:10.1016/j.asoc.2006.01.007
Since the late 1980s, advances in technology have allowed
researchers in finance and investment to solve non-linear
financial forecasting problems using artificial intelligence
technologies including neural networks [1–4], fuzzy logic [5–
7], genetic algorithms and expert systems [8]. These methods
have all shown promise, but each has its own advantages and
disadvantages. Neural networks and genetic algorithms have
produced promisingly accurate and robust predictions, yet they
lack explanatory power and investors show little confidence in
their recommendations. Expert systems and fuzzy logic provide
users with explanations but usually require experts to set up the
domain knowledge. Last but not least, none of these expert
systems can learn.
In this paper we introduce an algorithm, PXtract, to automate
the recognition process of possible irregularities underlying the
time series of stock data. It makes dynamic use of different time
windows, and exploits the potential of using wavelet multi-resolution analysis and radial basis function neural networks for
the matching and identification of these irregularities.
2. Related work
Many financial researchers believe that there are some
hidden indicators and patterns underlying stocks [9]. Weinstein
[10] found that every stock has its own characteristics. Stocks mainly
fall into five categories: finance, utilities, property,
commercial/industrial, and technology. Stock price movements in different categories depend on different factors. It
is difficult to identify which factors will affect a particular stock’s
price movement. To address the problem, we explored the use of
a genetic algorithm to provide a dynamic mechanism for selecting
appropriate factors from available fundamental data and
technical indicators [11]. Our investigation of the HK stock
market included potential parameters in fundamental data such
as daily high, daily low, daily opening, daily closing, daily
turnover, gold price, oil price, HK/US dollar exchange rate, HK
deposit call, HK interbank call, HK prime rate, silver price, and
Hang Seng index comprising 33 stocks from the said five
categories. The aggregate market capitalization of these stocks
accounts for about 79% of the total market capitalization on The
Stock Exchange of Hong Kong Limited (SEHK).
On the other hand, for the technical indicators, we examined
the influences of popular indicators such as the relative strength
index (RSI), moving average (MA), stochastic and Bollinger
bands, prices/index movements, time lags and several data
transformations [12,13]. Each of these indicators provides
guidance for investors to analyze the trend of the stocks’ price
movements. In particular, the RSI is quite useful to technical
analysts in chart interpretation. The theoretical basis of the
relative strength index is the concept of momentum. A
momentum oscillator is used to measure the velocity or rate of
change of price over time. It is essentially a short-term trading
indicator and also quite effective in extracting price information
for a non-trending market. In short, the total number of
potential inputs being tested was 57 [11]. We applied GAs to
determine which input parameters are optimal for different
stock modeling in Hong Kong. The fitness value of the
chromosome in the genetic algorithm was the classification rate
of the neural network. It was calculated by counting on how
many days the network’s output matched the derived ‘‘best
strategy’’. We defined the best strategy at trading time t as:
$$
\text{best strategy} =
\begin{cases}
\text{buy} & \text{if } \dfrac{\mathrm{price}(t+1) - \mathrm{price}(t)}{\mathrm{price}(t)} > z\% \\[2ex]
\text{sell} & \text{if } \dfrac{\mathrm{price}(t+1) - \mathrm{price}(t)}{\mathrm{price}(t)} < -z\% \\[2ex]
\text{hold} & \text{otherwise}
\end{cases}
$$
where $z$ is the decision threshold, and the output of the network
is encoded as 1, 0, and −1 corresponding to the suggested
investment strategies ‘buy’, ‘hold’, ‘sell’, respectively. We
observed that the daily closing price and its transformation
were the most sensitive input parameters for the stock forecast.
In contrast, technical indicators such as RSI and MA were not
critical in those experiments. As such, we feel confident to
concentrate on the investigation of the closing price movements
for possible trends and irregularities. This will be the subject of
chart pattern analysis below.
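The labelling rule and the fitness measure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `best_strategy` and `classification_rate` are hypothetical, the sell condition is assumed to be a fall of more than z%, and the fitness is expressed as a fraction of matching days rather than a raw count.

```python
def best_strategy(prices, z):
    """Label each trading day t with 1 (buy), 0 (hold) or -1 (sell).

    z is the decision threshold in percent. The last day has no
    price(t + 1), so it receives no label.
    """
    labels = []
    for today, tomorrow in zip(prices, prices[1:]):
        change_pct = (tomorrow - today) / today * 100.0
        if change_pct > z:
            labels.append(1)       # rise of more than z%: buy
        elif change_pct < -z:
            labels.append(-1)      # fall of more than z% (assumed): sell
        else:
            labels.append(0)       # otherwise: hold
    return labels


def classification_rate(network_outputs, labels):
    """Fraction of days on which the network output equals the label."""
    matches = sum(1 for out, lab in zip(network_outputs, labels) if out == lab)
    return matches / len(labels)


labels = best_strategy([100.0, 103.0, 102.0, 96.0], z=2.0)
rate = classification_rate([1, 0, 0], labels)
```

Here the three day-to-day changes are roughly +3%, −1% and −6%, so with z = 2 the labels are buy, hold and sell, and a network that outputs buy, hold, hold matches on two of the three days.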
3. Wave pattern identification
According to Thomas [14], there are up to 47 different chart
patterns, which can be identified in stock price charts. These
chart patterns play a very important role in technical analysis
with different chart patterns revealing different market trends.
For example, a head-and-shoulders top chart pattern suggests
that the market will most likely decline by 20–30% in the
near future. Successfully identifying the chart pattern is said
to be the crucial step towards a winning trade. Fig. 1 shows 16 samples
of typical chart patterns.
However, the analysis and identification of wave patterns is
difficult for two reasons. Firstly, there exists no single time
scale that works for all analytical purposes. Secondly, any stock
chart may exhibit countless different pattern combinations,
some containing sub-patterns. Choosing the most representative one presents quite a dilemma. Furthermore, there is no readily
available report of research on the automatic process of
identifying chart patterns. We address this problem using the
following algorithm.
3.1. The PXtract algorithm
The PXtract algorithm extracts wave patterns from stock
price charts based on the following phases:
3.1.1. Window size phase
As there is hardly a single time scale that works for all
analytical purposes in a wave identification process [2,29], a set
of time window sizes $W = \{w_1, w_2, \ldots, w_n \mid w_1 > w_2 > \cdots > w_n\}$
is defined ($w_i$ is the window size for $1 \le i \le n$).
Different window sizes are used to determine whether a wave
pattern occurs in a specific time range. For example, in a short-term investment strategy, a possible set of window sizes can be defined
as $w_i \in W = \{40, 39, \ldots, 10\}$.
3.1.2. Time subset generation phase
Stock price trading data contain a set of time data
$T = \{t_1, t_2, \ldots, t_n \mid t_1 > t_2 > \cdots > t_n\}$. For a given time window size $w_i$, $T$
will be divided into a temporary subset $T'$. A set $P$ is also
defined, where $P \subseteq T$. It contains the time ranges in which
previously identified wave patterns have occurred. Set $P$ is empty ($\emptyset$) at
the beginning.
It is said that any large change in a trend plays a more
important role in the prediction process [13]. A range which
has previously been discovered to contain a wave pattern will
not be tested again (i.e. if $T' \subseteq P$, tests will not be carried out).
Details of the generation process for the time subset $T'$ are shown in
Fig. 2.
For example, let $T$ = {10 Jan, 9 Jan, 8 Jan, 7 Jan, 6 Jan, 5 Jan, 4
Jan, 3 Jan, 2 Jan, 1 Jan}, let the current testing window size be 3
($w = 3$), and let $P$ = {9 Jan, 8 Jan, 7 Jan, 6 Jan}. After the time
subset generation process, $T'$ = {(5 Jan, 4 Jan, 3 Jan), (4 Jan, 3
Jan, 2 Jan), (3 Jan, 2 Jan, 1 Jan)}.
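The example above can be reproduced with a short Python sketch. The exact procedure is given only in Fig. 2, so the rule used here is an assumption inferred from the example: a window is kept only if it shares no time point with the set of already identified ranges. The function name `gen_set` mirrors the genSet function mentioned in the text, but its signature is hypothetical.

```python
def gen_set(times, window, found):
    """Return every run of `window` consecutive time points that shares
    no point with `found` (the set P of already identified ranges).

    `times` is ordered from the most recent point to the oldest.
    Assumption: any overlap with P excludes the window.
    """
    found = set(found)
    subsets = []
    for start in range(len(times) - window + 1):
        candidate = tuple(times[start:start + window])
        if not found.intersection(candidate):
            subsets.append(candidate)
    return subsets


T = ["10 Jan", "9 Jan", "8 Jan", "7 Jan", "6 Jan",
     "5 Jan", "4 Jan", "3 Jan", "2 Jan", "1 Jan"]
P = {"9 Jan", "8 Jan", "7 Jan", "6 Jan"}
subsets = gen_set(T, 3, P)
```

With these inputs the sketch returns the three windows listed in the example, starting at 5 Jan, 4 Jan and 3 Jan.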
3.1.3. Pattern recognition
For a given set of time $T'' \mid T'' \subseteq T'$, apply the wavelet theory
to identify the desired sequences. If a predefined wave pattern is
discovered, add $T''$ to $P$. Details are described below.
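Taken together, the three phases suggest a simple driver loop, sketched below in Python. This is not the algorithm of Fig. 3, which is not reproduced in this text; it is a minimal reading of the phases under stated assumptions: windows are tried from the largest size to the smallest, any window overlapping an already identified range is skipped, and the matcher `matches_pattern` is a hypothetical callback standing in for the wavelet-based recognition step.

```python
def pxtract(times, window_sizes, matches_pattern):
    """Scan `times` with every window size and collect matched ranges.

    `matches_pattern` is a hypothetical callback that receives one
    candidate window (a list of time points) and returns True when a
    predefined wave pattern is recognised in it.
    """
    found = set()   # the set P, empty at the beginning
    hits = []       # matched windows, in order of discovery
    for w in sorted(window_sizes, reverse=True):      # w1 > w2 > ... > wn
        for start in range(len(times) - w + 1):
            candidate = times[start:start + w]
            if found.intersection(candidate):
                continue                              # range already explained
            if matches_pattern(candidate):
                found.update(candidate)               # add the range to P
                hits.append(tuple(candidate))
    return hits, found


# Toy matcher: a window "contains a pattern" when its values sum to 9.
hits, found = pxtract(list(range(10)), [2, 3], lambda c: sum(c) == 9)
```

In this toy run the window (2, 3, 4) is found at size 3, so the size-2 window (4, 5), which would also satisfy the toy matcher, is never tested because it overlaps a range already stored in P.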
The proposed algorithm PXtract is given in Fig. 3. The
function genSet($w_i$) is the subset generation process discussed
earlier. At the end of the algorithm, all the time information of
the identified wave patterns is stored in set $P$.
Pattern matching can be carried out using simple multi-resolution (MR) matching or radial basis function neural
network (RBFNN) matching. Details of the wavelet recognition
and simple MR matching can be found in our previous work
[15].
Fig. 1. Samples of typical chart patterns [14].
Fig. 2. Time subset generation.
Fig. 3. Algorithm PXtract.
4. Wavelet recognition and matching
Wavelet analysis is a relatively recent development in applied
mathematics, dating from the 1980s. It has since been applied widely with
encouraging results in signal processing, image processing and
pattern recognition [16]. As the waves in stock charts are 1D
patterns, no transformation from higher dimension to 1D is
needed. In general, wavelet analysis involves the use of a
univariate function $\psi$, defined on $\mathbb{R}$, which, when subjected to the
fundamental operations of shifts and dyadic dilation, yields
an orthogonal basis of $L^2(\mathbb{R})$.
The orthonormal basis of compactly supported wavelets of
$L^2(\mathbb{R})$ is formed by the dilation and translation of a single
function $\psi(x)$:
$$\psi_{j,k}(x) = 2^{j/2}\,\psi(2^{j}x - k)$$
where $j, k \in \mathbb{Z}$. Vanishing moments means that the basis functions are chosen to be orthogonal to the low degree polynomials. It is said that a function $\varphi(x)$ has a vanishing $k$th moment
at point $t_0$ if the following equality holds with the integral
converging absolutely:
$$\int (t - t_0)^{k}\,\varphi(t)\,dt = 0$$
The function $\psi(x)$ has a companion, the scaling function $\phi(x)$,
and these functions satisfy the following relations:
$$\phi(x) = \sqrt{2}\sum_{k=0}^{L-1} h_k\,\phi(2x - k)$$
$$\psi(x) = \sqrt{2}\sum_{k=0}^{L-1} g_k\,\phi(2x - k)$$
where $h_k$ and $g_k$ are the low- and high-pass filter coefficients,
respectively, $L$ is related to the number of vanishing moments $k$
and $L$ is always even. For example, $L = 2k$ in the Daubechies
wavelets.
$$g_k = (-1)^{k}\,h_{L-k-1}, \qquad k = 0, \ldots, L-1$$
$$\int_{-\infty}^{+\infty} \phi(x)\,dx = 1$$
The filter coefficients are assumed to satisfy the orthogonality relations:
$$\sum_{n} h_n h_{n+2j} = \delta(j)$$
$$\sum_{n} h_n g_{n+2j} = 0$$
for all $j$, where $\delta(0) = 1$ and $\delta(j) = 0$ for $j \neq 0$.
4.1. Multi-resolution analysis
Multi-resolution analysis (MRA) was formulated based on the
study of orthonormal, compactly supported wavelet bases [17].
The wavelet basis induces an MRA on $L^2(\mathbb{R})$, the decomposition of
the Hilbert space $L^2(\mathbb{R})$ into a chain of closed sub-spaces
$$V_4 \subset V_3 \subset V_2 \subset V_1 \subset V_0$$
such that
$$\bigcap_{j \in \mathbb{Z}} V_j = \{0\} \quad \text{and} \quad \bigcup_{j \in \mathbb{Z}} V_j = L^2(\mathbb{R})$$
$$f(x) \in V_j \iff f(2x) \in V_{j+1}$$
$$f(x) \in V_0 \iff f(x - k) \in V_0$$
$$\exists\, \psi \in V_0 : \{\psi(x - k)\}_{k \in \mathbb{Z}} \text{ is an orthogonal basis of } V_0$$
In pattern recognition, a 1D pattern, $f(x)$, can always be
viewed as a signal of finite energy, such that
$$\int_{-\infty}^{+\infty} |f(x)|^{2}\,dx < +\infty$$
This is mathematically equivalent to $f(x) \in L^2(\mathbb{R})$. It means that
MRA can be applied to the function $f(x)$ and can decompose it
in the $L^2(\mathbb{R})$ space. In MRA, a closed sub-space $V_j$ can be
decomposed orthogonally as:
$$V_j = V_{j+1} \oplus W_{j+1} \qquad (1)$$
$V_j$ contains the low-frequency signal component of $V_{j-1}$ and $W_j$
contains the high-frequency signal component of $V_{j-1}$. According to the wavelet orthonormal decomposition as shown in
Eq. (1), $V_j$ is first decomposed orthogonally into a low-frequency sub-space $V_{j+1}$ and a high-frequency sub-space $W_{j+1}$. The low-frequency sub-space
$V_{j+1}$ is further decomposed into $V_{j+2}$ and $W_{j+2}$, and the process
can be continued. The above wavelet orthonormal decomposition can be represented by
$$V_j = W_{j+1} \oplus V_{j+1} = W_{j+1} \oplus W_{j+2} \oplus V_{j+2} = W_{j+1} \oplus W_{j+2} \oplus W_{j+3} \oplus V_{j+3} = \ldots$$
According to Tang et al. [16], projective operators $A_j$ and $D_j$
are defined as:
$$A_j : L^2(\mathbb{R}) \to V_j \quad \text{(projective operator from } L^2(\mathbb{R}) \text{ to } V_j\text{)}$$
$$D_j : L^2(\mathbb{R}) \to W_j \quad \text{(projective operator from } L^2(\mathbb{R}) \text{ to } W_j\text{)}$$
Since $f(x) \in V_j \subset L^2(\mathbb{R})$:
$$f(x) = A_j f(x) = \sum_{k \in \mathbb{Z}} c_{j,k}\,\phi_{j,k}(x) = A_{j+1} f(x) + D_{j+1} f(x) = \sum_{m \in \mathbb{Z}} c_{j+1,m}\,\phi_{j+1,m}(x) + \sum_{m \in \mathbb{Z}} d_{j+1,m}\,\psi_{j+1,m}(x)$$
Also, Tang et al. [16] have proved the following equations:
$$c_{j+1,m} = \sum_{k} h_k\,c_{j,k+2m} \qquad (2)$$
$$d_{j+1,m} = \sum_{k} g_k\,c_{j,k+2m} \qquad (3)$$
According to the wavelet orthonormal decomposition as shown
in Eq. (1), the original signal $V_0$ can be decomposed orthogonally into a high-frequency sub-space $W_1$ and a low-frequency sub-space $V_1$ by using the wavelet transform Eqs. (2)
and (3). In the chart pattern recognition process, $V_0$ should be
the original wave pattern, while $V_1$ and $W_1$ should be the
wavelet-transformed sub-patterns.
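One level of the decomposition in Eqs. (2) and (3) can be written directly from the filter sums. The Python sketch below is an illustration only: it uses the two-tap Haar filter as the low-pass filter, derives the high-pass filter from the relation $g_k = (-1)^k h_{L-k-1}$, and wraps indices periodically at the end of the signal. The boundary handling and the function names are assumptions, not details given in the paper.

```python
import math


def highpass_from_lowpass(h):
    """Build the high-pass filter: g_k = (-1)^k * h_(L-k-1)."""
    L = len(h)
    return [(-1) ** k * h[L - k - 1] for k in range(L)]


def decompose(c, h):
    """One decomposition level following Eqs. (2) and (3).

    c: coefficients c_(j,k) at level j (even length assumed).
    h: low-pass filter coefficients h_k.
    Indices k + 2m wrap around periodically (an assumption).
    """
    g = highpass_from_lowpass(h)
    n = len(c)
    approx, detail = [], []
    for m in range(n // 2):
        approx.append(sum(h[k] * c[(k + 2 * m) % n] for k in range(len(h))))
        detail.append(sum(g[k] * c[(k + 2 * m) % n] for k in range(len(h))))
    return approx, detail


HAAR = [1.0 / math.sqrt(2.0), 1.0 / math.sqrt(2.0)]
approx, detail = decompose([4.0, 6.0, 10.0, 12.0], HAAR)
```

For the signal (4, 6, 10, 12) the low-frequency part is (10/√2, 22/√2) and the high-frequency part is (−2/√2, −2/√2); the sum of squares of the two outputs equals that of the input, as expected for an orthonormal filter pair. Applying `decompose` again to `approx` gives the next, coarser level.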
If we want to analyze the current data to determine whether it
is a predefined chart pattern, a template of the chart pattern is
needed. Because the input data are noisy, directly comparing the
data with the template will lead to an incorrect result.
Therefore, wavelet decomposition should be applied to both
the input data and the template. An example of matching the input
data to a “head-and-shoulders, top” pattern is illustrated in
Fig. 4.
We can match sub-patterns using either a range of coarse-to-fine scales, or by matching the input data with features in the
pattern template. The matching process is terminated only
when the target is accepted or rejected. If the result is
undetermined, it continues at the next, finer scale. The
coarse scale coefficients obtained from the low pass filter
represent the global features of the signal.
For a high-resolution scale, the intraclass variance will be
larger than for a low-resolution scale. A threshold scale
should be defined to determine the acceptance level. For
example, scale $n$ is defined as the lowest resolution. The
resolution threshold is $t$, with $t > n$. At each resolution $t$, its
root-mean-square value should be greater than another threshold
value $\lambda$, which is called the level threshold. It is difficult to
derive optimal thresholds; therefore, we need to determine
them through empirical testing. Fig. 5 illustrates the details of
the process.
Fig. 4. Wavelet decomposition in both input data and chart pattern template.
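The coarse-to-fine control flow described above can be illustrated with a small Python sketch. The actual decision rules are given only in Fig. 5, so everything specific here is an assumption: the error measure (root-mean-square difference), the two hypothetical thresholds `accept` and `reject`, and the function names. The sketch only shows the idea of stopping as soon as a level gives a clear answer and otherwise moving to the next, finer scale.

```python
import math


def rms_error(a, b):
    """Root-mean-square difference between two equally long sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))


def coarse_to_fine_match(levels, accept, reject):
    """Compare input and template level by level, coarsest first.

    levels: list of (input_coeffs, template_coeffs) pairs.
    accept, reject: hypothetical thresholds with accept < reject.
    Returns (decision, index of the level at which it was taken).
    """
    for index, (signal, template) in enumerate(levels):
        err = rms_error(signal, template)
        if err <= accept:
            return "accept", index        # clearly similar: stop here
        if err >= reject:
            return "reject", index        # clearly different: stop here
        # otherwise undetermined: continue at the next, finer scale
    return "undetermined", len(levels) - 1


levels = [
    ([1.0, 1.0], [1.2, 1.2]),                      # coarse scale
    ([1.0, 2.0, 1.0, 2.0], [1.0, 2.0, 1.0, 2.0]),  # finer scale
]
decision = coarse_to_fine_match(levels, accept=0.05, reject=0.5)
```

In this toy case the coarse level is inconclusive (error 0.2), so the comparison moves to the finer level, where the input and the template coincide and the target is accepted.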
4.2. Radial basis function neural network (RBFNN)
Neural networks are widely used to provide non-linear
forecasts [18–20] and have been found to be good in pattern
recognition and classification problems. The radial basis function
neural network (RBFNN), whose universal approximation capabilities were proven by Park and Sandberg [21,22], is
suitable for solving our pattern/signal matching problem [23].
We have created different RBFNNs for recognizing different
patterns at different resolution levels. The input of the network
is the wavelet-transformed values at a particular resolution.
As shown in Fig. 6, a typical network consists of three layers.
The first layer is the input layer, which has two portions: (1) past
network outputs that feed back to the network; (2) major correlative variables that are concerned with the prediction
problem. Past network outputs enter the network by means
of a time-delay unit as the first inputs. These outputs are also
affected by a decay factor $\gamma = \alpha e^{-\lambda k}$, where $\lambda$ is the decay
constant, $\alpha$ is the normalization constant, and $k$ is the forecast
horizon. In general, the proposed
network predicts the outcome of the sequence $x^{1}_{t+k}$ at
time $t + k$ based on the past observation sequence of size
$n$, i.e. $x^{1}_{t}, x^{1}_{t-1}, x^{1}_{t-2}, x^{1}_{t-3}, \ldots, x^{1}_{t-n+1}$, and on the major
variables that influence the outcome of the time series at time $t$.
The numbers of input nodes in the first and second portions are
set to $n$ and $m$, respectively. The number of hidden nodes is set
to $p$. The number of predictive steps is set to $k$, so the number of output
nodes is $k$. At time $t$, the inputs will be $[x^{1}_{t}, x^{1}_{t-1}, x^{1}_{t-2}, x^{1}_{t-3}, \ldots,
x^{1}_{t-n+1}]$ and $[x^{2}_{1}, x^{2}_{2}, \ldots, x^{2}_{m}]$, respectively. The output is given
by $x_{t+k}$, denoted by $p^{k}_{t}$ for simplicity, and $w^{t}_{ij}$ denotes the connection
weight between the $i$th node and the $j$th node at time $t$.
To simplify the network, the choice of the centers of the
Gaussian functions is determined by the K-means algorithm
[24]. The variances of the Gaussians are chosen to be equal to
the mean distance of every Gaussian center from its
neighboring Gaussian centers. A constructive learning
approach is used to select the number of hidden units in the
RBFNN. The hidden nodes can be created one at a time. During
the iteration we add one hidden node and check the new
network for errors. This procedure is repeated until the error
goal is met, or until the preset maximum number of hidden
nodes is reached. Normally, the preset maximum number of
hidden nodes should be less than the total number of input
patterns. The existence of fewer hidden nodes shows that the
network generalizes well though it may not be accurate enough.
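The centre and width selection described above can be sketched in plain Python. The sketch makes several assumptions that the paper does not specify: K-means is seeded with the first k points, "neighboring centers" is read as all other centres, the resulting mean distance is used as the Gaussian spread σ in exp(−‖x − c‖² / 2σ²), and the function names are hypothetical. Training of the output weights and the constructive addition of hidden nodes are left out.

```python
import math


def distance(p, q):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def kmeans(points, k, iterations=20):
    """Plain K-means; the first k points seed the centres (an assumption)."""
    centres = [tuple(p) for p in points[:k]]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: distance(p, centres[j]))
            clusters[nearest].append(p)
        for j, members in enumerate(clusters):
            if members:   # keep the old centre if a cluster is empty
                centres[j] = tuple(sum(col) / len(members)
                                   for col in zip(*members))
    return centres


def spreads(centres):
    """Mean distance from each centre to all other centres (needs k >= 2)."""
    k = len(centres)
    return [sum(distance(centres[i], centres[j])
                for j in range(k) if j != i) / (k - 1)
            for i in range(k)]


def hidden_outputs(x, centres, sigmas):
    """Gaussian activations of the hidden layer for one input vector."""
    return [math.exp(-distance(x, c) ** 2 / (2.0 * s ** 2))
            for c, s in zip(centres, sigmas)]


points = [(0.0, 0.0), (10.0, 0.0), (0.0, 1.0), (10.0, 1.0)]
centres = kmeans(points, k=2)
sigmas = spreads(centres)
activations = hidden_outputs((0.0, 0.5), centres, sigmas)
```

In this toy data set the two centres settle at (0, 0.5) and (10, 0.5), each with a spread of 10, so an input placed on the first centre activates the first hidden node fully and the second one only partially.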
5. Training set collections
Stock chart pattern identification is highly subjective, and
humans are far better than machines at recognizing stock
patterns which are meaningful to investors. Moreover,
extracting chart patterns from stock time series data is a time
consuming and expensive operation. We have examined five
typical stocks for the period 1 January 1995 to 31 December
2001 (see Table 1). A summary of the total numbers of real
training data for fourteen different chart patterns is shown in
Table 2. The training set of the chart patterns is collected based
on the judgment of a human critic following the rules suggested
by Thomas [14] from the real and deformed data described in
the following. The training set contains 308 records in total. A
quarter of the training set is extracted as the validation set. We
set the wavelet resolution equal to 8. We found that the signal/
pattern at resolution levels 1–3 was too smooth and the patterns
were too similar to one another at those levels. The network was not
able to recognize different patterns well. Therefore, only four
RBFNNs were created for training different chart patterns at the
resolution levels 4–7. The performance of the networks at
different resolution levels and the classification results are
shown in Section 6.
Fig. 5. The multi-resolution matching.
Fig. 6. Schematic diagram of a typical RBFNN.
In our training set, the initial quantity of data is insufficient
for training the system well. If we tried to extract over 200 chart
patterns in the time series data, it would be infeasible, time
consuming and expensive. In order to expand the training set,
we use a simple but powerful mechanism to generate more
training data based on the real data.
To generate more training samples, a radial deformation
method is introduced. Here are the major steps of the radial
deformation process:
(a) P = {p1, p2, p3, . . ., pn} is a set of data points containing a
chart pattern.
(b) Randomly pick $i$ points ($i \le n$) in set P for deformation.
(c) Randomly generate a set of the radial deformation distance
D = {d1, d2, . . ., di}.
(d) For each point in P, a random step dr is taken in a random
direction. The deformed pattern is constructed by joining
consecutive points with straight lines. Details are depicted
in Fig. 7.
(e) Have human critics judge whether the deformed pattern is acceptable.
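Steps (a) to (d) can be sketched as follows in Python; step (e), the human judgment, is outside the scope of code. The sketch assumes that each data point is an (x, y) pair, that only the randomly chosen points are moved, and that each deformation distance is drawn uniformly from 0 to a hypothetical `max_radius`. The paper does not state how the distances are distributed, and the function name is illustrative.

```python
import math
import random


def radial_deform(points, count, max_radius, seed=None):
    """Move `count` randomly chosen points by a random radial step.

    points: list of (x, y) pairs describing a chart pattern.
    count: number of points to deform (count <= len(points)).
    max_radius: upper bound of each deformation distance (assumed).
    Joining consecutive points of the result with straight lines
    gives the deformed pattern.
    """
    rng = random.Random(seed)
    deformed = [tuple(p) for p in points]
    for index in rng.sample(range(len(points)), count):
        radius = rng.uniform(0.0, max_radius)       # distance d_r
        angle = rng.uniform(0.0, 2.0 * math.pi)     # random direction
        x, y = deformed[index]
        deformed[index] = (x + radius * math.cos(angle),
                           y + radius * math.sin(angle))
    return deformed


pattern = [(0.0, 10.0), (1.0, 12.0), (2.0, 11.0), (3.0, 14.0), (4.0, 10.0)]
variant = radial_deform(pattern, count=3, max_radius=0.5, seed=1)
```

Each call with a different seed yields a new candidate pattern, which would then be shown to the human critic for acceptance or rejection.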
Psychophysical studies [25] tell us that humans are better
than machines at recognizing objects which are more
meaningful to humans. In assessing the generated training
data, the whole training set (including real and generated
data) is accepted and selected based on the opinion of the
human critic. In the training set, 64 chart patterns have been
extracted from five different stocks in the Hong Kong stock
market.
Table 1
The five different stocks and their stock IDs
Stock ID    Stock name
00341       CAFÉ DE CORAL HOLDINGS Ltd.
00293       CATHAY PACIFIC AIRWAYS Ltd.
00011       HANG SENG BANK Ltd.
00005       HSBC HOLDINGS PLC.
00016       SUN HUNG KAI PROPERTIES Ltd.
Fig. 7. Radial deformation. (a) An example of accepted deformed pattern. (b)
An example of NOT accepted deformed pattern.
Table 2
Total numbers of training patterns in fourteen typical chart patterns of five different stocks
By applying the radial deformation technique, the 64 real
training patterns were extended to a total of 308 patterns. All of
the patterns generated by radial deformation must be judged by
humans to identify whether a human would accept the
deformed pattern or find it meaningful. Fig. 8 illustrates
examples of (a) an accepted and (b) a rejected
deformed pattern. Fig. 9 shows the training set from both the
real and deformed chart patterns.
6. Experimental results
Two sets of experiments were conducted to evaluate the
proposed system. The first set evaluates whether
the algorithm PXtract is scalable, and the second set compares
the performance of simple multi-resolution matching and RBFNN matching.
Algorithm PXtract uses different time window sizes to
locate any occurrence of a specific chart pattern. The major
concern is the performance of the algorithm. To assess the
relative performance of the algorithms and to investigate their
scale-up properties, we performed experiments on an IBM PC
workstation with a 500 MHz CPU and 128 MB of memory. To evaluate
the performance of the algorithm using RBFNN matching over
a large range of window sizes, we used typical stock prices of
SUN HUNG KAI and Co. Ltd. (0086) for the period from 2
January 1992 to 31 December 2001.
As shown in Fig. 10, the execution time of the algorithm grows linearly as the size
of the time window increases.
Table 3
Optimal wavelet and thresholds setting found by empirical testing (wavelet family: Daubechies (DB2))
Resolution   Threshold   Accuracy   Total number of       Processing
threshold    value       (%)        patterns discovered   time (s)
4            0.3         6.2        8932                  312
4            0.2         7.1        7419
4            0.15        14.2       3936
4            0.1         43.1       543
5            0.3         7.1        7734                  931
5            0.2         9.4        6498
5            0.15        17.4       2096
5            0.1         53         420
6            0.3         8.9        7146                  3143
6            0.2         13.5       5942
6            0.15        19.9       1873
6            0.1         56.9       231
7            0.3         10.5       6023                  8328
7            0.2         14.5       5129
7            0.15        18.5       1543
7            0.1         48.3       194
Fig. 8. Accepted and NOT accepted deformed patterns by radial transformation.
In the experiments on wavelet chart patterns recognition,
different wavelet families were selected as the filter. The
maximum resolution level was set to be 7. The highest
resolution level 8 is taken as the raw input. The left hand side of
Fig. 11 shows the price of the stock CATHAY PACIFIC (00293)
for the period from 7 June 1999 to 22 July 1999. This period
contains the ‘Double Tops’ pattern. For the identification of the
chart patterns, two matching methods were studied — simple
multi-resolution (MR) matching and RBFNN matching,
respectively. For the simple MR matching, similarity between
the input and the template is measured by mean absolute
percentage error (MAPE). A low MAPE denotes that they are
similar. The performance of simple MR matching was tested in
experiments using different resolution threshold t and different
level threshold l.
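For reference, the MAPE between an input sequence and a template can be computed as in the Python sketch below. It assumes that the input values serve as the reference in the denominator and that none of them is zero; the paper does not state which of the two sequences is used as the reference, and the function name is illustrative.

```python
def mape(actual, template):
    """Mean absolute percentage error of `template` against `actual`.

    Returns a percentage. Values of `actual` must be non-zero, since
    they are used as the reference in the denominator (an assumption).
    """
    terms = [abs((a - t) / a) for a, t in zip(actual, template)]
    return 100.0 * sum(terms) / len(terms)


error = mape([100.0, 200.0], [110.0, 190.0])
```

Here the two relative deviations are 10% and 5%, so the MAPE is 7.5%; identical sequences give a MAPE of zero.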
Table 3 shows the most accurate combinations. We note
that simple MR matching is not accurate,
with an average recognition rate of just 30%. Furthermore,
the calculation of MAPE between the input data and the
pattern templates creates a heavy workload. Although it is
possible to reach a recognition rate of more than 50% if we
set the level threshold to a low value (about 0.1) and use a high resolution threshold (above 6), the processing time is
unacceptably long (about 3143 s). This illustrates that simple
MR matching is not a good choice for use in the matching
process.
Fig. 9. Training set from both the real and deformed chart patterns.
Fig. 10. Execution time of algorithm PXtract using RBFNN matching under
different time window sizes.
Table 4 illustrates the overall classification results. It shows
that the classification rate is over 90% and the optimal
recognition resolution level is 6. Four wavelet families were
tested, and their performances were more or less the same
except that the Haar wavelet was found to be not suitable for
use.
Fig. 11. Algorithm PXtract using wavelet multi-resolution analysis on the pattern “double tops” template.
Having found the appropriate setting for the RBFNN, we
applied it to extract all the chart patterns from 10 different stocks
over the last 10 years. Table 5 shows the accuracy for the 14
different chart patterns. The RBFNN is on average 81% accurate.
Multi-resolution RBFNN matching has a high accuracy in
recognizing different chart patterns. However, the accuracy
of the recognition process is heavily dependent on the
resolution level. Once the resolution level has been identified,
based on empirical testing, the proposed method is highly
accurate.
Table 4
RBFNNs: accuracy in different wavelet families and at different resolution
levels
Wavelet family       Resolution level   Training set (%)   Validation set (%)
Haar (DB1)           4                  66                 64
Haar (DB1)           5                  75                 72
Haar (DB1)           6                  81                 78
Haar (DB1)           7                  87                 74
Daubechies (DB2)     4                  73                 64
Daubechies (DB2)     5                  85                 78
Daubechies (DB2)     6                  95                 91
Daubechies (DB2)     7                  97                 85
Coiflet (C1)         4                  77                 72
Coiflet (C1)         5                  86                 81
Coiflet (C1)         6                  95                 90
Coiflet (C1)         7                  98                 84
Symmlet (S8)         4                  75                 68
Symmlet (S8)         5                  84                 78
Symmlet (S8)         6                  93                 89
Symmlet (S8)         7                  96                 82
Table 5
Accuracy of identifying the fourteen different chart patterns using RBFNN
extraction methods
Chart pattern                                          Accuracy (%)
Broadening bottoms                                     73
Broadening formations, right-angled and ascending      84
Broadening formations, right-angled and descending     81
Broadening tops                                        79
Broadening wedges, ascending                           86
Bump-and-run reversal bottoms                          83
Bump-and-run reversal tops                             82
Cup with handle                                        63
Double bottoms                                         92
Double tops                                            89
Head-and-shoulders, top                                86
Head-and-shoulders, bottoms                            87
Triangles, ascending                                   73
Triangles, descending                                  76
7. Conclusion and future work
In this paper, we examined the sensitive factors associated
with stock forecast and stressed the importance of chart pattern
identification. We have demonstrated how to automate the
process of chart pattern extraction and recognition, which has
not been discussed in previous studies. The PXtract algorithm
provides a dynamic means for extracting all the possible
chart patterns underlying stock price charts. With RBFNN matching at a
suitable resolution level, PXtract achieved an average recognition
accuracy of 81% across the 14 chart patterns tested.
Currently, we have analyzed only 14 representative
chart patterns and templates from a total of 308 training
samples. According to Thomas [14], there are 47
different chart patterns in total, which can be extracted from the time
series data. In order to complete the system, the future direction
of work will be to build templates for the remaining chart
patterns.
On the other hand, the identification and extraction of the
chart patterns enable us to establish cases for interpretation and
stock forecast. We regard these chart patterns as potentially
suitable for case representation in a CBR system. It may be
worthwhile revisiting the selection of indicators associated with
the relevant chart patterns in order to form a feature vector (e.g.
time range, RSI, OBV, price moving average, wave pattern).
We might then compare these feature vectors, $v_1, v_2 \in [a, b]$,
such that the similarity of $v_1$ and $v_2$ is computed by the
following expression:
$$\mathrm{sim}(v_1, v_2) = 1 - \frac{|v_1 - v_2|}{b - a} \quad \text{for } b \neq a$$
For the attribute “class pattern” in the feature vector, the
similarity measure of the attribute between the two cases can be
measured by the following expression:
$$\mathrm{sim}(v_1, v_2) = \begin{cases} 1 & \text{if } v_1 = v_2 \\ 0 & \text{otherwise} \end{cases}$$
The overall similarity between two cases $c_1$ and $c_2$ is
measured by the weighted-sum metric shown below:
$$\mathrm{sim}(c_1, c_2) = \frac{\sum_{i=1}^{n} w_i\,\mathrm{sim}(v_{1i}, v_{2i})}{\sum_{i=1}^{n} w_i}$$
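The three similarity expressions can be combined as in the following Python sketch. The attribute values, ranges and weights in the example are made up for illustration, and the function names are hypothetical; the paper does not give concrete weights.

```python
def numeric_sim(v1, v2, a, b):
    """Similarity of two numeric attribute values lying in [a, b], b != a."""
    if a == b:
        raise ValueError("the range [a, b] must have non-zero width")
    return 1.0 - abs(v1 - v2) / (b - a)


def class_sim(v1, v2):
    """Similarity of the 'class pattern' attribute: 1 if equal, else 0."""
    return 1.0 if v1 == v2 else 0.0


def case_sim(attribute_sims, weights):
    """Weighted-sum similarity of two cases from per-attribute similarities."""
    weighted = sum(w * s for w, s in zip(weights, attribute_sims))
    return weighted / sum(weights)


# Illustrative case pair: an RSI value in [0, 100] and a pattern label.
rsi_sim = numeric_sim(30.0, 70.0, 0.0, 100.0)
pattern_sim = class_sim("double tops", "double tops")
overall = case_sim([rsi_sim, pattern_sim], weights=[1.0, 3.0])
```

With these made-up values the RSI similarity is 0.6, the pattern similarity is 1, and the weighted overall similarity is (1 × 0.6 + 3 × 1) / 4 = 0.9.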
The system retrieves the updated stock data and converts them
into case knowledge. It then studies the new current status of the
cases and appends the new result set into the result database for
users’ direct query. The system can be set to refer to three
successive cases within a series of stock cases as one complete
CASE for the purposes of prediction. Further exploration in this
area is ongoing. Typical examples can be obtained from our
previous work [26,27]. In addition, hybrid approaches such as
support vector machines
with adaptive parameters (e.g. [28]) and evolutionary fuzzy neural
networks (e.g. [32]) may help improve financial
forecasting. This will be the subject of future research.
Acknowledgement
The authors would like to acknowledge the partial support of
the Hong Kong Polytechnic University via CRG grant G-T375.
References
[1] G. Zhang, B.E. Patuwo, M.Y. Hu, Forecasting with artificial neural
networks: the state of the art, Int. J. Forecasting 14 (1998) 32–62.
[2] M. Austin, C. Looney, J. Zhuo, Security market timing using neural
networks, New Rev. Appl. Expert Syst. (2000).
[3] P.K.H. Phua, X. Zhu, C.H. Koh, Forecasting stock index increments using
neural networks with trust region methods, in: Proceedings of the International Joint Conference on Neural Networks, vol. 1, 2003, pp. 260–265.
[4] S. Heravi, D.R. Osborn, C.R. Birchenhall, Linear versus neural network
forecasts for European industrial production series, Int. J. Forecasting 20
(2004) 435–446.
[5] M. Funabashi, A. Maeda, Y. Morooka, K. Mori, Fuzzy and neural hybrid
expert systems. Synergetic AI, IEEE Expert (1997) 32–40.
[6] H.S. Ng, K.P. Lam, S.S. Lam, Incremental genetic fuzzy expert trading
system for derivatives market timing, in: Proceeding of IEEE 2003
International Conference on Computational Intelligence for Financial
Engineering, Hong Kong, 2003.
[7] M. Mohammadian, M. Kingham, An adaptive hierarchical fuzzy logic
system for modeling of financial systems, Intell. Syst. Account. Financ.
Manag. 12 (1) (2004) 61–82.
[8] K. Boris, V. Evgenii, Data Mining in Finance — Advances in Relational
and Hybrid Methods, Kluwer Academic Publishers, 2000.
[9] T. Plummer, Forecasting Financial Markets, Kogan Page Ltd., 1993.
[10] S. Weinstein, Stan Weinstein’s Secrets for Profiting in Bull and Bear
Markets, McGraw Hill, 1988.
[11] R. Kwong, Intelligent web-based agent system (iWAF) for e-finance application, MPhil thesis, The Hong Kong Polytechnic University, 2004.
[12] E. Gately, Neural Networks for Financial Forecasting — Top techniques
for Designing and Applying the Latest Trading Systems, Wiley Trader’s
Advantage, 1996.
[13] R. Bensignor, New Thinking in Technical Analysis, Bloomberg Press,
2002.
[14] N.B. Thomas, Encyclopedia of Chart Patterns, John Wiley & Sons, 2000.
[15] J.N.K. Liu, R. Kwong, Chart patterns extraction and recognition in CBR
system for financial forecasting, in: Proceeding of the IASTED International Conference ACI2002, Tokyo, Japan, 2002, pp. 227–232.
[16] Y.Y. Tang, L.H. Yang, J.N.K. Liu, H. Ma, Wavelet Theory and Its
Application to Pattern Recognition, World Scientific Publishing, River
Edge, NJ, 2000.
[17] S. Mallat, Multiresolution approximations and wavelet orthonormal bases
of L2(R), Trans. Am. Math. Soc. (1989) 69–87.
[18] R.G. Donaldson, M. Kamstra, Forecast combining with neural networks,
J. Forecasting 15 (1996) 49–61.
[19] M. Adya, F. Collopy, How effective are neural networks at forecasting and
prediction? A review and evaluation, J. Forecasting 17 (1998) 481–
495.
[20] A. Kanas, Non-linear forecasts of stock returns, J. Forecasting 22 (2003)
299–315.
[21] J. Park, I.W. Sandberg, Universal approximation using radial basis function networks, Neural Comput. 3 (1991) 246–257.
[22] J. Park, I.W. Sandberg, Approximation and radial basis function networks,
Neural Comput. 5 (1993) 305–316.
[23] F.J. Chang, J.M. Liang, Y.-C. Chen, Flood forecasting using RBF neural
networks, IEEE Trans. SMC Part C 31 (4) (2001) 530–535.
[24] J.T. Tou, R.C. Gonzalez, Pattern Recognition Principles, Addison Wesley,
Reading, MA, 1974.
[25] W.R. Uttal, T. Baruch, L. Allen, The effect of combinations of image
degradations in a discrimination task, Perception Psychophys. 57 (5)
(1995) 668–681.
[26] J.N.K. Liu, T.T.S. Leung, A web-based CBR agent for financial forecasting a workshop program, in: Proceeding of the 4th International Conference on Case-Based Reasoning, Vancouver, CA, (2001), pp. 243–253.
[27] Y. Li, S.C.K. Shiu, S.K. Pal, J.N.K. Liu, Case-base maintenance using soft
computing techniques, in: Proceedings of the Second International Conference on Machine Learning and
Cybernetics, Sheraton Hotel, Xi’an, China, 02–05 November 2003,
pp. 1768–1773.
[28] L.J. Cao, F.E.H. Tay, Support vector machine with adaptive parameters in
financial time series forecasting, IEEE Trans. Neural Networks 14 (6)
(2003) 1506–1518.
[29] P. Blakey, Pattern recognition techniques [in stock price and volumes],
IEEE Microwave Mag. 3 (1) (2000) 28–33.
[32] L.Y. Yu, Y.-Q. Zhang, Evolutionary fuzzy neural networks for
hybrid financial prediction, IEEE Trans. SMC Part C 35 (2) (2005)
244–249.