Amy H. Salman*, Nur Ahmadit, Richard Mengko+, Armein Z. R. Langi, Tati L. R. Mengko'
Department of Electrical Engineering, School of Electrical Engineering and Infonnatics
Bandung Institute of Technology, 11. Ganesha No. 10 Bandung, 40132, Indonesia
Email: {*amy.tnurahmadi}@stei.itb.ac.id.+richard@mengko.com.{annein.z.r.langi tati}@stei.itb.ac.id
Abstract-This paper presents the performance analysis and mainly used for image denoising, was proposed by [9] for
comparison of three denosing methods for heart sound signal denoising heart sound signal. Empirical mode decomposition
based on wavelet transform (WT), total variation (TV), and (EMD), a relatively new non-linear and non-stationary signal
empirical mode decomposition (EMD). Extensive simulations are analysis method [11], offers interesting feature of adaptive and
performed using normal and abnormal heart sound data and the
data-driven decomposition capability. The application of EMD
performance is evaluated in terms of signal-to-noise ratio (SNR),
denoising method to heart sound signal may lead to useful
root mean square error (RMSE), and percent root mean square
difference (PRD). The simulation results show that EMD based
results.
denosing method outperforms two other methods.
In this paper, the perfonnance of these methods for heart
Keywords-Heart Sound, Denoising, Wavelet Transform, Total sound signal denoising is evaluated and compared. The qual
Variation, Empirical Mode Decomposition itative evaluation is performed by visual inspection while
quantitative evaluation is carried out by using three standard
1. INT RODUCT ION
metrics: signal-to-noise ratio (SNR), root mean square error
(RMSE), and percent root mean square difference (PRD).
Cardiovascular disease (CVD) has long been the lead The simulation results show a superior performance of EMD
ing cause of death throughout the world. Accroding to the denoising method over others.
American Heart Association, in 2010 alone, more than 2150
Americans die due to cardiac disease every single day, with This paper is organized as follows. Section II describes
an average of 1 death every 40 seconds [1]. To reduce the the theoritical background and mathematical notation used in
death rate, an early accurate diagnosis of CVD is of paramount wavelet transfonn (WT), total variation (TV), and empirical
importance. mode decomposition (EMD) denoising methods. Simulation
results and perfonnance evaluation both qualitative and quan
An intelligent computerized cardiac auscultation utiliz titative are given in Section m. Finally the conclusion is drawn
ing advanced signal processing based on phonocardiography in Section IV.
(PCG) offers low-cost, non-invasive, and accurate diagnosis
and classification of heart diseases. PCG refers to the trac
ing technique of heart sounds and the recording of cardiac II. DENOISING MET HODS
435
2015 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) November 9-12,2015
way dblO
denotes a complex conjugate and 1j;a b(t) serves as
(mother) wavelet function
where *
,
the basis functions which are obtained by scaling (dilation)
and shifting (translation) a wavelet function 1j;(t) (also called
as mother wavelet) with scaJe a and time b, respectively, as of signal x(n) with threshold T is expressed as follows
shown below
(3)
_ _ {
x (n) - 0x(n),, Ix(n)1 T
Ix(n)1 < T (5)
CWT performs continuous shifting over a continuously scal MATLAB has built-in function wden for wavelet denoising
able function of a signal and caJculates the correlation between as described in [6]. Several input parameters and their setting
them. This continuous process will result in highly redundant for this function are explained in Table I where rigsure
wavelet coefficients and large computation time. To remove represents the selection using the principle of Steins Unbiased
this redundancy and enable faster computation, DWT is used Risk Estimate (SURE), h means hard thresholding, and mIn
instead by modifying the wavelet representation as follows denotes threshold rescaling using a level-dependent estimation
of the level noise.
(4)
B. Tot al Variation Denoising
where ao (fixed, > 1) and bo (> 0) represent the fundamental Total Variation (TV), invented by Rudin, Osher, and Fatemi
scaJe factor and time shift, respectively, and m and n are [7], is a powerful denoising method which has great benefit
integers multiple of those two parameters. of preserving and enhancing important features such as edge
Wavelet transform has been widely used for denoising in images. Despite being commonly used for image denoising,
the corrupted heart sounds signaJ. Wavelet denoising method it can also be used for I-D signal denoising as proposed by
comprises three steps: decomposition, thresholding, and re [8]. Varghees and Ramachandran [9] employed TV method for
construction. The noisy signaJ is firstly decomposed using removing noise embedded in heart sound signals.
appropriate (mother) wavelet function 1j;(t) and selected de TV denoising approximates the signaJ y( n) from the noisy
composition level N. The obtained wavelet coefficients are signal x(n) by solving the optimization problem:
thresholded so that the irrelevant components (noise) are
removed and the relevant components (infonnation) are kept. fj argmax Ilx - YII + AII(Y)lll' (6)
y
=
: !f . 1
posed by Huang in 1998 [11], has gained popularity as data
anaJysis method especiaJly for non-stationary and non-linear
signaJs such as biomedicaJ (including heart sounds) signaJs.
EMD, in contrast to other methods such wavelets and fourier
which require predefined basis function, is fully data-driven
o 5 10 15 20 method that does not require any a priori known basis. EMD
adaptively decomposes a signaJ into a series of simple oscilla
Fig. l. Daubechies db to wavelet function tory AM-FM components caJled as intrinsic mode functions
436
2015 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) November 9-12,2015
(IMFs) through iterative procedure (known as sifting). An oscillation modes (low frequency). Figure 2 shows signal x (n)
IMF is defined as a function that satisfies two conditions: the decomposition into 10 IMFs and final residue signal.
number of extrema (maxima and minima) and zero crossing
must be equal or differ by at most 1; and the average value One of main applications which utilizes the characteristics
of the envelopes derived from local maxima and minima is of EMD technique is signal denoising. EMD, whose decom
(approximately) zero. position is based on elementary substractions, enables perfect
reconstruction of a signal. The EMD denoising method starts
Even though EMD still lacks a solid mathematical founda by identifying which IMFs carry dominantly noise and which
tion which could be used for theoritical analysis and perfor IMFs contain primarily useful information. This is done by
mance evaluation, it is oftenly proven to provide interesting comparing the actual energy density with the estimated energy
and useful results. The sifting procedure of EMD for decom density (to form noise-only model [12]) of IMFs. The actual
posing the signal x (n) into IMFs is systematically described energy density of IMFs is calculated as follows
as follows:
1 N .
1) Specify all the local extrema (maxima and minima) Ei= L h' (n), i=1,2,3,L (8)
of x (n) Nn
l =
-6
I \ - e - Real Energy
I
-8
,, I
\
\
\
----+-- Estimated Energy
'es
-9
-10
i
!!! -11
N
-12
-13
-14
-15
-16
1 10
IMF
Fig. 2. Signal decomposition using EMD Fig. 3. Semilog diagram of real vs estimated energy density of [MFs
437
2015 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) November 9-12,2015
3ot=H:5t', =====,,::
: ===
:: ==1ft *
'= : ====J :.w. ,vlf--
: 1
t:"': _FF--3
== J!"====3 ", ==='Wo:
II--=J1
0.5
Noisy Signal
1.5
Noisy Signal
1 1.5 o 0.5
Noisy Signal
1 1.5
-'6b\ , 'v-
, 1
0--0.5=-- --- ---C' ---'.5
-
. 1 Wavelet Denoising w Wavelet Denoising w Wavelet Denoising
1
oH 1 30v.- -- 11.5 f-'
'0
"E -1
J
0 0.5 1.5 0.5 1 0 0.5 1 1.5
TV Denoising TV Denoising TV Denoising
jH ' 1 -1 -'
o 0.5
EMD Denoising
1 1.5 0'----
-- --0.-5
c: ----
EMD Denoising
----,.5 0 ---0.5=-- ---' ----'.5
EMD Denoising
-168 ,*:==:J
o 3f=:::"===I
0.5 1.5
-16''1=FE=
0 0.5
"
=If--
l 1.5 -16
oE
=
0.5
E
W3f---
1
=:J1--- 1.5
Time Time Time
Ca) Normal SI S2 with SNR 0 dB (b) S3 Gallop with SNR 5 dB (c) S4 Gallop with SNR LO dB
scheme which follows the hard thresholding is expressed as The qualitative performance evaluation of three denoising
follows methods were performed by visual inspection and comparison.
Figure 4 presents the input clean signal, noisy signal, and
Ihi(rj)1 > Ti (11) denoised (reconstructed) signal using wavelet, TV, and EMD
Ihi(rj)1 :::; Ti denoising methods. Only three simulation cases (different input
where hi([zj zj+lD represents the samples from time instant signal and SNR value) are shown. Based on Figure 4, it is
shown that EMD denosing method performs better among
zj to Zj+l of tth IMF. The threshold value used in this scheme others with the fact that its denoised signal is the most similar
is expressed below
(resembling) the original signal. If we look closely and zoom in
Ti = C ylV;2 InN, i = 1, 2, 3, L (12) this figure, we will know that the amplitude of denoised signal
by TV method are slightly reduced. Although denoised signal
where C is 0.1 found by empirical simulations and V; is esti by wavelet method keeps the amplitude of its main components
mated energy density (variance) of ith IMF. This thresholding almost the same as original one, the amplitude outside the main
scheme which is inspired and adapted from wavelet [14] will components interval is changed.
set to zero all the samples from time instant [zj zj+ll if the
single extrema amplitude below the theshold value meaning The quantitative performance are evaluated based on three
that there is no useful information (only noise) in the specified metrics namely signal-to-noise ratio (SNR), root mean square
time instant. Otherwise, all the samples will be retained. error (RMSE), and percent root mean square difference (PRD),
which are calculated as follows:
The final signal reconstruction can be obtained by summing
up all the included IMFs (whose actual energy density ex SNR = 10 log 10 2:=:=1 [y(n)]2 (14)
ceeding its estimated estimated energy as described previously) 2:=:=1 [y(n) - y (n)J2
using the following formula
q
RMSE =
2:=:=1 [y(n) - y (n)J2 (15)
y= L h(n) (13) N
i=p
where p and q indicates the lowest and highest index of
included IMF. PRD = (16)
III. RESULTS AND AN ALY SIS where y(n) denotes the clean original signal, y (n) refers to the
denoised (reconstructed) signal, and N represents the length
To evaluate the qualitative and quantitative performance
of the signal.
of these three denoising methods, we performed a number
of simulations using a simulated heart sound data obtained SNR is defined as the ratio of the power of a signal
from University of Michigan's Heart Sound & Murmur Library (useful information) and the power of noise (irrelevant signal).
[15]. In this simulations, we used three types of heart sound RMSE is used to measure the accuracy of denoising method
signals encoded in 44100 Hz sample rate and 16 bits/sample.
The detailed information including the environmental setting
TABLE II. HEART SOUND SIGNALS USED FOR SIMULATION
when the heart sounds were recorded and their corresponding
diagnosis results is shown in Table II. White Gaussian noise
Name Recording Setting Diagnosis
with SNR 0,5,10,15, and 20 were added to the clean input
signal y(n) to form noisy signal x ( n). The simulations were
Normal Sl S2 Apex, Supine, Bell Normal
S3 Gallop Apex, Left Ducubitus, Bell Both Normal and Cardiomyopathy
carried out using MATLAB which runs on dual-core CPU S4 Gallop Apex, Left Ducubitus, Bell Left Ventricular Hypertrophy
@2.2 GHz environment.
438
2015 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) November 9-12,2015
26 0.045 45
..
24 40
0.040
22 /" 35
"
.
0.035
.
20
to 0.030 30
_
Z
18
if) 16
/*.
../.-
..
0.025 25
0
'5
r:: 0.020
:.---- . -- . '"
"- 20
.
% 14
0
12 0.015 15 ...-- .-
10 /' 0.010 : 10
!
O.OOS
:
10 15 20 10 15 20 10 15 20
Input SNR Level (dB) Input SNR Level (dB) Input SNR Level (dB)
Fig. 5. Perfomlance comparison of denoising methods over (a) Nomlal Sl S2 (b) S3 Gallop and (c) S4 Gallop heart sounds signal
in preserving the quality of information in the denoised signal SNR value of three denoising methods for Normal SI S2 heart
by calculating the sample standard deviation of the differences sound signal with respect to different input SNR levels. It is
between denoised signal and original signal. PRO is frequently shown that EMO method (green line) is always above WT
used as a method of quantifying the distortion or the difference (black line) and TV (red line) methods indicating that EMO
between the original and the reconstructed signal. The PRO methods consistently produces largest output SNR value. The
indicates reconstruction fidelity by point wise comparison with comparative RMSE value vs input SNR level for S3 gallop
the original data. A denoising method is said to perfom better heart sound signal is shown in Figure 5b. It is shown that EMO
if at a particular input SNR, the value of output SNR is larger method (green line) is always below WT (black line) and TV
while the value of RMSE and PRO are smaller. (red line) methods indicating that EMO methods consistently
produces smallest RMSE value. The comparative PRO value
Comparative simulation results of three denoising methods over different input SNR levels for S4 gallop heart sound signal
(WT, TV, and EMO) on the basis of SNR, RMSE, and PRO is plotted in Figure 5c, which clearly shows EMO method's
are shown in Table III. It is shown that for all three tapes superiority over WT and TV methods.
of heart sound with the addition of white Gaussian noise (0
dB, 5 dB, 10 dB, 15 dB, and 20 dB), EMO denoising method IV. CONCLUSION
consistently yields larger SNR value, and smaller RMSE and
PRO values. For example Normal S1 S2 heart sound signal, Analysis and comparison of heart sound signal denoising
EMO method shows SNR value 10.0415 dB, RMSE 0.037586 methods based on wavelet transform (WT), total variation
and PRO 31.4716 % where as WT (TV) method shows 9.825 (TV), and empirical mode decomposition (EMO) is presented
(7.9689) SNR, 0.038535 (0.047716) RMSE, and 32.2665 % in this paper. White Gaussian noises with input SNR level 0
(39.9537 %) PRO for 0 dB input SNR level. The performance dB, 5 dB, 10 dB, 15 dB, and 20 dB are artificially added
of EMO method for S3 gallop and S4 gallop over input SNR to clean original normal and abnormal heart sound signals
level range (0 dB - 20 dB) is superior as well compared to obtained from the University of Michigan Health System.
WT and TV methods. Based on extensive simulations, EMO based denoising method
consistently yields best performance in terms of three standard
Graphical visualization of comparative simulation results is metrics: signal-to-noise ratio (SNR), root mean square error
depicted in Figure 5. Figure 5a shows the comparative output (RMSE), and percent root mean square difference (PRO).
439
2015 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) November 9-12,2015
REFE RENCES
440