Electrocardiogram beat classification using deep convolutional neural network techniques

Zafer Cömert; Yaman Akbulut; Muhammed H Akpinar; Ömer F Alçin; Ümit Budak; Muzaffer Aslan; Abdulkadir Şengür

doi:10.1088/978-0-7503-3279-8ch12

Modelling and Analysis of Active Biopotential Signals in Healthcare, Volume 1

Chapter 12

Electrocardiogram beat classification using deep convolutional neural network techniques

Zafer Cömert, Yaman Akbulut, Muhammed H Akpinar, Ömer F Alçin, Ümit Budak, Muzaffer Aslan and Abdulkadir Şengür
Published August 2020 • Copyright © IOP Publishing Ltd 2020
Pages 12-1 to 12-26

Download PDF chapter

Download ePub chapter

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

Download complete PDF book, the ePub book or the Kindle book

Chapter navigation

Modelling and Analysis of Active Biopotential Signals in Healthcare, Volume
1 book home

Preview

Export citation and abstract

BibTeX RIS

Permissions

Get permission to re-use this book

Abstract

In the literature, it can be seen that various advanced signal processing and machine learning techniques and deep learning algorithms have been employed for electrocardiogram (ECG) beat categorization. These methods were generally based on either the time domain or frequency domain. Time–frequency based techniques have also been proposed for ECG beat classification. In this chapter, a different model is proposed for the ECG beat classification task. In the proposed approach, the ECG beats are initially represented by images. Instead of using a time–frequency approach for converting the ECG beats to ECG images, we opt to use the ECG beats directly to construct the ECG images. In other words, the ECG beat values are directly saved as ECG images.

The electrocardiogram (ECG) is a useful method which enables the monitoring of various cardiac conditions, such as arrhythmia and heart rate variability (HRV). ECG beats help to determine various heart failures such as cardiac disease and ventricular tachyarrhythmia. In the literature, it can be seen that various advanced signal processing and machine learning techniques and deep learning algorithms have been employed for ECG beat categorization. These methods were generally based on either the time domain or frequency domain. Time–frequency (T–F) based techniques have also been proposed for ECG beat classification. In this chapter, a different model is proposed for the ECG beat classification task. In the proposed approach, the ECG beats are initially represented by images. Instead of using a time–frequency approach for converting the ECG beats to ECG images, we opt to use the ECG beats directly to construct the ECG images. In other words, the ECG beat values are directly saved as ECG images. Three deep convolutional neural network (CNN) approaches are considered in ECG beat classification. These approaches ensure end-to-end learning schema, fine-tuning of pre-trained CNN models, extraction of deep features and their classification using a traditional classifier, such as the support vector machine (SVM) or deep machine learning approaches. The well-known MIT-BIH arrhythmia database is considered in the evaluation of the proposed deep learning approaches. The database is separated into two sets, the training and test dataset in proportions of 75% and 25%, respectively. The experimental results are evaluated using the classification accuracy score. The results show that the proposed methods have potential for use in ECG beat classification.

12.1. Introduction

Electrocardiography (ECG) is utilized as a standard means for monitoring and diagnosing cardiovascular diseases and it has an important role in routine clinical practice. Using this non-invasive tool, the electrical cardiac activities covering the depolarization and repolarization patterns of the heart can be observed on an ECG trace [1]. An ECG trace reflects the time-varying signal changes that depend on the ionic current flow that causes the cardiac fibers to contract and subsequently relax. The changes in potential between the electrodes placed on the surface of the skin can be observed via ECG [2]. A single and routine period of an ECG presents the sequential atrial and ventricular depolarization as well as repolarization. These continuous cycles consist of significant peaks that reflect the morphological structures underlying specific peaks, denoted by P, Q, R, S and T [3]. This means that a heartbeat creates a specific pattern on the ECG trace. This specific pattern occurs when the signal moves away from the baseline heart rate and then returns to the baseline heart rate level. A single sinus rhythm of the ECG traditionally includes the above-mentioned peaks, as shown in figure 12.1.

**Figure 12.1.** The morphological structure of one PQRST-complex of a single ECG cycle.
Download figure:
Standard image High-resolution image

The depolarization of the atria, which causes depolarization waves to spread from the sinoatrial (SA) node to the atria, increases the voltage measured from the heart surface, causing it to diverge from the baseline heart rate, and this wave is referred to as a P-wave [4]. The time between the events of atrial depolarization and ventricular depolarization is the PQ-interval. The QRS-complex is one of the most vital patterns in computerized heart rate analysis and it is frequently used for basic heart rate analysis. The QRS-complex produced the largest amplitude in an ECG cycle [5]. The ventricles become depolarized before their contractions and this depolarization forms the QRS-complex. Atrial repolarization appears just before the QRS-complex. However, the following wave pattern, which is referred to as the QRS-complex, has a wider amplitude. Therefore, atrial repolarization cannot be observed on an ECG trace [6]. The time between the beginning of the ventricular depolarization and end of the ventricular repolarization overlaps the QT-interval on the ECG trace. It has been shown that the QT-interval increases due to the linear increase of the RR-interval in experimental studies. Also, a prolonged QT-interval has been related to late ventricular repolarization and this cardiac ailment may lead to sudden death [7]. The time interval between the S-wave and T-wave is referred to as the ST-interval. High or depressive patterns far from the baseline are consistently considered as symptoms that indicate heart diseases. The relief time elapsed until the next ECG cycle, in other words, the ventricular repolarization, is shown as a T-wave on the ECG signal [8]. ECG signals can have undesired artefacts for different reasons, such as patient or electrode movements, displacements of the electrodes and power-line noise [9]. To achieve useful clinical information, noisy ECG signals must be converted into reliable signals using preprocessing procedures [10, 11]. To this aim, many algorithms have been proposed for R-peak detection [12–14], QRS detection [15–18], QT-interval detection [3, 19], RR-interval detection [20–23] and ST-interval detection [8, 24, 25] to provide useful clinical information.

Cardiac arrhythmia is a medical term and corresponds to a condition which reflects irregularities in heart rate rhythm, where it may be quite slow or fast [1]. Disruptions of the electrical impulses associated with coordination of the heartbeat cause arrhythmia. In other words, arrhythmia is defined as discomfort that includes rhythm irregularity or conduction problems in electrical impulses through the heart. It can occur in the upper or lower chamber of the heart. Arrhythmias of the ventricles may lead to life-threatening events. Various types of arrhythmias, which demonstrate quite different characteristics, have been defined [26]. Each of these types of arrhythmias is associated with a pattern whose morphological characteristics have been identified. Therefore, it is possible to define and classify arrhythmia types using a computer-aided approach. The arrhythmias are categorized into two basic classes. The irregularities associated with the electrical conductivity of the heart is the first class. This category of arrhythmia corresponds to arrhythmias caused by a single irregular vessel shot. The other category includes arrhythmias consisting of a series of irregular heartbeats. These irregularities cause changes in morphology or wave shape and frequency. A large number of these changes can be observed though ECG examination. A normal heart rhythm where each specific peak can be observed properly is defined as the normal sinus rhythm (NSR). During NSR, the triggering pulses from the sinoatria spread in a coordinated manner across the four chambers of the heart. In addition to the NSR, there are also different common arrhythmia types. The source of the NSR beats is the SA node. The regular heart rate can vary due to autonomous inputs occurring in the SA node. Premature beats are a widely observed cardiac disease and are harmless. Patients with this type of arrhythmia feel a beating or missing heartbeat in the chest. Premature beats also include premature atrial contractions (PACs) and premature-ventricular contractions (PVCs) [27]. Supraventricular arrhythmias occur in the atria, with fast heart rates, and some of them are atrial fibrillation (AF), atrial flutter, paroxysmal supraventricular tachycardia (PSVT) and Wolff–Parkinson–White syndrome [28]. Ventricular arrhythmias (VAs) include ventricular flutter, ventricular tachycardia (VT) and ventricular fibrillation (VF). These types of arrhythmias are life-threatening and therefore require quick intervention [29]. Sinus node dysfunction is related to problems associated with the SA node. A slow heart rhythm is observed in this arrhythmia and patients use a pacemaker to tackle this type of arrhythmia [30]. Heart block problems may appear in the AV node or HIS Purkinje system because of incomplete electrical activity [31]. Slow and irregular heartbeats are then observed. To tackle this condition, using a pacemaker is recommended. Arrhythmia diagnosis based on ECG beats has great importance for cardiologists [32]. A broad range of cardiac conditions covering arrhythmia and HRV can be observed on ECG traces [33]. However, even for specialists in the field, it is very difficult to examine, define and categorize a large number of ECG beats. In addition, the possibility of missing vital information is high due to the large scale of the data. As a practical solution, computer-aided systems have been adopted and have become quite important in daily clinical practice [1]. In this view, automatic ECG beat classification has been considered as an important issue in addressing cardiac ailments.

In computer-aided approaches, the two basic procedures of feature extraction and classification are common. The performance of such systems is sensitive and depends on specifically designed features as well as the fine-tuning of the classifiers. Time-domain features have been used for ECG analysis and have been adopted as a useful approach. In addition, the frequency domain holds great clues regarding HRV. Furthermore, the time–frequency domain have been used to describe large-scale ECG data [34]. The morphological and higher-order statistic [35] diagnostic indices have been measured using advanced digital signal processing techniques in the feature extraction stage, such as principal component analysis (PCA) [3, 36, 37], wavelet transforms (WT) [38, 39], Hilbert transforms [40], cross-correlation approaches [41] and the Kalman filter [42]. Support vector machines (SVMs) [43] have ensured high general performance. The extreme learning machine (ELM) has contributed to shortening the training time [44]. Similarly, the k-nearest neighbors (k-NN) method [38], linear discriminant analysis (LDA) [45], artificial neural network (ANNs) [46] and probabilistic neural network (PNN) [47] have also been used for classification purposes. Optimization techniques such as particle swarm optimization (PSO) [48], artificial bee colony [49] and genetic algorithms [50] have been embedded into the proposed models for automatic ECG beat classification tasks [33]. The permutation entropy (PE) and the conditional entropy of ordinal patterns (CEOP) have been applied to ECG signals to realize a binary classification [51]. Deterministic learning schema have been applied to the ECG beat classification task. To this aim, the proposed model exploited the dynamics structure of the ECG beats as a unique feature. In this approach, the classification was carried out without using any feature extraction procedure for the test beats. Also, the model presented good generalization capacity [33]. The enormous differences between individual patients were addressed in another study focused on ECG beat classification. An optimization mechanism was proposed to enhance the achievements of the model. Recurrent neural networks (RNNs) have been employed to learn features automatically and promising results have been reported [52]. To prevent sudden cardiac death, a computer-aided diagnosis model based on spectral entropy and a deep convolutional neural network (DCNN) was introduced. The time–frequency transform and entropy measurement were dealt with in this study. Furthermore, two-directional two-dimensional PCA was utilized to reduce the number of features used. The study points out that the spectral entropy can provide good separation between different ECG beats [34]. A novel model was built on the SVM with rejection schema. Traditional procedures, such as preprocessing, feature extraction and classification, applied in machine learning approaches were followed in the proposed model. As a result, a cost-sensitive classifier was provided [53]. Vector quantization (VQ) has been used in ECG beat classification, since this method increases the classification accuracy while reducing the dimension of the feature set. To this aim, a novel dictionary learning algorithm was presented. This study resulted in an accelerated algorithm and a model with increased accuracy [54]. The ECG beat classification task has also been handled as a binary classification task between normal and abnormal. A method relying on the radial basis function (RBF) system for canceling ectopic beats was proposed. The model yielded good results in terms of the low distortion of the signal recordings [55]. A neuro-fuzzy network model fed with autoregressive model coefficients, higher-order cumulants and the WT has been used for ECG beat classification. The proposed model provided a significant performance enhancement in ECG beat classification [56]. In addition to the normal beat type, four different beat types were addressed in a novel model exploring RNNs trained with the Levenberg–Marquart algorithm. The combined eigenvector method and RNN was reported as a useful tool for ECG beat classification [57]. A combination of PSO and RBF neural networks was used to separate six different ECG beat types. The morphological features were taken into account in the model. As a result, an efficient diagnostic model was provided [58]. Six discriminative ECG beat classification problems were addressed using a PNN and WT. The model was fed with only eleven features and an effective and efficient model was achieved [59]. A novel model, called a modified mixture of experts (MME) network, was used in ECG beat classification with diverse features and a supervised learning approach with the expectation–maximization (EM) algorithm was used in the model. The model used handcrafted features extracted using wavelet coefficients and Lyapunov exponents. The proposed MME approach was found to be a useful tool in categorizing ECG signals for early heart abnormality detection [60]. An automated medical diagnostic tool based on a cross-correlation and least square support vector machine (LS-SVM) was developed for ECG beat classification which covers normal beats, PVC beats and other beats. The performance of the proposed model in this three-class classification scheme was reported to have 95.51% accuracy [61]. In another study, ECG beat classification was performed using a novel independent components (ICs) arrangement strategy and independent component analysis (ICA). The PNN and SVM classifiers were employed to evaluate the proposed method. Eight different ECG beat types were considered in the experiment and a 98.7% classification accuracy was achieved [62].

The ECG beat classification task has mainly been addressed in two different ways: the global-classifier and patient-adapting approaches. In the first approach, the experience of specialists is not considered and the classification is performed automatically [52]. The desired performance has not been achieved in this approach since a large-scale inter-individual variability has been observed in ECG waveforms. The second approach focuses on the ECG beat classification task by considering the specific characteristics of each patient's ECG recording to enhance the accuracy of automated systems [51]. However, this enhancement has remained limited since there are different temporal and physical conditions for patients as well as variations in the morphological characteristics of ECG waveforms. ECG signals generally display their characteristic features periodically. Sometimes differences arise in periodic ECG beats and sometimes in the morphological structure of the ECG. These enormous time-varying patterns are marked as temporal or dynamic [63]. For this reason, the recognition of this temporal pattern is very difficult. Also, the presence of notable similarities between different temporal ECG beats is a challenge that remains to be solved [64].

As mentioned above, many different combinations of advanced signal processing and machine learning techniques have already been proposed for the ECG beat classification task [65–67]. Recently, a novel approach called the deep convolution neural network (CNN), a sub-branch of machine learning, has become attractive due to its generalizability on large-scale real-world problems [68]. The great advantage of CNN models is the end-to-end learning scheme [69]. In this way, the tedious and complex feature extraction step is eliminated and the features representing the characteristic and discriminative features of the input data can be obtained using activation maps [70]. Although CNN models require large-scale data and high calculation costs for the training, they have been adopted as an efficient method due to the highly generalizable performance provided.

In the proposed approach, pre-trained CNN models are used and the fine-tuning procedure is performed to transfer domain knowledge. Then, the deep features extracted from the activation maps in the deep CNN architecture are input to one of the traditional machine learning algorithms called SVMs. In the training of the CNN models, raw ECG beat images are considered and five ECG beat types, namely normal (N), left-bundle-branch-block (LBBB), right-bundle-branch-block (RBBB), premature-ventricular-contraction (PVC) and paced beats (PBs) are taken into account in the experiments [71]. As a result, a novel ECG beat classification diagnostic tool with high sensitivity is produced via the proposed model.

The rest of this chapter is organized as follows: the dataset description and methods are provided in section 12.2; the experimental work and results are reported in section 12.3; a comparison is presented in section 12.4; and, lastly, concluding remarks are presented in section 12.5.

12.2. Material and methods

12.2.1. The MIT-BIH database

The MIT-BIH ECG arrhythmia database was the first publicly available database on arrhythmia and was collected by Beth Israel Hospital (BIH) between 1975 and 1979 and has been distributed by MIT since 1980 [72]. The MIT-BIH database contains 48 half-hour ECG records obtained from 47 patients, including 25 men aged 32–89 years and 22 women aged 23–89 years. The database consists of two parts. There are 23 patient records in the first part and 25 patients in the second part. The database contains 116 137 QRS complexes. The ECG signals were sampled at 360 samples per second with an 11 bit resolution over a 10 mV range and were band-pass filtered at 0.1–100 Hz. Each record in the database has been annotated independently by two or more cardiologists.

12.2.2. Producing ECG beat images

In this study, as one-dimensional (1D) ECG record signals are converted into two-dimensional (2D) images, we need to crop each ECG beat region and create ECG beat images as a separate input into the machine learning approaches. To do this, we fix the ECG beat image size in a rectangular shape. The first step is to obtain the signal values of the ECG beats by reading the record files. The format of the record files is comma-separated values (CSVs). Each record file has nearly 650 000 records for the 30 min duration and 360 records per second. To label the ECG beat type, we need to read annotate files that have the 'txt' file format. The second step is to determine what the ECG beat image size should be. While producing the ECG beat image, we need a reference point in the signal. The peak value of each signal was selected as a reference point and the width of the ECG beat image is determined by the sum of two parts of the signals. The first part is the points which are before the peak value and the second part is the points which are after the peak value. While producing the ECG beat image, 90 signal values are used for the part before the peak value and 150 signal values for the part after the peak value to define the signal as lossless.

The flowchart for producing the ECG beat images is shown in figure 12.2. The preprocessing block of the flowchart in figure 12.2 has parameters such as sig1, wden, sym8, peak values, beat types and size of the images. sig1 is the raw signal of a patient record, wden is a function that denoises the raw signal using WT, sym8 is the name of the wavelet function, lev is the level of wavelet function and its value is six in this process, peak value and beat type are obtained through annotate files. The peak value is the highest point in each beat and the beat type is the label.

There are 23 symbols including beat and non-beat types in the MIT-BIH ECG database. The symbols of the beat types and non-beat types are !, ], ∣, [, ∼, +, A, a, E, e, F, f, J, j, L, . or N, / or p, Q, q, R, S, V and x. The beat types used in this study are L, N, p, R and V. They are called the left-bundle-branch-block (LBBB) beats, normal (N) beats, paced beats (PBs), right-bundle-branch-blocks (RBBB) beats and premature-ventricular-contraction (PVC) beats. These five beat types are selected due to the huge number of their occurrence. Sample beat type images that are produced using the preprocessing block are shown in figure 12.3.

**Figure 12.3.** Produced ECG beat images after preprocessing: (a) LBBB beat images, (b) N beat images, (c) PB images, (d) RBBB beat images and (e) PVC beat images.
Download figure:
Standard image High-resolution image

In total, 8001 LBBB (symbol L) beat samples, 8067 N (symbol N) beat samples, 7018 PB (symbol P) samples, 7247 RBBB (symbol R) beat samples and 7125 PVC (symbol V) beat samples were used in the experiments to evaluate the proposed method.

12.2.3. Convolutional neural networks (CNNs)

The deep learning approach consists of multiple processing layers combined with multiple abstraction structures to learn the representations of the data. Various preprocessing procedures, dimension reduction and feature selection are performed to reveal these features [73]. To reduce the cost at this stage, it is necessary to avoid the dependence on features. In this way, when designing classifiers and other prediction systems, it will be easier and less costly to extract useful information (discriminative features) from data using artificial intelligence [74]. In this context, the subject of deep learning is closely related to representative learning. Deep learning algorithms have a wide range of applications and have been applied successfully in many different fields. Deep learning is considered a sub-branch of machine learning and performs feature extraction transformation procedures through non-linear layers. The layers feed each other, in other words, the output of one layer can be the input of another layer. Deep learning algorithms can either be supervised or unsupervised and these algorithms generally have a non-linear learning structure. Thus, the representation of the data is performed using distinctive features obtained at different levels [75]. There is a hierarchical learning representation in deep learning and this representation is based on the abstraction of layers at different levels. The data applied as an input in deep learning are passed through a series of processes and learning is realized. Convolution is performed to reveal the distinctive features of the data. This process results in activation maps representing local distinctive features. The power of activation maps to represent data may vary. In other words, some distinctive features may be retained in different activation maps.With layers such as pooling and dropout, calculation costs are reduced and overfitting is prevented [76].

The convolution layer, pooling layer, normalization and fully connected layer form the basic CNN model [77]. Layers are used consecutively to create numerous models for many different tasks. The convolution layers are operated to reveal the discriminative features in the input data. In the equation below, ${X}_{i}^{l-1}$ indicates the features from the previous layers, ${k}_{{ij}}^{l}$ displays the kernels that are learnable and the bias is represented by ${b}_{j}^{l}$ [78]. The output values produced by the feature map are calculated as follows:

$\begin{eqnarray}&&{X}_{j}^{l}=f\left({\sum }_{i\in {M}_{j}}{X}_{i}^{l-1}\,* \,{k}_{{ij}}^{l}+{b}_{j}^{l}\right),\end{eqnarray} \tag{ 12.1 }$

where the input map selection represented by ${M}_{j}$ and $f(.)$ indicates an activation function. With the pooling layer, the calculation cost is reduced and different approaches, such as average pooling or maximum pooling, can be used for this purpose. In CNN models, another advantage of pooling layers is to prevent overfitting. This process is also supported by layers such as dropout layers. In these layers, the downsampling is performed as defined in

$\begin{eqnarray}&&{{\rm{X}}}_{j}^{l}=\mathrm{down}\left({X}_{j}^{l-1}\right).\end{eqnarray} \tag{ 12.2 }$

Here in the related procedure is symbolized by the function 'down'. It can be stated that the number of local distinctive features is reduced [79]. The activation maps processed in previous layers are combined in fully connected (FC) layers. The deep features obtained as a result of all the operations applied in the deep architecture are combined in the FC layers [80]. Optimization algorithms play an important role in the training of deep networks, and their use is inevitable for proper learning to be achieved. Equation (12.3) below represents an optimization algorithm. For optimization purposes, stochastic gradient descent with momentum (SGDM) and adaptive moment estimation (ADAM) are preferred frequently due to their performance on real-world problems [81]:

$\begin{eqnarray}&&{V}_{t}=\beta {V}_{t-1}+a{\nabla }_{w}L(W,X,y),\end{eqnarray} \tag{ 12.3 }$

where $W$ denotes the weights, and $L$ and α show the loss function and learning rate, respectively. During the training of the CNN, the new weights are computed as follows:

$\begin{eqnarray}&&W=W-a{V}_{t}.\end{eqnarray} \tag{ 12.4 }$

The ADAM optimization algorithm performs weights within each iteration, taking into account the average of the second momentum. It also uses the average first moment in the RMSProp method [82].

12.2.4. Deep transfer learning (DTL)

Transfer learning is a learning approach that examines the use of artificial learning systems with different or similar solutions to problems. The transfer learning approach is based on the learning model of human nature. In other words, human beings find solutions to a different event that they encounter by knowing or not knowing their past experiences. Deep transfer learning is an effective approach to solve image classification problems, in particular when there is a finite number of training images. Achieving network weights that represent learning in the deep learning model requires considerable time and computational costs. In the cases where the data are limited, updating the previously obtained weights is adopted as an efficient solution instead of performing network training from scratch, because the computational node numbers mentioned here are at the level of millions. As part of this solution, a modification is made, in particular in the last layers of the network [83].

As mentioned above, the transfer learning approach allows a pre-trained network to be used for a new task. For this purpose, it is based on updating the weights of the network, which is trained on the general problem, providing the connections within the network [84]. Except for the last few layers of the network, all are defined as a new network. Later, the connections and weights are updated considering the number of classes in the studied problem. Training of the network is performed to enable the network to reveal specific information in the new area. Fine-tuning is required to enable deep network learning. In this context, an initial learning coefficient is determined first. The choice of this parameter is crucial to ensure appropriate and valid learning. The initial learning coefficient should be neither too large nor too small [85]. Another important parameter of the transfer learning approach is the mini-batch size. The number of samples that is used to train the network in each epoch is determined by this parameter. Similarly, the maximum number of epochs is one of the important parameters to be specified for transfer learning. Optimization algorithms are used to provide better generalization performance and adapt deep networks to a new area through transfer learning. These algorithms can use various learning schemes. These schemes allow the learning coefficient to be updated as the epoch progresses. Therefore, the initial learning coefficient can be updated during transfer learning and thus more efficient learning can be realized [86].

12.2.5. Support vector machines

The SVM is the most popular traditional supervised machine learning approach and is used by researchers all over the world [87]. Due to its easy implementation and robustness, it is possible to come across an SVM in almost all machine learning-based classification studies. The basic principle of the SVM is to linearly separate two classes from each other by drawing a margin (a line, a plane or a hyperplane) between them. The goal of the SVM method is to maximize the margin between two classes [88]. When the datasets become non-linear, researchers solved this issue by adding some new kernel structures to the SVM to overcome these non-linear datasets. Some of these SVM kernels are linear, quadratic, cubic, fine Gaussian, medium Gaussian and coarse Gaussian.

12.2.5.1. The linear SVM kernel

$\begin{eqnarray}&&K(x,y)={x}^{T}y+c,\end{eqnarray} \tag{ 12.5 }$

where K denotes the kernel, x is the input vector (the feature vector of one class), y is the input vector (the feature vector of another class), c is a constant term and T denotes the transpose. The terms defined in equation (12.5), i.e. $K,{x},{y},{c},{T}$ , are also the same for equations (12.6)–(12.9).

12.2.5.2. The quadratic SVM kernel

$\begin{eqnarray}&&K(x,y)={(\alpha {x}^{T}y+c)}^{2},\end{eqnarray} \tag{ 12.6 }$

where α denotes the slope term. The quadratic SVM kernel is a type of polynomial SVM that has a polynomial degree of 2.

12.2.5.3. The cubic SVM kernel

$\begin{eqnarray}&&K(x,y)={(\alpha {x}^{T}y+c)}^{3},\end{eqnarray} \tag{ 12.7 }$

cubic SVM is a type of polynomial SVM that has a polynomial degree of 3.

12.2.5.4. The fine Gaussian SVM kernel

$\begin{eqnarray}&&K(x,y)={\rm{\exp }}\left(-\frac{{\parallel x-y\parallel }^{2}}{{2\sigma }^{2}}\right)\end{eqnarray} \tag{ 12.8 }$

$\begin{eqnarray}&&K(x,y)={\rm{\exp }}{(-\gamma \parallel x-y\parallel }^{2}),\end{eqnarray} \tag{ 12.9 }$

where γ depends on σ which is the Gaussian term. The fine Gaussian SVM kernel scale set is the result of the square root of P divided by 4, where P is the number of predictors. Equation (12.9) includes all types of Gaussian SVM kernels, such as fine, medium and coarse.

12.2.5.5. The medium Gaussian SVM kernel

The medium Gaussian SVM kernel scale set is the square root of P, where P is the number of predictors.

12.2.5.6. The coarse Gaussian SVM kernel

The coarse Gaussian SVM kernel scale set is the result of the square root of P multiplied by 4, where P is the number of predictors.

12.2.6. Performance metrics

The most basic tool used to assess the performance of a model is the confusion matrix, also called the error matrix. This matrix includes the true-positive, false-positive, false-negative and true-negative parameters. Using these indices, various performance metrics such as accuracy can be calculated. In this study, only the accuracy metric is taken into account and this reflects the system's ability to accurately estimate the ECG beats. It can be calculated as follows:

$\begin{eqnarray}&&\mathrm{Accuracy}=\frac{\mathrm{NoTCB}}{\mathrm{NoACB}}\end{eqnarray} \tag{ 12.10 }$

$\begin{eqnarray}&&\mathrm{NoTCB}=\mathrm{NoACB}-\mathrm{NoFCB},\end{eqnarray} \tag{ 12.11 }$

where NoTCB is the number of true classified beats, NoACB is the number of all classified beats and NoFCB is the number of false classified beats. All the numbers of classified beats belong to only one class.

12.3. Experimental work and results

A workstation equipped with the NVIDIA Quadro M4000 GPU was used in the experiments. The MATLAB software on the MIT-BIH arrhythmia database was considered in all coding. Five ECG beat types namely normal (N) beats, left-bundle-branch-block (LBBB) beats, right-bundle-branch-block (RBBB) beats, premature-ventricular-contraction (PVC) beats and paced beats (PBs) were used in the classification stage of the work. The MIT-BIH arrhythmia database contains 40 ECG records and all records were used in the experiments. In the experiments, 8067 N beat samples, 8001 LBBB beat samples, 7247 RBBB beat samples, 7125 PVC beat samples and 7018 PB samples were used.

The ECG signals were converted into ECG images by taking into account the annotations prepared by the experts. The signal-to-image conversion was handled using a code that was written by the chapter authors. First, the resolutions of the constructed images were adjusted to 224 × 224 px and then were converted to gray-scale images. Five well-known pre-trained deep CNNs, called VGG16, VGG19, ResNet18, ResNet50 and ResNet101, were considered in the experiments. In addition, five SVM methods, namely the linear SVM, quadratic SVM, cubic SVM, medium Gaussian SVM and coarse Gaussian SVM were used in the classification. In the linear SVM, the linear kernel function was used and the epsilon value of the SVM method was set to 0.01. In the quadratic SVM, the quadratic function was used as the kernel function and the epsilon value was adjusted to 0.2. The cubic kernel function was used in the cubic SVM and the epsilon value was assigned as 0.02. For both the medium and coarse Gaussian SVMs, the Gaussian kernel function was used and the epsilon parameters were adjusted to 0.1 and 0.2. The performance of the proposed method was evaluated by using the accuracy score. A ten-fold cross-validation procedure was also performed in the experiment to yield objective results. At the end of the ten-fold cross-validation procedure, the mean accuracy scores were calculated. Table 12.1 shows the obtained accuracy scores. The rows of table 12.1 show the SVM classifier types, while the columns show the pre-trained CNN models. The last row and column show the average accuracy scores.

Table 12.1. The results for the pre-trained deep CNN models and SVM classifiers in ECG beat classification.

	Accuracy (%)
Method	ResNet18	ResNet50	ResNet101	VGG16	VGG19	Average
Linear SVM	99.5	98.4	98.5	99.3	99.3	99.00
Quadratic SVM	99.3	99.2	99.3	99.6	99.6	99.40
Cubic SVM	99.4	99.2	99.3	99.6	99.6	99.42
Fine Gaussian SVM	99.2	99.0	99.0	99.4	99.3	99.18
Medium Gaussian SVM	99.2	98.9	99.0	99.4	99.3	99.16
Coarse Gaussian SVM	97.3	97.2	96.8	98.6	98.5	97.68
Average	98.98	98.65	98.65	99.32	99.27

From table 12.1, it is seen that only the VGG16 and VGG19 models produced average accuracy scores above 99.0%. VGG16 achieved 99.32% and VGG19 produced a 99.27% average accuracy score. In addition, the ResNet18 model obtained a 98.98% average accuracy score, and the ResNet50 and ResNet101 models both produced 98.65% average accuracy scores. The highest accuracy score of 99.6% was also produced by both the VGG16 and VGG19 models. When the obtained results are evaluated from the viewpoint of the classifiers, it is seen that the best mean accuracy score of 99.42% was yielded by the cubic SVM classifier. The second best mean accuracy score of 99.40% was yielded by the quadratic SVM, and the fine and medium Gaussian SVMs produced 99.18% and 99.16% average accuracy scores, respectively. Linear SVM produced a 99.00% average accuracy score and, finally, the worst average accuracy score of 97.68% was produced by the coarse Gaussian SVM classifier.

Figure 12.4 shows the confusion matrix for the ResNet50 features and the linear SVM classifier. The labels 1, 2, 3, 4 and 5 show N, LBBB, RBBB, PVC and PB, respectively. From figure 12.4, it is observed that 171 N beat samples were classified incorrectly. The correct classification rate of the N beat samples was 97.88%. 142 LBBB beat samples were misclassified and the accuracy of the LBBB beats was 98.23%.

The obtained correct classification rate for RBBB was 99.37% and only 46 misclassified RBBB samples can be seen in figure 12.4. The highest misclassification was obtained for PVC beats. In total, 214 PVC beat samples were classified incorrectly and the accuracy of the PVC beat was 97.00%. Finally, the highest accuracy score of 99.52% was obtained for the PB class. Only 34 samples were misclassified. Figure 12.5 shows the distribution of ResNet50 features. The x-axis shows the first column of the ResNet50 features and the y-axis shows the second column of the ResNet50 features.

**Figure 12.5.** The distribution of the ResNet50 features. The x-axis displays the first column of the ResNet50 features and the y-axis displays the second column of the ResNet50 features.
Download figure:
Standard image High-resolution image

Figure 12.6 illustrates the training period of the fine-tuning of the ResNet50 model. The top panel of figure 12.6 displays the training and testing accuracies (blue for training and black for testing) and the bottom panel of figure 12.6 shows the loss values for both the training and testing samples (orange for training and black for testing). Table 12.2 provides the results of the fine-tuning of the pre-trained deep CNNs on ECG beat categorization.

Table 12.2. The results of fine-tuning pre-trained deep CNN models on ECG beat classification.

Fine-tuning	Accuracy (%)
VGG16	99.04
ResNet18	99.39
ResNet50	99.36
ResNet101	99.49
VGG19	99.66

As seen in table 12.2, all the fine-tuned deep CNN models achieved classification accuracy scores above 99.00%. The highest accuracy score of 99.66% was yielded by the VGG19 model and a 99.49% accuracy score, which was the second best result, was obtained by the ResNet101 model. Accuracy scores of 99.39%, 99.36% and 99.04% were obtained by the ResNet18, ResNet50 and VGG16 models, respectively.

In the final experiments, a novel CNN model was constructed and trained in an end-to-end fashion by the chapter authors and used in ECG beat classification. The developed CNN model is depicted in figure 12.7 and, as can be seen, the developed CNN model was composed of 15 layers. The network started with the input layer and there were three convolution layers namely conv_1, conv_2 and conv_3 in the model, and batch normalization, ReLU and pooling layers followed each convolution layer. A fully connected layer, softmax layer and classification layer were also used for the classification of the ECG beats. The conv_1, conv_2 and conv_3 layers contained 64, 32 and 18 filters of size 3 × 3. The max operator was used in the pooling layers.

**Figure 12.7.** The developed CNN model.
Download figure:
Standard image High-resolution image

The layers of the proposed architecture and the parameters and values used with the layers are given in table 12.3. The SGDM was chosen as the optimization algorithm. The initial learning coefficient was set to 0.01 and 20 was used for the batch size value. The network trained more than 1200 iterations.

Table 12.3. Analysis of the CNN model for ECG beat classification.

1	'input'	224 × 224 × 3 images
2	'conv_1'	64 3 × 3 × 3 convolutions with stride [1 1] and padding 'same'
3	'BN_1'	Batch normalization
4	'relu_1'	ReLU
5	'pool_1'	2 × 2 max pooling with stride [2 2] and padding [0 0 0 0]
6	'conv_2'	32 3 × 3 × 8 convolutions with stride [1 1] and padding 'same'
7	'BN_2'	Batch normalization
8	'relu_2'	ReLU
9	'pool_2'	2 × 2 max pooling with stride [2 2] and padding [0 0 0 0]
10	'conv_3'	16 3 × 3 × 16 convolutions with stride [1 1] and padding 'same'
11	'BN_3'	Batch normalization
12	'relu_3'	ReLU
13	'fc'	Five fully connected layers
14	'softmax'	Softmax
15	'Classification'	Crossentropyex with '0' and nine other classes

The end-to-end learning scheme of the proposed CNN model is given in figure 12.8. The top panel of figure 12.8 shows the training and testing accuracies (blue for training and black for testing) and the bottom panel of figure 12.8 demonstrates the loss values for both the training and test samples (orange for training and black for testing). The obtained accuracy score was 98.68% and the training procedure was completed in 876 iterations.

12.4. Discussion

A comparison is presented in table 12.4, taking into account the methods, databases and results obtained in studies related to ECG beat classification.

Table 12.4. Comparison of the related studies.

Reference, method	Database	# of beat types	Accuracy (%)
Thomas et al [89], DTCWT	MIT-BIH	5	97.68
Kaur et al [90], WT, factor analysis, LDA	MIT-BIH	5	99.06
Rai et al [91], Daubechies WT, RBFNN	MIT-BIH	5	99.60
Khalaf et al [92], spectral correlation and SVM	MIT-BIH	5	98.60
Dong et al [33], deterministic learning, RBFNN	MIT-BIH	5	97.78
Osowski and Ling [35], fuzzy hybrid neural network	MIT-BIH	7	98.00
Zidelmal et al [53], support vector machine with rejection	MIT-BIH	2	98.80
Liu et al [54], vector quantization, k-medoids	QTDB	8	94.50
Mateo et al [55], RBFNN	A	2	98.71
Engin [56], neuro-fuzzy network	MIT-BIH	4	93.50
Asgharzadeh-Bonab et al [34], spectral entropy, deep CNN technique	MIT-BIH	5	99.33
Wang et al [52], morphological and temporal vector, global RNN	MIT-BIH	4	99.80
	INCARTDB
	SVDB
Li et al [93], deep residual CNN technique	MIT-BIH	5	99.06
Acharya et al [94], CNN technique	B	4	94.90
Yao et al [95], CNN technique	C	9	81.20
The proposed model, deep features and support vector machines	MIT-BIH	5	99.66

A: PhysioNet database (MIT-BIH Atrial Fibrillation Database, Long-Term AF Database, MIT-BIH Arrhythmia Database, AF Termination Challenge Database, MIT-BIH Normal Sinus Rhythm Database, St Petersburg Institute of Cardiological Technics 12-lead Arrhythmia Database, Normal Sinus Rhythm RR Interval Database).

INCARTDB: St Petersburg Institute of Cardiological Technics 12-lead Arrhythmia Database.

SVDB: The MIT-BIH Supraventricular Arrhythmia Database.

B: MIT-BIH Atrial Fibrillation Database (AFDB), MIT-BIH Arrhythmia Database (MITDB), Creighton University Ventricular Tachyarrhythmia Database (CUDB).

C: China Physiological Signal Challenge.

Thomas et al [89] used the dual-tree-complex WT and MLNs for the classification of cardiac arrhythmias. Five types of ECG beats were considered. The model followed three key stages that wer: preprocessing, feature transformation and classification. Amplitude normalization and band-pass filtering were applied to the raw ECG signals in the preprocessing stage. Dual-tree complex wavelet-based features and four other extracted features were used in the feature set. Finally, the MLNs were employed to determine the five different ECG beat types. The model achieved 97.68% classification accuracy. Kaur et al [90] used wavelet coefficients and feature reduction methods for efficient cardiac dysfunction classification. The WT was computed with db4. All WT coefficients at levels 4, 5, 6 and 7 were taken into account and a total of 43 coefficients constituted the feature vector. Then, factor analysis was utilized to reduce the size of the feature set without rotation and with orthogonal rotation. In this way, the number of features was decreased and 23 features were obtained. The model achieved 99.06% accuracy. Rai et al [91] used Daubechies WT and RBFNN for five types of ECG beat classification. A preprocessing step covering noise elimination, baseline wander removal, R-peak detection and QRS-complex extraction was applied to the raw ECG signals. Each ECG sample was represented by 21 points, which were input to the model. In this study, the performances of the BPNN, MLP and SVM classifiers were also examined and the model yielded a 99.60% classification score. Khalaf et al [92] used spectral correlation and the SVM classifier for cardiac arrhythmia classification. For this purpose, the spectral correlation was utilized. The proposed system used Otsu's thresholding, and statistic features which included PCA and Fisher scores. The SVM classifier was used to determine the ECG beat types and the model produced a 98.60% classification accuracy. Dong et al [33] introduced a new approach based on the deterministic learning algorithm. This system ensured different features and the model was performed on the MIT-BIH database. The main advantage of the system was that no feature extraction procedure was required for the test beats. The model yielded a 97.78% classification score. Osowski and Ling [35] carried out ECG beat classification based on higher-order statistical features and a fuzzy neural network. The model produced an accuracy of 98.00% using a multi-layer perceptron and Fourier analysis. Zidelmal et al [53] used an SVM with rejection. The frequency information, RR intervals, QRS morphology and AC power of QRS detail coefficients were also considered in the analysis. The beats were collected into two classes and the model achieved 98.80% classification. Liu et al [54] used the vector quantization method for classifying eight different ECG beat types. To de-noise the raw ECG signal, a median filter was used in the preprocessing step. Wave boundary detection was also performed. The dictionary learning algorithm was employed to describe the signals. As a result, the model achieved 94.50% classification success using VQ k-medoids. Mateo et al [55] introduced a system using RBFNN to cancel out ectopics.

As seen in table 12.4 and mentioned in the studies [35, 53–56, 89–92], ECG beat classification has been realized using advanced signal processing techniques that include WT, high-ordered statistical features, PCA, ICA, VQ, factor analysis, PSO and specific morphological feature extraction processes. These approaches mainly followed three key stages: preprocessing, feature transformation (feature extraction and feature selection) and finally classification. The quality of both the preprocessing and feature extraction steps is critical to ensuring a consistent and robust diagnostic tool, since the performance of the next steps of the model depends on these inital steps. In other words, the selected features have a great impact on the performance of the classifiers.

Asgharzadeh-Bonab et al [34] proposed a model based on the spectral entropy and deep CNN. The noise reduction and QRS segmentation were applied as the preprocessing procedures in the proposed model. Then the model followed time–frequency analysis, feature reduction, and classification steps. The images obtained using the two-directional two-dimensional principal component analysis were applied as the input to deep CNN. The model ensured an average accuracy of 98.33%. Wang et al [52] proposed a model fed with morphological and temporal vectors and uses a global recurrent neural network. The model applied a single RNN for automatic feature learning and classification. An end-to-end learning schema was ensured in this study. As a result, a diagnostic tool with high accuracy was ensured. Li et al [93] offered a deep residual CNN model from 2 lead ECG. The efficient features were learned directly ECG samples. The 1D convolutional layer was used in the architecture. As a result, a successful model was ensured. Acharya et al [94] introduced two architectures of CNN performed on the three different databases. They did not consider the QRS-complex in the experiment. Also, feature extraction and feature selection were not involved. The model achieved an accuracy of 94.90%. Yao et al [95] suggested an attention-based time-incremental CNN model. The model performed on a database consisting of a normal and eight different beat types and ensured 81.20% classification accuracy.

Considering the related studies [34, 52, 93–95], it is seen that in addition to the models following traditional preprocessing, feature transformation and classification processes, deep CNN models have also been used to classify different ECG beat types, thanks to their efficient end-to-end learning schemes. In this context, it has been observed that both deep CNN models, LSTM-based models, ResNet models and RNN models have been used efficiently. Deep CNN networks push the raw data, which is received as inputs, forward to many non-linear layers throughout the network. In this process, the deep CNN models perform the learning and classification processes directly on the input data using functions such as convolution, pooling and dropout. Thanks to this end-to-end learning scheme, the labor-intensive extraction process can be eliminated from the diagnostic models. Another great feature of the deep CNN models is that they offer a hierarchical learning scheme. Activation maps obtained from different layers can reveal many distinctive features of the input data. Sharing weights in the network also provides stronger generalization performance. In the proposed method, the performance of the pretrained CNN models was first examined to classify the ECG beats. Later, instead of extracting features manually, the deep features obtained from the activation maps were inputted into the SVM classifier and quite efficient results were obtained. In addition to all their advantages, deep CNN models need a lot of data for proper training. As this experimental study focused on ECG beats, sufficient data were available. However, for many clinical applications, accessing such a large-scale dataset may not be possible or may take a lot of time. In addition, sufficient hardware resources with high capacity must be available for the training of deep CNN models. Moreover, compared to traditional machine learning methods, deep CNN models take a longer time to train.

The vast majority of studies were carried out on the MIT-BIH database. In addition, different databases are also available through PhysioNet for ECG beat classification. In the studies, it can be observed that five classes are generally used for ECG beat classification problems, and binary classification has also been realized by separating the beats as normal and abnormal. Moreover, some researchers evaluated the problem as having four, seven or eight classes. These differences also show that it is not possible to make a one-to-one comparison. In related studies, it is seen that the classification performance is generally over 95%. It is also worth noting that there is a significant increase in classification success with the RNN and CNN based methods that use end-to-end learning schemes. The classification success has been over 99% using these models. Asgharzadeh-Bonab et al [34] achieved a 99.33% classification score while Wang et al [52] reported an accuracy of 99.80%. Similarly, for the proposed model we achieved 99.66% classification. As a result, from table 12.4, it is observed that the proposed method outperformed most other state-of-the-art methods.

12.5. Conclusion

In this chapter, three deep CNN approaches are considered in ECG beat classification. These approaches achieve an end-to-end learning scheme of a new CNN model, fine-tuning of pre-trained CNN models, and extraction of deep features and their classification using a traditional or deep machine learning approach such as SVM. The obtained results present the following conclusions:

1.
Saving ECG beats as images and applying the CNN approach to them for classification purposes produced quite efficient results. All deep CNN based approaches produced reasonable accuracy scores.
2.
Fine-tuning of a pre-defined model and deep feature extraction produced better results than end-to-end training of the new CNN model. This might have happened because the pre-trained models were trained on 25 million images.
3.
The cubic kernel function produced better accuracy scores than the other kernel functions.

In future works, dense CNN models will be developed and trained on ECG beat classification. In addition, time–frequency images are considered for comparison with the presented results.

References

[1]Sahoo S, Dash M, Behera S and Sabut S 2020 Machine learning approach to detect cardiac arrhythmias in ECG signals: a survey IRBM in press
- Go to reference in chapter
- Crossref
[2]McSharry P E, Clifford G D, Tarassenko L and Smith L A 2003 A dynamical model for generating synthetic electrocardiogram signals IEEE Trans. Biomed. Eng. 50 289–94
- Go to reference in chapter
- Crossref
[3]Cömert Z, Kocamaz A F and Kocamaz A F 2015 Determination of QT interval on synthetic electrocardiogram pp 1–4
- Go to reference in chapter
[4]Diker A, Cömert Z, Avci E and Velappan S 2018 Intelligent system based on genetic algorithm and support vector machine for detection of myocardial infarction from ECG signals pp 1–4
- Go to reference in chapter
[5]Thomas I C, Cork D P, Bhatia A K, Nayak H M, Burke M C and Moss J D 2015 QRS complex fragmentation and survival following left ventricular assist device implantation Int. J. Cardiol. Heart Vasc. 7 165–9
- Go to reference in chapter
- Crossref
[6]Chiladakis J, Kalogeropoulos A, Zagkli F and Chouchoulis K 2015 Effect of heart rate on the intrinsic and the ventricular-paced QRS duration J. Electrocardiol. 48 689–95
- Go to reference in chapter
- Crossref
[7]Ünal S et al 2018 Tp–e interval and Tp–e/QT ratio in patients with human immunodeficiency virus J. Infect. Public Health 11 35–8
- Go to reference in chapter
- Crossref
[8]Correa R, Arini P D, Correa L S, Valentinuzzi M and Laciar E 2014 Novel technique for ST-T interval characterization in patients with acute myocardial ischemia Comput. Biol. Med. 50 49–55
- Go to reference in chapter
- Crossref
[9]Diker A, Cömert Z and Avcı E 2017 A diagnostic model for identification of myocardial infarction from electrocardiography signals Bitlis Eren Univ. J. Sci. Technol. 7 132–9
- Go to reference in chapter
- Crossref
[10]Guyot P, Djermoune E-H and Bastogne T 2019 Assessment of non-negative matrix factorization for the preprocessing of long-term ECG J. Pharmacol. Toxicol. Methods 99 106595
- Go to reference in chapter
- Crossref
[11]Jain S, Bajaj V and Kumar A 2018 Riemann Liouvelle fractional integral based empirical mode decomposition for ECG denoising IEEE J. Biomed. Health Inform. 22 1133–9
- Go to reference in chapter
- Crossref
[12]Gu X, Hu J, Zhang L, Ding J and Yan F 2020 An improved method with high anti-interference ability for R peak detection in wearable devices IRBM 41 172–83
- Go to reference in chapter
- Crossref
[13]Gupta V, Mittal M and Mittal V 2019 R-peak detection using chaos analysis in standard and real time ECG databases IRBM 40 341–54
- Go to reference in chapter
- Crossref
[14]Zalabarria U, Irigoyen E, Martinez R and Lowe A 2020 Online robust R-peaks detection in noisy electrocardiograms using a novel iterative smart processing algorithm Appl. Math. Comput. 369 124839
- Go to reference in chapter
- Crossref
[15]Bajaj A and Kumar S 2019 QRS complex detection using fractional Stockwell transform and fractional Stockwell Shannon energy Biomed. Signal Process. Control 54 101628
- Go to reference in chapter
- Crossref
[16]Lee J S, Lee S J, Choi M, Seo M and Kim S W 2019 QRS detection method based on fully convolutional networks for capacitive electrocardiogram Expert Syst. Appl. 134 66–78
- Go to reference in chapter
- Crossref
[17]Malik J, Soliman E Z and Wu H-T 2020 An adaptive QRS detection algorithm for ultra-long-term ECG recordings J. Electrocardiol. 60 165–71
- Go to reference in chapter
- Crossref
[18]Jain S, Ahirwal M K, Kumar A, Bajaj V and Singh G K 2017 QRS detection using adaptive filters: a comparative study ISA Trans. 66 362–75
- Go to reference in chapter
- Crossref
[19]Christov I et al 2006 Dataset of manually measured QT intervals in the electrocardiogram Biomed. Eng. Online 5 31
- Go to reference in chapter
- Crossref
[20]Wang L, Lin Y and Wang J 2019 A RR interval based automated apnea detection approach using residual network Comput. Methods Programs Biomed. 176 93–104
- Go to reference in chapter
- Crossref
[21]Pachori R B, Avinash P, Shashank K, Sharma R and Acharya U R 2015 Application of empirical mode decomposition for analysis of normal and diabetic RR-interval signals Expert Syst. Appl. 42 4567–81
- Go to reference in chapter
- Crossref
[22]Thuraisingham R A 2006 Preprocessing RR interval time series for heart rate variability analysis and estimates of standard deviation of RR intervals Comput. Methods Programs Biomed. 83 78–82
- Go to reference in chapter
- Crossref
[23]Hsu C F, Lin P-Y, Chao H-H, Hsu L and Chi S 2019 Average entropy: measurement of disorder for cardiac RR interval signals Phys. A Stat. Mech. Appl. 529 121533
- Go to reference in chapter
- Crossref
[24]Schneider M A et al 2017 Reducing door-in door-out intervals in helicopter ST-segment elevation myocardial infarction interhospital transfers Air Med. J. 36 244–7
- Go to reference in chapter
- Crossref
[25]Bouaguel I, Foudad H and Trichine A 2018 Corrected QT interval prolongation: a new predictor of cardiovascular risk inpatients with non-ST-elevation acute coronary syndrome Algerian cohort Arch. Cardiovasc. Dis. Suppl. 10 49
- Go to reference in chapter
- Crossref
[26]Lanatá A, Valenza G, Mancuso C and Scilingo E P 2011 Robust multiple cardiac arrhythmia detection through bispectrum analysis Expert Syst. Appl. 38 6798–804
- Go to reference in chapter
- Crossref
[27]Kondo H, Shinohara T and Takahashi N 2015 A case of short-coupled premature ventricular beat-induced ventricular fibrillation with early repolarization in the inferolateral leads J. Arrhythmia 31 60–3
- Go to reference in chapter
- Crossref
[28]Shah J et al 2020 Prevalence of cardiovascular risk factors and financial burden in younger adults hospitalized with atrial fibrillation Heart Lung 49 393–7
- Go to reference in chapter
- Crossref
[29]Hulleman M et al 2020 Ventricular fibrillation waveform characteristics in out-of-hospital cardiac arrest and cardiovascular medication use Resuscitation 151 173–80
- Go to reference in chapter
- Crossref
[30]Sangaiah A K, Arumugam M and Bian G-B 2020 Artif. Intell. Med. 103 101788
- Go to reference in chapter
- Crossref
[31]Wainwright B et al 2020 Autoimmune-mediated congenital heart block Best Pract. Res. Clin. Obstet. Gynaecol. 64 41–51
- Go to reference in chapter
- Crossref
[32]Chen Q et al 2019 The value of baseline and arrhythmic ECG in the interpretation of arrhythmic mechanisms Card. Electrophysiol. Clin. 11 219–38
- Go to reference in chapter
- Crossref
[33]Dong X, Wang C and Si W 2017 ECG beat classification via deterministic learning Neurocomputing 240 1–12
- Go to reference in chapter
- Crossref
[34]Asgharzadeh-Bonab A, Amirani M C and Mehri A 2020 Spectral entropy and deep convolutional neural network for ECG beat classification Biocybern. Biomed. Eng. 40 691–700
- Go to reference in chapter
- Crossref
[35]Osowski S and Linh T H 2001 ECG beat recognition using fuzzy hybrid neural network IEEE Trans. Biomed. Eng. 48 1265–71
- Go to reference in chapter
- Crossref
[36]Castells F, Laguna P, Sörnmo L, Bollmann A and Roig J M 2007 Principal component analysis in ECG signal processing EURASIP J. Adv. Signal Process. 2007 74580
- Go to reference in chapter
- Crossref
[37]Monasterio V, Laguna P and MartÍnez J P 2009 Multilead analysis of T-wave alternans in the ECG using principal component analysis IEEE Trans. Biomed. Eng. 56 1880–90
- Go to reference in chapter
- Crossref
[38]Banerjee S and May M M 2013 ECG beat classification based on discrete wavelet transformation and nearest neighbour classifier J. Med. Eng. Technol. 37 264–72
- Go to reference in chapter
- Crossref
[39]Daamouche A, Hamami L, Alajlan N and Melgani F 2012 A wavelet optimization approach for ECG signal classification Biomed. Signal Process. Control 7 342–9
- Go to reference in chapter
- Crossref
[40]Yan J and Lu L 2014 Improved Hilbert–Huang transform based weak signal detection methodology and its application on incipient fault diagnosis and ECG signal analysis Signal Process. 98 74–87
- Go to reference in chapter
- Crossref
[41]Dutta S, Chatterjee A and Munshi S 2011 Identification of ECG beats from cross-spectrum information aided learning vector quantization Measurement 44 2020–7
- Go to reference in chapter
- Crossref
[42]Akhbari M, Ghahjaverestan N M, Shamsollahi M B and Jutten C 2018 ECG fiducial point extraction using switching Kalman filter Comput. Methods Programs Biomed. 157 129–36
- Go to reference in chapter
- Crossref
[43]Bhagyalakshmi V, Pujeri R V and Devanagavi G D 2018 GB-SVNN: genetic BAT assisted support vector neural network for arrhythmia classification using ECG signals J. King Saud Univ.—Comput. Inf. Sci. in press
- Go to reference in chapter
- Crossref
[44]Zhou S and Tan B 2020 Electrocardiogram soft computing using hybrid deep learning CNN-ELM Appl. Soft Comput. 86 105778
- Go to reference in chapter
- Crossref
[45]Yeh Y-C, Wang W-J and Chiou C W 2009 Cardiac arrhythmia diagnosis method using linear discriminant analysis on ECG signals Measurement 42 778–89
- Go to reference in chapter
- Crossref
[46]Joo S, Choi K-J and Huh S-J 2012 Prediction of spontaneous ventricular tachyarrhythmia by an artificial neural network using parameters gleaned from short-term heart rate variability Expert Syst. Appl. 39 3862–6
- Go to reference in chapter
- Crossref
[47]Haseena H H, Joseph P K and Mathew A T 2011 Classification of arrhythmia using hybrid networks J. Med. Syst. 35 1617–30
- Go to reference in chapter
- Crossref
[48]Zeng N, Wang Z, Li Y, Du M and Liu X 2012 A hybrid EKF and switching PSO algorithm for joint state and parameter estimation of lateral flow immunoassay models IEEE/ACM Trans. Comput. Biol. Bioinform. 9 321–9
- Go to reference in chapter
- Crossref
[49]Jain S, Bajaj V and Kumar A 2016 Efficient algorithm for classification of electrocardiogram beats based on artificial bee colony-based least-squares support vector machines classifier Electron. Lett. 52 1198–200
- Go to reference in chapter
- Crossref
[50]Diker A, Avci D, Avci E and Gedikpinar M 2018 A new technique for ECG signal classification genetic algorithm wavelet kernel extreme learning machine Optik 180 46–55
- Go to reference in chapter
- Crossref
[51]à Mougoufan J B B, Fouda J S A E, Tchuente M and Koepf W 2020 Adaptive ECG beat classification by ordinal pattern based entropies Commun. Nonlinear Sci. Numer. Simul. 84 105156
- Go to reference in chapter
- Crossref
[52]Wang G et al 2019 A global and updatable ECG beat classification system based on recurrent neural networks and active learning Inf. Sci. 501 523–42
- Go to reference in chapter
- Crossref
[53]Zidelmal Z, Amirou A, Ould-Abdeslam D and Merckle J 2013 ECG beat classification using a cost sensitive classifier Comput. Methods Programs Biomed. 111 570–7
- Go to reference in chapter
- Crossref
[54]Liu T, Si Y, Wen D, Zang M and Lang L 2016 Dictionary learning for VQ feature extraction in ECG beats classification Expert Syst. Appl. 53 129–37
- Go to reference in chapter
- Crossref
[55]Mateo J, Torres A M, Aparicio A and Santos J L 2016 An efficient method for ECG beat classification and correction of ectopic beats Comput. Electr. Eng. 53 219–29
- Go to reference in chapter
- Crossref
[56]Engin M 2004 ECG beat classification using neuro-fuzzy network Pattern Recognit. Lett. 25 1715–22
- Go to reference in chapter
- Crossref
[57]Übeyli E D 2009 Combining recurrent neural networks with eigenvector methods for classification of ECG beats Digit. Signal Process. 19 320–9
- Go to reference in chapter
- Crossref
[58]Korürek M and Doğan B 2010 ECG beat classification using particle swarm optimization and radial basis function neural network Expert Syst. Appl. 37 7563–9
- Go to reference in chapter
- Crossref
[59]Yu S-N and Chen Y-H 2007 Electrocardiogram beat classification based on wavelet transformation and probabilistic neural network Pattern Recognit. Lett. 28 1142–50
- Go to reference in chapter
- Crossref
[60]Güler İ and Übeyli E D 2005 A modified mixture of experts network structure for ECG beats classification with diverse features Eng. Appl. Artif. Intell. 18 845–56
- Go to reference in chapter
- Crossref
[61]Dutta S, Chatterjee A and Munshi S 2010 Correlation technique and least square support vector machine combine for frequency domain based ECG beat classification Med. Eng. Phys. 32 1161–9
- Go to reference in chapter
- Crossref
[62]Yu S-N and Chou K-T 2009 Selection of significant independent components for ECG beat classification Expert Syst. Appl. 36 2088–96
- Go to reference in chapter
- Crossref
[63]Wang K, Yang G, Huang Y and Yin Y 2020 Multi-scale differential feature for ECG biometrics with collective matrix factorization Pattern Recognit. 102 107211
- Go to reference in chapter
- Crossref
[64]Mondéjar-Guerra V, Novo J, Rouco J, Penedo M G and Ortega M 2019 Heartbeat classification fusing temporal and morphological information of ECGs via ensemble of classifiers Biomed. Signal Process. Control 47 41–8
- Go to reference in chapter
- Crossref
[65]Librenza-Garcia D et al 2017 The impact of machine learning techniques in the study of bipolar disorder: a systematic review Neurosci. Biobehav. Rev. 80 538–54
- Go to reference in chapter
- Crossref
[66]Toğaçar M, Cömert Z and Ergen B 2020 Classification of brain MRI using hyper column technique with convolutional neural network and feature selection method Expert Syst. Appl. 149 113274
- Go to reference in chapter
- Crossref
[67]Toğaçar M, Ergen B and Cömert Z 2020 Application of breast cancer diagnosis based on a combination of convolutional neural networks, ridge regression and linear discriminant analysis using invasive breast cancer images processed with autoencoders Med. Hypotheses 135 109503
- Go to reference in chapter
- Crossref
[68]Butun E, Yildirim O, Talo M, Tan R-S and Acharya U R 2020 1D-CADCapsNet: one dimensional deep capsule networks for coronary artery disease detection using ECG signals Phys. Med. 70 39–48
- Go to reference in chapter
- Crossref
[69]Daldal N, Sengur A, Polat K and Cömert Z 2020 A novel demodulation system for base band digital modulation signals based on the deep long short-term memory model Appl. Acoust. 166 107346
- Go to reference in chapter
- Crossref
[70]Cömert Z 2020 Fusing fine-tuned deep features for recognizing different tympanic membranes Biocybern. Biomed. Eng. 40 40–51
- Go to reference in chapter
- Crossref
[71]Goldberger A L et al 2000 Circulation 101 PhysioBank, PhysioToolkit, and PhysioNet components of a new research resource for complex physiologic signals e215–20
- Go to reference in chapter
[72]Moody G B and May M R G 2001 The impact of the MIT-BIH arrhythmia database IEEE Eng. Med. Biol. Mag. 20 45–50
- Go to reference in chapter
- Crossref
[73]Toğaçar M, Ergen B and Cömert Z 2019 Waste classification using AutoEncoder network with integrated feature selection method in convolutional neural network models Measurement 2019 107459
- Go to reference in chapter
- Crossref
[74]Yu Y, Lin H, Meng J, Wei X, Guo H and Zhao Z 2017 Deep transfer learning for modality classification of medical images Information 8 91
- Go to reference in chapter
- Crossref
[75]Traore B B, Kamsu-Foguem B and Tangara F 2018 Deep convolution neural network for image recognition Ecol. Inform. 48 257–68
- Go to reference in chapter
- Crossref
[76]Bakator M and Radosav D 2018 Deep learning and medical diagnosis: a review of literature Multimodal Technol. Interact. 2 47
- Go to reference in chapter
- Crossref
[77]Başaran E, Cömert Z and Çelik Y 2020 Convolutional neural network approach for automatic tympanic membrane detection and classification Biomed. Signal Process. Control 56 101734
- Go to reference in chapter
- Crossref
[78]Toğaçar M, Ergen B and Cömert Z 2020 Detection of lung cancer on chest CT images using minimum redundancy maximum relevance feature selection method with convolutional neural networks Biocybern. Biomed. Eng. 40 23–39
- Go to reference in chapter
- Crossref
[79]Budak Ü, Cömert Z, Rashid Z N, Şengür A and Çıbuk M 2019 Computer-aided diagnosis system combining FCN and Bi-LSTM model for efficient breast cancer detection from histopathological images Appl. Soft Comput. 85 105765
- Go to reference in chapter
- Crossref
[80]Liu H, Li B, Lv X and Huang Y 2017 Image retrieval using fused deep convolutional features Procedia Comput. Sci. 107 749–54
- Go to reference in chapter
- Crossref
[81]Zeiler M D 2012 ADADELTA: An Adaptive Learning Rate Method http://doi.acm.org.ezproxy.lib.ucf.edu/10.1145/1830483.1830503
- Go to reference in chapter
[82]Shindjalova R, Prodanova K and Svechtarov V 2014 Modeling data for tilted implants in grafted with bio-oss maxillary sinuses using logistic regression AIP Conf. Proc. 1631 58–62
- Go to reference in chapter
- Crossref
[83]Deniz E, Sengür A, Kadiroglu Z, Guo Y, Bajaj V and Budak Ü 2018 Transfer learning based histopathologic image classification for breast cancer detection Health Inf. Sci. Syst. 6 18
- Go to reference in chapter
- Crossref
[84]Altuntaş Y, Cömert Z and Kocamaz A F 2019 Identification of haploid and diploid maize seeds using convolutional neural networks and a transfer learning approach Comput. Electron. Agric. 163 104874
- Go to reference in chapter
- Crossref
[85]LeCun Y, Bengio Y and Hinton G 2015 Deep learning Nature 521 436–44
- Go to reference in chapter
- Crossref
[86]Talo M 2019 Automated classification of histopathology images using transfer learning Artif. Intell. Med. 2019 101743
- Go to reference in chapter
- Crossref
[87]Cortes C and Vapnik V 1995 Support-vector networks Mach. Learn. 20 273–97
- Go to reference in chapter
- Crossref
[88]Burges C J C 1998 A tutorial on support vector machines for pattern recognition Data Min. Knowl. Discov. 2 121–67
- Go to reference in chapter
- Crossref
[89]Thomas M, Das M K and Ari S 2015 Automatic ECG arrhythmia classification using dual tree complex wavelet based features AEU—Int. J. Electron. Commun. 69 715–21
- Go to reference in chapter
- Crossref
[90]Kaur M and Arora A S 2012 Classification of ECG signals using LDA with factor analysis method as feature reduction technique J. Med. Eng. Technol. 36 411–20
- Go to reference in chapter
- Crossref
[91]Rai H M, Trivedi A, Chatterjee K and Shukla S 2014 R-peak detection using Daubechies wavelet and ECG signal classification using radial basis function neural network J. Inst. Eng. Ser. B 95 63–71
- Go to reference in chapter
- Crossref
[92]Khalaf A F, Owis M I and Yassine I A 2015 A novel technique for cardiac arrhythmia classification using spectral correlation and support vector machines Expert Syst. Appl. 42 8361–8
- Go to reference in chapter
- Crossref
[93]Li Z, Zhou D, Wan L, Li J and Mou W 2020 Heartbeat classification using deep residual convolutional neural network from 2-lead electrocardiogram J. Electrocardiol. 58 105–12
- Go to reference in chapter
- Crossref
[94]Acharya U R, Fujita H, Lih O S, Hagiwara Y, Tan J H and Adam M 2017 Automated detection of arrhythmias using different intervals of tachycardia ECG segments with convolutional neural network Inf. Sci. 405 81–90
- Go to reference in chapter
- Crossref
[95]Yao Q, Wang R, Fan X, Liu J and Li Y 2020 Multi-class arrhythmia detection from 12-lead varied-length ECG using attention-based time-incremental convolutional neural network Inf. Fusion 53 174–82
- Go to reference in chapter
- Crossref

Export references: BibTeX RIS

Electrocardiogram beat classification using deep convolutional neural network techniques

Chapter navigation

Export citation and abstract

Permissions

Share this chapter

Affiliations

Dates

Chapter DOI

Books links

Abstract

12.1. Introduction

12.2. Material and methods

12.2.1. The MIT-BIH database

12.2.2. Producing ECG beat images

12.2.3. Convolutional neural networks (CNNs)

12.2.4. Deep transfer learning (DTL)

12.2.5. Support vector machines

12.2.5.1. The linear SVM kernel

12.2.5.2. The quadratic SVM kernel

12.2.5.3. The cubic SVM kernel

12.2.5.4. The fine Gaussian SVM kernel

12.2.5.5. The medium Gaussian SVM kernel

12.2.5.6. The coarse Gaussian SVM kernel

12.2.6. Performance metrics

12.3. Experimental work and results

12.4. Discussion

12.5. Conclusion

References