Search in:

Applied Artificial Intelligence

An International Journal

Volume 33, 2019 - Issue 1

Submit an article Journal homepage

Free access

457

Views

CrossRef citations to date

Altmetric

Listen

Articles

A Neuro-Genetic Technique for Pruning and Optimization of ANN Weights

Sakshi SakshiElectronics & Communication Engineering Department, Thapar University, Patiala, IndiaCorrespondence[email protected]
View further author information

Ravi KumarElectronics & Communication Engineering Department, Thapar University, Patiala, IndiaView further author information

Pages 1-26 | Published online: 23 Oct 2018

Cite this article
https://doi.org/10.1080/08839514.2018.1525524
CrossMark

In this article

ABSTRACT
Introduction
Related Work
Problem Formulation
Implementation
Results and Discussions
Conclusion
Additional information
References

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions
View PDF PDF View EPUB EPUB

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

A novel technique for optimization of artificial neural network (ANN) weights which combines pruning and Genetic Algorithm (GA) has been proposed. The technique first defines “relevance” of initialized weights in a statistical sense by introducing a coefficient of dominance for each weight and subsequently employing the concept of complexity penalty. Based upon complexity penalty for each weight, candidate solutions are initialized to participate in the Genetic optimization. The GA stage employs mean square error as the fitness function which is evaluated once for all candidate solutions by running the forward pass of backpropagation. Subsequent reproduction cycles generate fitter individuals and the GA is terminated after a small number of cycles. It has been observed that ANNs trained with GA optimized weights exhibit higher convergence, lower execution time, and higher success rate in the test phase. Furthermore, the proposed technique yields substantial reduction in computational resources.

Introduction

Artificial Neural Networks (ANNs) and genetic algorithms (GA) are popular paradigms for learning and optimization, each with their own strengths and weaknesses. ANNs trained with backpropagation (BP) algorithm are highly popular due to their low complexity and ease of implementation. However, they are prone to getting stuck in local minima thereby yielding suboptimal solutions. GA, on the other hand, is good at global search and exhibits a robust performance. Applications of GA have been reported for defense in biological and chemical warfare (Haupt and Haupt Citation2011), design of combinational logic (Louis Citation2005), and wireless ad-hoc networks (Karthikeyan, Baskar, and Alphones Citation2012). Recent advancements in GA is based on a three-dimensional cellular GAs proposed by Asmaa, Erdogan, and Arslan (Citation2013) that provides better performance in terms of efficiency, and efficacy, for most of the problems. However, unguided mutation and variations in a host of parameters leads to extremely slow convergence of GA. With an aim to obtain complementary advantages from both GA and ANN, many researchers have reported a variety of neuro genetic techniques with varying success rates (Christiansen et al. Citation2012; Karlra and Prakash Citation2003; Srivastava, Shukla, and Srivastava Citation1998).

Design of ANN using GA has been under consideration for quite some time (Hunter and Chiu Citation2000). Kwon and Moon (Citation2007) have reported a neuro-genetic stock forecasting system where a recurrent neural network is trained for predicting the performance of a particular stock using a large pool of previous data. Similarly, Ramasubramanian and Kannan (Citation2006) have described a database intrusion prediction system based upon a hybrid neuro-genetic network. This hybrid approach to weight optimization is particularly useful when the ANN is trained with BP algorithm. In design of a hybrid neuro-genetic system, there is a scope to rope-in both improved GA and improved ANN training methodologies. Some authors have described a model which takes into account interdependence between individuals taking part in the reproduction process, thus building a more rational strategy for selection and fitness evaluation (Bull Citation2001). Hu et al. (Citation2015) have proposed a GA-optimized ANN to be used in ultrasonic flow meters. The GA is utilized to determine ANN architecture based on its efficient parallel advantage in global search. Using this approach, the initial weights and biases are also optimized, which makes the network capable to avoid getting stuck in local minima. A novel approach to improve the classification performance of a polynomial neural network has been proposed by Lin, Prasad, and Saxena (Citation2015). Libelli, Marsili, and Alba (Citation2000) described an adaptive mutation technique to enhance the mutation probability of less fit individuals. A novel “wavelet mutation operator” has been proposed by Ling and Leung (Citation2007a; and Citation2007b). Kumar, Gospodaric, and Bauer (Citation2007) report an improved GA based on biological evolution. The efficacy of GA as a pre-processing tool has been demonstrated on feature subset selection in Tan et al. (Citation2008). An ANN has been trained for bearing fault detection using input features selected by GA (Samanta, Al-Balushi, and Al-Araimi Citation2006). However, the task of optimizing weights of an ANN is a challenging one since the GA and ANN can no longer be treated as separate stages of a system. In this work, we report a neuro-genetic classifier trained with BP algorithm, in which an optimum set of weights is chosen using a novel GA-based technique.

BP works well on simple training problems but its performance falls off rapidly as the dimension or complexity (high correlation in the input samples) of input data increases. The problem of training a network using BP has two aspects. First is the task of finding the optimal number of hidden layer neurons. Second, the determination of weights and biases of the ANN. Specifying the weights of a neural net is mostly viewed as an optimization process, where an error goal needs to be achieved. However, optimization using BP imposes a great risk of getting stuck to a local minimum, since the error surface is high dimensional. GAs do not encounter such kind of performance issue as they keep the current best solution as the part of their population while they search for further better solutions. It is also clear that all the weights do not contribute equally to pattern classification. Thus, there are some set of weights which have less or no impact on the pattern separability. These types of weights are redundant and are referred to as excess weights (Haykin Citation2007). Therefore, the first issue addressed in this paper is elimination of redundant weights. This aspect has been addressed by many researchers in a plentitude of ways. A pruning strategy for Extreme Learning Machine has been proposed using the Successive Projections Algorithm (SPA) as an approach to automatically find the number of hidden nodes (Mesquita et al. Citation2015). Another pruning strategy has been proposed to find a compact structure of feedforward neural networks with high generalization ability in classification problems (Tong and Tanaka Citation2015). One of the novel aspects of this work is to amalgamate weight pruning and GA. This aspect has been discussed in the implementation section. A brief overview of the relevant prior work on GA-based weight pruning is now being presented in the next section.

Related Work

In the early stages of development of ANN training paradigms, Reed (Citation1993) presented a comprehensive review of network pruning algorithms. One of the earliest reported uses of GA for weight pruning can be traced in (Aleksander and Taylor Citation1992) which employed pruning of a trained network prior to presenting the test samples for improved generalization. This genetically pruned network was used for face recognition and sonar signal classification. Recently, Sabo and Yu (Citation2008) have proposed a simple pruning algorithm based upon cross validation and sensitivity analysis in order to get an optimal neural network topology which will result into a significantly lower computational complexity. Augasta and Kathirvalavakumar (Citation2015) have conducted a survey of existing pruning techniques that optimize the architecture of neural networks. They have also evaluated the effectiveness of various pruning techniques based on sensitivity analysis, mutual information, and significance on four real data sets. Apart from network and weight pruning approaches, researchers are also opting for statistical methods to prune the input features for simpler network architecture since the number of input layer neurons is equal to the input data dimension. Kingston, Maier, and Lambert (Citation2004) have employed statistical input pruning for environmental modeling. Christiansen et al. (Citation2012) have applied optimal brain surgeon (OBS) algorithm for automatically generating a topologically optimized ANN based upon the statistical properties of the input data set. GA-based search has been employed by Mantzaris, Anastassopoulos, and Adamopoulos (Citation2011) to eliminate excess weights from an ANN used to estimate Urinary Tract Infection from medical examination data. Workers have also reported novel evolutionary algorithms for pruning ANNs constructed for prediction applications (Yang and Chen Citation2012). However, an in-depth study of the prior work reveals the fact that researchers have by and large ignored the role of mutation operator in the GA flow in selecting the fittest weights for an ANN. Furthermore, the authors felt a need to incorporate some information about the statistical properties of individual weights converging on a particular neuron in the GA flow. The novel aspect of this work is the introduction of a mutation operator which performs thresholding of individual weights based upon their contribution to overall weight variance. Though the problem of weight optimization using GA has been attempted with varying degrees of success by previous researchers, it is for the first time to the best of our knowledge that the concept of weight pruning has been incorporated into the conventional GA, so that the weights having smaller effect on the network as compared to other weights be declared as excess weights and eventually be removed. It has also been proposed that weight pruning strategy will govern the population initialization operation of GA which will be described in section “Problem Formulation”.

Need of Improvement

Recently, Medeiros and Barreto (Citation2013) have proposed a novel weight pruning methodology for (Multilayer Perceptron) MLP classifiers. The proposed method is based on the observation that relevant synaptic weights tend to generate higher correlations between error signals associated with the neurons of a given layer and the error signals propagated back to the previous layer, whereas non-relevant (i.e. prunable) weights tend to generate smaller correlations. This observation has been termed as MAXCORE principle by the authors and has been introduced as the guiding principle behind the Cross Correlation Analysis of Back Propagated Errors (CAPE) methodology. CAPE has emerged as one of the most significant weight pruning strategy reported recently. In terms of computational complexity, CAPE has outperformed OBS and its variants.

However, this paper contests the claim of generality of the MAXCORE principle (Medeiros and Barreto Citation2013). Let us analyze a hypothetical ANN trained with BP algorithm whose backward pass is depicted in .

Figure 1. Backward pass of backpropagation.

Let the errors generated by $M$ output neurons of the network presented with $N$ training examples be represented in the form of an $N \times M$ matrix.

(1)

E_{o} = [\begin{matrix} e_{1}^{o} (1) & e_{2}^{o} (1) & e_{3}^{o} (1) & . & . & . & e_{M}^{o} (1) \\ e_{1}^{o} (2) & e_{2}^{o} (2) & e_{3}^{o} (2) & . & . & . & e_{M}^{o} (2) \\ e_{1}^{o} (3) & e_{2}^{o} (3) & e_{3}^{o} (3) & . & . & . & e_{M}^{o} (3) \\ . & . & . & . & . & . & . \\ . & . & . & . & . & . & . \\ . & . & . & . & . & . & . \\ e_{1}^{o} (N) & e_{2}^{o} (N) & e_{3}^{o} (N) & . & . & . & e_{M}^{o} (N) \end{matrix}]

(1)

where each row of $E_{o}$ corresponds to the error generated by the output neurons for a given training pattern.

Similarly, backpropagated errors associated with $Q + 1$ hidden neurons of the same network can be represented in the form of an $N \times (Q + 1)$ matrix.

(2)

E_{h} = [\begin{matrix} e_{0}^{h} (1) \\ e_{0}^{h} (2) \\ e_{0}^{h} (3) \\ . \\ . \\ . \\ e_{0}^{h} (N) \end{matrix} \begin{matrix} e_{1}^{h} (1) \\ e_{1}^{h} (2) \\ e_{1}^{h} (3) \\ . \\ . \\ . \\ e_{1}^{h} (N) \end{matrix} \begin{matrix} e_{2}^{h} (1) \\ e_{2}^{h} (2) \\ e_{2}^{h} (3) \\ . \\ . \\ . \\ e_{2}^{h} (N) \end{matrix} \begin{matrix} . \\ . \\ . \\ . \\ . \\ . \\ . \end{matrix} \begin{matrix} . \\ . \\ . \\ . \\ . \\ . \\ . \end{matrix} \begin{matrix} . \\ . \\ . \\ . \\ . \\ . \\ . \end{matrix} \begin{matrix} e_{M}^{h} (1) \\ e_{M}^{h} (2) \\ e_{M}^{h} (3) \\ . \\ . \\ . \\ e_{M}^{h} (N) \end{matrix}]

(2)

The correlation matrix can be evaluated as:

(3)

C_{o h} [k, i] = \sum_{t = 1}^{N} e_{k}^{o} (t) e_{i}^{h} (t)

(3)

where $C_{o h} [k, i]$ is the entry in the correlation matrix corresponding to the scalar product of the kth column of $E_{o}$ with the ith column of $E_{h}$ .

Let us examine $C_{o h} [3, 2]$ and $C_{o h} [2, 2]$ for the network shown in .

Let us assume $C_{o h} [3, 2] > C_{o h} [2, 1]$ . According to the MAXCORE principle, $w_{32}$ is more relevant than $w_{21}$ if $\sum_{t = 1}^{N} e_{3}^{o} (t) e_{2}^{h} (t) > \sum_{t = 1}^{N} e_{2}^{o} (t) e_{1}^{h} (t)$ ,

where

(4)

e_{2}^{h} (t) = \sum_{k = 1}^{4} δ_{k} (t) w_{k 2} (t)

(4)

and

(5)

e_{1}^{h} (t) = \sum_{k = 1}^{4} δ_{k} (t) w_{k 1} (t)

(5)

As evident from above equations, higher or lower values of $e_{i}^{h} (t)$ depend not only upon $w_{k 1}$ but also upon local gradients $δ_{k}$ of the output neurons connected to the hidden layer neuron i. Since, local gradient of the output neuron k is given by:

(6)

δ_{k} (t) = e_{k}^{o} (t) φ^{'} (v_{k} (t))

(6)

where $v_{k} (t)$ is the induced local field and $φ^{'}$ is the differential of the activation function. Most of the time, a sigmoidal activation function is chosen which maps the synaptic weights $w_{k i} (t)$ nonlinearly in the form of induced local field passed through $φ^{'} (.)$ . Since, the contribution of $e_{i}^{h} (t)$ in the magnitude of correlation coefficient $C_{o h} [k, i]$ depends upon $δ_{k} (t)$ , which in turn are nonlinear functions of synaptic weights, $w_{k i} (t)$ , a high value of $C_{o h} [k, i]$ neither guarantees a high magnitude nor a high degree of relevance for $w_{k i} (t) .$ Furthermore, it is always better to define “relevance” of a synaptic weight either in a statistical sense or with respect to a performance metric. The standard practice of complexity regularization defines a risk function as:

(7)

R (w) = ξ_{s} (w) + λ ξ_{c} (w)

(7)

where $ξ_{s} (w)$ is the standard performance measure (e.g. MSE), and $ξ_{c} (w)$ is the complexity penalty term which depends upon the network model alone. As regularization parameter, $λ$ represents the relative importance of the complexity penalty term with respect to the MSE. Assuming $λ$ to have a moderate value somewhere between zero and infinity, the relative importance of complexity penalty term with respect to MSE is also moderate. Therefore, we can define “relevance” of individual synaptic weights both in terms of MSE and in terms of complexity penalty. However, to compute the relevance of weights in terms of $ξ_{s} (w)$ , we need to run BP algorithm first (at least forward pass). For the networks which have not yet undergone training, it would be better to first compute $ξ_{c} (w)$ in terms of complexity penalty. In the light of the above discussion, novel contributions of this work can be enlisted as follows:

This work seeks to define relevance of the weights in a statistical sense.
A coefficient of dominance has been proposed to identify nodes carrying higher information content and to determine complexity penalty for each weight.
Use of GA to optimize fittest weights in terms of MSE.
New schemes for initialization of population and mutation have been proposed.

Problem Formulation

The Proposed Methodology

The proposed methodology has been implemented in a sequence of several steps as illustrated in . As an illustration of the first stage of the methodology, let us consider an ANN with single hidden layer and three target classes subjected to IRIS data as input. The number of hidden layer neurons has been chosen to be three. As shown in , there are a total of 27 connections including 20 weights and 7 biases. All the connections have been initialized randomly. The connections would be evaluated as a part of weight pruning/modification scheme which is described in detail with analytical justifications in the next section. All the 27 connections have been encoded in the form of a chromosome string shown in . Many such chromosomes have been generated to initialize the mating pool. Each weight has been encoded using 10 bits. With this step, GA optimization of ANN weights commences. Subsequent steps follow the standard GA flow with crossover, mutation, and ranking of individuals which ultimately yields the “fittest” set of weights. The details of all individual stages of the methodology are described in the implementation section.

Figure 2. The proposed methodology.

Figure 3. ANN architecture for IRIS data.

Figure 4. Weights and biases of represented as a chromosome.

Data Description

The simulations have been carried out on several benchmark data sets obtained from UCI machine learning repository. All the data sets chosen for this study represent pattern classification problems with varying number of samples, input dimensions, and target classes details of which is shown in . During the course of simulation, the number of training and test samples was kept equal in most of the cases except in the case of letter recognition where the results varied significantly when the number of test samples was changed.

Table 1. Data description.

Download CSV Display Table

Implementation

The Weight Pruning Scheme

The objective of pruning is to downsize the model to the level of least complexity that still provides the best generalization. In general, the capability of feedforward neural networks for data fitting increases as the number of connection weights increase, because an increase in the number of free parameters enables the network to finely capture subtle features. However, a model with too many parameters can pick up nonessential information in the sample data and cause the problem of overfitting. This leads to poor generalization. Therefore, application of appropriate pruning techniques can contribute to improvement in the generalization ability. Furthermore, a reduction in the network components is advantageous for hardware implementation of large-sized ANNs. The proposed idea seeks to incorporate weight pruning scheme into the conventional GA flow. The weights which are insensitive to the error changes can be discarded after each step of training. The pruned network is of smaller size and is likely to give higher accuracy than before its trimming. ANN use network pruning strategies which can be classified into two categories viz. weight decay and weight elimination. In both the strategies, a complexity penalty term is defined which appropriately quantifies the degree of redundancy in a weight. In weight decay procedure, if the initial weight vector is w, the complexity penalty term is defined as the squared norm of the weight vector w (i.e. all the free parameters) in the network and is expressed as:

(8)

ξ_{c} (w) = w^{2} = \sum_{i \in ζ_{t o t a l}} {w_{i}}^{2}

(8)

where the set $ζ_{t o t a l}$ refers to all the synaptic weights in the network. However, in this scheme, some weights in the network have relatively larger values than others. This results in making smaller weights redundant since they have little influence on the training outcome. To overcome this limitation, the complexity penalty term is now redefined as follows:

(9)

ξ_{c} (w) = \sum_{i \in ζ_{t o t a l}} (\frac{{(\frac{w_{i}}{w_{o}})}^{2}}{1 + {(\frac{w_{i}}{w_{o}})}^{2}})

(9)

where w_o is the preassigned weight change parameter defined according to the problem and w_i refers to the ith weight in the network. It is proposed here that the optimality of a weight vector would be defined in the statistical sense. Higher the complexity penalty for a particular weight, more important is the weight for the learning process. If the weights converging on a particular hidden neuron have variance more than the variance of total weights in the network, then that hidden node will contribute more to the network. The real challenge in this case lies in selection of an appropriate $w_{o}$ . As can be inferred from , the choice of parameter $w_{o}$ is crucial aspect of network pruning.

Figure 5. Variation of complexity penalty term with respect to $w_{i} / w_{o}$ (Haykin Citation2007).

In this study, a new pruning method has been proposed based on the weight variation information during the backpropagation training. Every node in the hidden layer is evaluated to determine the effective variance of weights. In order to compute the complexity penalty term as defined in Equation (9) for each weight, a coefficient of dominance called D_si is calculated for each neuron and is expressed as:

(10)

D_{s i} = \frac{\overline{V_{t o t a l}}}{V_{s i}}

(10)

where D_si corresponds to w_o of Equations. (9) and (10), $V_{S_{i}}$ is the variance of the weights of ith hidden node, and ${\overline{V}}_{t o t a l}$ is the effective variance of all the weights in the hidden layer.

In the similar way, all the remaining hidden layer neurons are evaluated to find the coefficient of dominance and a set $D$ is defined as:

(11)

D = \{D_{S_{1}}, D_{S_{2}}, D_{S_{3}} \dots \dots \dots D_{S_{i}}\}

(11)

The complexity penalty for each weight in the network is defined as:

(12)

ξ_{w i} = \frac{{(w_{i} / D_{S_{i}})}^{2}}{1 + {(w_{i} / D_{S_{i}})}^{2}}

(12)

The variance dependent weight pruning technique described above helps to increase the information content of the weight set convergent on a less dominant neuron. This can be explained in terms of a metric called Fisher information which is defined as a way of measuring the amount of information that an observable random variable X carries about an unknown parameter θ of a distribution that models X and can be stated as:

(13)

Γ (θ) = E [{(\frac{\partial ln (f (X; θ))}{\partial θ})}^{2} | θ]

(13)

where log is the natural logarithm of the probability function.

The input to the hidden node is given by:

(14)

y_{j} = w_{j 1} x_{1} + w_{j 2} x_{2} + \dots w_{j i} x_{} + b_{i}

(14)

where $x_{i}$ is the ith input and $w_{j i}$ is the weight from ith input to jth hidden node. Also, the weights leading to a hidden node are regarded as zero mean random variables. Therefore, the mean of the input to the hidden node is given by:

(15)

E \{y_{j}\} = E \{\sum_{i = 0}^{n} w_{j i} x_{i}\} = 0

(15)

Since, the weights are randomly chosen, they can be considered independent of the input feature. Thus, Equation (9) can be written as:

(16)

E \{y_{j}\} = \sum_{i = 0}^{n} E \{w_{j i}\} E \{x_{i}\} = 0

(16)

The variance of y is given by:

(17)

σ_{y}^{2} = E \{{(y_{j})}^{2}\} - E^{2} \{(y_{j})\}

(17)

Using Equation (15) in (17), we get,

(18)

σ_{y}^{2} = E \{{(\sum_{i = 0}^{n} w_{j i} x_{i})}^{2}\}

(18)

Since the different weights and input feature to a hidden node are independent, Equation (18) becomes

(19)

σ_{y}^{2} = \sum_{i = 0}^{n} E \{{(w_{j i})}^{2}\} E \{{(x_{i})}^{2}\} \forall j

(19)

If the training samples are normalized to be in the interval [0:1], then

(20)

E \{{(x_{i})}^{2}\} = E^{2} \{(x_{i})\} + σ_{x}^{2} (x_{i})

(20)

where $σ_{x}^{2} (x_{i})$ is variance of random variable $x_{i}$ . For a uniform random variable lying within [0:1],

(21)

E \{{(x_{i})}^{2}\} = \frac{1}{3}

(21)

Also, input to hidden layer weight is also regarded as a random variable with zero mean, uniformly distributed in the interval [–a, a]. Hence, the standard deviation of the input to a hidden layer weight is:

(22)

E \{{(_{j i})}^{2}\} = {\frac{a^{2}}{3}}_{}

(22)

So, using Equations (13), (15), and (16))

(23)

σ_{y}^{2} = \frac{N a^{2}}{9}

(23)

where N denotes the number of weights converging on a particular node.

Equations (19) and (23) justify that input to a hidden layer neuron is also a random variable with zero mean and $\frac{N a^{2}}{9}$ variance. The probability distribution function of the input to hidden layer node is given as:

(24)

f (μ, σ) = \frac{1}{\sqrt{2 π σ}} e^{\frac{- {(y - μ)}^{2}}{2 σ^{2}}}

(24)

where y corresponds to induced local field of jth hidden layer neuron and can also be defined as:

(25)

y_{j} = \sum_{i = 0}^{n} w_{j i} x_{i}

(25)

where x_i is the data input and w_ji is the weight connecting from ith input node to jth hidden node .

According to Equations (23) and (24), the information content about the weight set in the hidden layer neuron which is the above-defined random variable can also be calculated as follows:

(26)

l n f = l n (\frac{1}{\sqrt{2 π σ}} e^{\frac{- {(y_{j})}^{2}}{2 σ^{2}}}) = l n \frac{1}{\sqrt{2 π σ}} - \frac{y_{j}^{2}}{2 σ^{2}}

(26)

(27)

{(\frac{\partial l n f}{\partial w})}^{2} = \frac{y_{j}^{2} x_{i}^{2}}{σ^{4}}

(27)

Using Equation (13), the fisher information is given as:

(28)

Γ (θ) = E [\frac{y_{j}^{2} x_{i}^{2}}{σ^{4}}] = \frac{1}{σ^{4}} E [y_{j}^{2}] E [x_{i}^{2}]

(28)

Using Equations (12), (15), and (17), we get:

(29)

Γ (θ) = \frac{N a^{2}}{27 σ^{4}}

(29)

Let the variance term be replaced by the coefficient of dominance (which is a simple ratio of the variances) and consider two different hidden nodes with $V_{s 1}$ and $V_{s 2}$ as respective variances of weights convergent on them. Then, the fisher information for the two nodes is given by:

(30)

Γ_{1} (θ) = \frac{N a^{2}}{27} {(\frac{V_{s 1}}{V_{t o t a l}})}^{4} a n d Γ_{2} (θ) = \frac{N a^{2}}{27} {(\frac{V_{s 2}}{V_{t o t a l}})}^{4}

(30)

If $V_{s 1} > V_{s 2},$ Fisher information of node 1 will be more compared to that of node 2. This implies that a lower coefficient of dominance which makes the complexity penalty higher (and the weight more relevant) also enhances the information content of the convergent connections at a node.

The inclusion of variance in the complexity penalty term and the particular choice of coefficient of dominance are justified by the above explanation.

Genetic Optimization of Weights

Working of Genetic Algorithm

GAs are paradigms for optimization based loosely on several features of biological evolution. Initially, a pool of solutions (population) represented by chromosomes is created. Then, an encoding scheme is used to represent each individual. An evaluation/fitness function is required that provides ranking to each chromosome. The population undergoes reproduction until stopping criteria are met (Sivanandam and Deepa Citation2007).

The basic GA shown in the form of is explained as follows:

Figure 6. The Genetic Algorithm.

Generate random population of chromosomes, that is, suitable solutions for the problem.
Fitness of each chromosome in the population is evaluated.
The population undergoes reproduction followed by a number of iterations of the following three steps:
- Parents are selected to reproduce according to their fitness.
- Different operators are applied to the parents to form new offspring.
- Place new offspring in the new population.
Use new generated population for a further run of the algorithm.
If the end criterion is satisfied, return the best solution to the current population.
Go to step 2.

Why Genetic Algorithm?

The proposed scheme not only prunes the weights according to the fitness function, it modifies them in accordance with the complexity penalty. The complexity penalty-based modification function yields an initial population of candidates which constitutes the solution space. GA is now used to evaluate the candidate’s fitness through crossover and mutation in successive generations. Actual pruning is accomplished only after the algorithms zeros in on the least fit weights. Weight pruning completely alters the architecture of an ANN. The altered architecture may be better than the original one in terms of convergence and error performance. However, we have no means of judging its optimality before the network is actually trained. From the point of view of custom hardware implementation, repeated alterations in the architecture of a network are highly undesirable and it leads to gross wastage of resources. The use of GA in the subsequent stage ensures efficient search of the solution space to identify one or more “good solutions” though not necessarily the best one. However, through successive generation cycles, it is not hard to identify the least fit (prunable) weight.

The Strategy

The strategy is depicted in the form of a flow chart in . As the first step initialization of population is done. Unlike the conventional method of random initialization we adopt a methodology which modifies the initial weights according to the complexity penalty to generate the mating pool in a controlled manner as described below.

Figure 7. Genetic optimization of ANN weights.

The initialized population of size $K$ is given by a set

(31)

P = \{p_{i}^{* j}\} j = 1, 2, 3, . . K

(31)

Also,

(32)

p_{i}^{* j} = A * w_{i}

(32)

where $A$ is a $1 \times K$ vector of multipliers

if $ξ_{w i} > 0.5$ , then $A$ lies in the range [0.9, 1.3].

If $0.2 < ξ_{w i} \leq 0.5$ , then $A$ is a $1 \times K$ vector of random multipliers.

If $0 < ξ_{w i} \leq 0.2$ , then $A$ lies in the range [0.2, 0.5].

If $ξ_{w i} < 0.2$ , then $w_{i}$ is pruned.

Multiplication of an initial weight $w_{i}$ by $A$ generates $K$ (in our case, $K$ = 20) different weights which constitute the population after bit string encoding. It should be noted that weights with a higher complexity penalty (more relevance) generate a population with “high” complexity penalties (due to range of $A$ ). Similarly, non-relevant weights generate a population with “low” complexity penalties. By initializing the population in this manner, we are restricting the extremities of the search space to a specified range, thereby reducing the chances of the algorithm running a futile search and getting stuck in local minima. However, the necessity of randomness in initialization cannot be completely overlooked since it is essential to include largest possible number of potential solutions in the search space. To ensure this, we have generated random population from the weights having “moderate” complexity penalty.

The fitness function is defined as $\frac{1}{M S E}$

(33)

w h e r e, M S E = \frac{1}{2} \sum_{p = 1}^{N} {(t^{p} - y^{p})}^{2}

(33)

Here, t and y represent the target and calculated outputs, respectively, and p signifies the number of training patterns. Fitness of each chromosome is evaluated by running the forward pass of BP algorithm. The chromosomes are ordered according to their fitness value using rank selection. Reproduction of new offspring is done by selecting fittest parents from the mating pool and subjecting them to two point restricted crossover.

Each generation cycle witnesses addition of the offspring to the mating pool if its fitness is more than that of one of its parents. In this case, the least fit chromosome is discarded and the offspring is ranked according to its fitness value. However, the offspring is discarded if it is not fitter than any of its parents. In both cases, third and fourth fittest chromosomes are mutated with a small mutation probability (0.06–0.09). If the mutated chromosomes are fitter than the original one, they are added to the mating pool and ranked accordingly and two least fit chromosomes are discarded. This heuristic step of mutating third and fourth rank chromosomes is like giving a “second chance” to some athlete taking part in Olympics and missing the bronze medal by a whisker. If mutation with a small probability increases the fitness then both original and mutated weights should be part of the mating pool in the next generation cycle. After 25 generation cycles, the average increase in fitness of weights was no longer significant employing the proposed scheme. The fitness values of the weights obtained using the proposed technique were compared with that obtained using conventional GA (random initialization) and it was verified that partially random initialization and mutation of third and fourth fitted chromosomes yielded fitter individuals in significantly less number of generation cycles.

Training the Pruned ANN

After 25 generation cycles of the GA, the pruned ANN is reinitialized with the stored weights. A single hidden layer neural network was trained using two different versions of BP algorithm for classifying five benchmark data sets. The number of hidden layer neurons was optimized experimentally and different architectures were found optimum for different data set. The network was first trained using traingdm function of Matlab. traingdm was chosen as the training function because it gives the highest degree of freedom for the investigator to fine tune the training parameters since both learning rate and momentum constant can be varied experimentally using this function. The network was also trained with Levenberg–Marquardt algorithm using trainlm function of MATLAB. Since, Levenberg–Marquardt algorithm is known to outperform simple gradient descent and other conjugate gradient methods in a plentitude of problems, it was necessary to examine whether the proposed technique is able to improve the performance of Levenberg–Marquardt algorithm. A 10-fold cross validation scheme was used to prevent over fitting. The training phase performance of the network in was evaluated with respect to four parameters viz. convergence success rate (% of time the simulation reaches error goal), mean epochs, execution time, and computational complexity. However, the generalization performance is the ultimate performance metric which gives the classification success rate. All the parameters have been evaluated both for unpruned and pruned networks.

Results and Discussions

Convergence and Generalization

The training phase performances of the conventional ANN and GA-optimized ANN trained with traingdm are given in & , respectively. In , results are shown for the networks not subjected to GA-based weight optimization. It can be observed that the pruned architectures perform marginally better than their unpruned counterparts for all the three training parameters. A comparison of and reveals significant improvements in the performance of the ANN when the weights are optimized using GA. Furthermore, better performance of the pruned architectures can’t be ignored in this case also. The networks were then trained till convergence and subsequently presented with the test samples. and depict the error surface obtained using conventional BP and proposed algorithm, respectively. As evident from the plots, the error surface obtained using the proposed technique spans a larger area of the plot with multiple error minima. This indicates more consistent performance of the network initialized with GA-determined weights at almost all values of learning rate and momentum constant. The effect of the proposed techniques on Levenberg–Marquardt algorithm has been investigated. The pruned networks have again performed marginally but consistently better with trainlm as shown in . Although trainlm by default exhibits higher converge success and a lower epoch mean, GA-optimized weights have improved these parameters as can be observed in .

Figure 8. Surface plot for error obtained for the network trained using conventional backpropagation.

Figure 9. Surface plot for error obtained for the network trained using GA-optimized backpropagation.

The reflection of better convergence in the training phase on the generalization performance was as expected and the results are shown in and for traingdm and in and for trainlm, respectively. It is encouraging to note that magnitude of improvement in the performance matrices is more significant in complicated architectures e.g. “letter recognition-1 and 2”. Intuitively, more complex architectures have larger number of weights and hence more candidates in the mating pool leading to the fittest possible selection.

Table 6. Generalization performance of conventional ANN (traingdm).

Download CSV Display Table

Table 7. Generalization performance of GA Optimized ANN (traingdm).

Download CSV Display Table

Table 8. Generalization performance of conventional ANN (trainlm).

Download CSV Display Table

Table 9. Generalization performance of GA-optimized ANN (trainlm).

Download CSV Display Table

Computational Cost

The computational cost (complexity) is an important issue to be addressed during evaluation of algorithms. Sometimes, the algorithm’s effectiveness in providing a solution to a given problem requires the execution of complex computations and the use of excessive memory resources, which can create severe difficulties in real-time or low-cost applications. Let us analyze first the computational complexity of the weight pruning stage described in section “The weight pruning scheme.” Since we have developed the weight pruning scheme as an alternative to CAPE, we would now compare the computational cost for these two methodologies. and enlist the number of operations required by the proposed scheme and CAPE, respectively, as a function of number of input samples (N), input attributes (P), hidden neurons (Q), and output neurons (M). The number of operations required by the proposed methodology and CAPE was computed considering an MLP with N = 140 input samples, P = 2 input attributes, M = 1 output neuron, and the number of hidden neurons Q varying from 1 to 10. The same MLP was taken as reference by the inventors of CAPE (Medeiros and Barreto Citation2013).

Table 10. Computational operations required by the proposed method.

Download CSV Display Table

Table 11. Computational operations required by CAPE method (Medeiros and Barreto Citation2013).

Download CSV Display Table

It can be easily observed from the tables that the proposed scheme requires lesser number of multiplication operations than CAPE. This is due to the fact that CAPE requires calculation of correlation coefficients to determine the relevance of weights, whereas we define relevance in terms of complexity penalty which involves fewer operations. Furthermore, unlike the proposed technique, CAPE involves running backward pass of BP which in turn involves computations through a nonlinear activation function incurring additional cost as shown in the last row of the tables. shows how the number of operations varies as a function of number of weights (connections). For the sake of simplicity, all the operations are considered to have same computational cost. From the graph, it is apparent that the proposed scheme demands lesser resources than the CAPE algorithm. It is also clear from that using proposed weight pruning stage, there is substantial reduction in the computational complexity and this reduction tends to become consistent with increasing number of connections.

Table 12. Percentage reduction in complexity by the proposed algorithm.

Download CSV Display Table

Figure 10. Estimated costs for CAPE and proposed scheme when pruning an MLP.

presents the stage-wise breakup of computations up to the commencement of GA. As we know, the computational complexity of GA is of the order $O (G * n * b)$ , Where $G$ , $n$ , and $b$ are number of generations, population size, and bit string length, respectively. In our case, this stage gives a complexity of $O (25 * 20 * 10)$ . In addition to it, evaluation of fitness for each weight requires running of forward pass of BP once per generation cycle which requires $O (\frac{P * M * N * G}{2})$ operations. BP without GA requires $O (P * M * N * E)$ operations, where $E$ is the number of epochs. This means savings in terms of mean epochs is going to decide which technique consumes lesser number of operations. It is evident from –, $E > \frac{G}{2}$ for most of the architectures and even for Levenberg–Marquardt algorithm which takes much fewer epochs to converge.

Table 13. Stage wise break-up of computations.

Download CSV Display Table

Example: Pruned Network A: Trained with conventional trainlm (IRIS data). Epoch mean = 50.

Pruned Network B: Trained with GA optimized trainlm (IRIS data). Epoch mean = 30.

Computational complexity of network A = $O (P * M * N * E)$ = O $(4 * 3 * 150 * 50)$ = $O (90, 000)$

Computational complexity of network B = $O (\frac{4 * 3 * 150 * 25}{2}) + O (4 * 3 * 150 * 30) + O (25 * 20 * 10)$ $= O (81, 500)$ .

Thus, the savings in computations even for marginal improvement in epoch mean due to GA optimization is evident and this has been observed more prominently in complex architectures which take large number of epochs to converge.

Conclusion

The results presented in the previous section confirm the efficacy of the proposed technique in terms of convergence, execution time, and generalization performance. It is evident that both weight pruning and GA optimization stages play their respective parts and contribute significantly to improvement in performance of the BP-trained ANN. This improvement is reflected not only in ANN trained with traditional gradient descent with momentum term (traingdm) but also with a more advanced algorithm like Levenberg–Marquardt (trainlm). Reduction in the epochs required for convergence indicates that GA search about the weight space has yielded near optimal solutions. Initialization of population based upon complexity penalty as described in section “Implementation” seems to have paid dividend. Subsequently, the mutation of third- and fourth-ranked individuals (described as MAD algorithm) in the mating pool makes the whole process more competitive since the mutation operator seeks to increase the appropriateness of the weights in a statistical sense. Evaluation with the MSE-based fitness function also points toward a correlation between statistical “relevance” of weights and their fitness in terms of lower MSE. The computational complexity of the proposed weight pruning stage has been proven to be better than the existing weight pruning strategy CAPE. Also, the classification results obtained reveal the efficacy of the proposed technique. It can also be concluded that this approach can serve as an efficient and viable alternative to conventional neuro-genetic design of neural classifiers. Future work may be in the direction of further reducing the execution time of the proposed algorithm.

Disclosure Statement

The authors report no conflicts of interest. The authors alone are responsible for the content and writing of the paper. The Manuscript has not been published elsewhere and it has not been submitted simultaneously for publication elsewhere.

Additional information

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

References

Aleksander, I., and J. Taylor. 1992. Pruning neural nets by genetic algorithm. Artificial neural networks 2. International Conference on Artificial Neural Networks (ICANN-92). Brighton, United Kingdom: Elsevier Science Publishers.
Google Scholar
Asmaa, A. N., A. T. Erdogan, and T. Arslan. 2013. Adaptive three-dimensional cellular genetic algorithm for balancing exploration and exploitation processes. Soft Computing 17:1145–57. doi:10.1007/s00500-013-0990-1.
Web of Science ®Google Scholar
Augasta, M. G., and T. Kathirvalavakumar. 2015. Pruning algorithms of neural networks – a comparative study. Central European Journal of Computer Science 3 (3):105–15.
Google Scholar
Bull, L. 2001. On Co-evolutionary genetic algorithms. Soft Computing 5:201–07. doi:10.1007/s005000100082.
Google Scholar
Christiansen, N., J. Job, K. Klyver, and J. Hogsberg. 2012. Optimal brain surgeon on artificial neural networks in nonlinear structural dynamics. 25th Nordic seminar on computational mechanics. Lund, Sweden.
Google Scholar
Haupt, S. E., and R. L. Haupt. 2011. A mixed integer genetic algorithm used in biological and chemical defense applications. Soft Computation 15 (1):51–59. doi:10.1007/s00500-009-0516-z.
Web of Science ®Google Scholar
Haykin, S. 2007. Neural network and learning machines. India: Prentice-HALL.
Google Scholar
Hu, L., L. Qin, K. Mao, and W. Chen. 2015. Optimization of neural network by genetic algorithm for flowrate determination in multipath ultrasonic gas flowmeter. IEEE Sensors Journal 16 (5):1158–67. doi:10.1109/JSEN.2015.2501427.
Web of Science ®Google Scholar
Hunter, A., and K. S. Chiu. 2000. Genetic algorithm design of neural network and fuzzy logic controllers. Soft Computation 4:186–92. doi:10.1007/s005000000050.
Google Scholar
Karlra, P., and N. R. Prakash. 2003. A neuro-genetic algorithm approach for solving the inverse kinematics of robotic manipulators. IEEE Conference on Systems, Man and Cybernetics 2. Washington, USA: IEEE Publishers.
Google Scholar
Karthikeyan, P., S. Baskar, and A. Alphones. 2012. Improved genetic algorithm using different genetic operator combinations (GOCs) for multicast routing in ad hoc networks. Soft Computing 17 (9):1563–72. doi:10.1007/s00500-012-0976-4.
Web of Science ®Google Scholar
Kingston, G., H. Maier, and M. Lambert. 2004. A statistical input pruning method for artificial neural networks used in environmental modeling. Biennial Meeting of the International Environmental Modelling and Software Society. Osnabruck, Germany: iEMSs Publisher.
Google Scholar
Kumar, P., D. Gospodaric, and P. Bauer. 2007. Improved genetic algorithm inspired by biological evolution. Soft Computation 11 (10):923–41. doi:10.1007/s00500-006-0143-x.
Web of Science ®Google Scholar
Kwon, Y. K., and B. R. Moon. 2007. A hybrid neurogenetic approach for stock forecasting. IEEE Trans. Neural Network 18 (3):851–64. doi:10.1109/TNN.2007.891629.
Web of Science ®Google Scholar
Libelli, S., Marsili, and P. Alba. 2000. Adaptive mutation in genetic algorithms. Soft Computation 4 (2):76–80. doi:10.1007/s005000000042.
Google Scholar
Lin, C. T., M. Prasad, and A. Saxena. 2015. An improved polynomial neural network classifier using real-coded genetic algorithm. IEEE transactions on systems. Man, and Cybernetics: Systems 45 (11):1389–401.
Web of Science ®Google Scholar
Ling, S. H., and F. H. F. Leung. 2007a. An improved genetic algorithm with average-bound cross over and wavelet mutation operations. Soft Computation 11 (1):7–31. doi:10.1007/s00500-006-0049-7.
Web of Science ®Google Scholar
Ling, S. H., F. H. F. Leung, and H. K. Lam. 2007b. Input-dependent neural network trained by real-coded genetic algorithm and its industrial applications. Soft Computation 11 (11):1033–52. doi:10.1007/s00500-007-0151-5.
Web of Science ®Google Scholar
Louis, S. J. 2005. Genetic learning for combinational logic design. Soft Computation 9 (1):38–43. doi:10.1007/s00500-003-0332-9.
Web of Science ®Google Scholar
Mantzaris, D., G. Anastassopoulos, and A. Adamopoulos. 2011. Genetic algorithm pruning of probabilistic neural networks in medical disease estimation. Neural Networks 24 (8):831–35. doi:10.1016/j.neunet.2011.06.003.
PubMed Web of Science ®Google Scholar
Medeiros, C. M., and G. A. Barreto. 2013. A novel weight pruning method for MLP classifiers based on the MAXCORE principle. Neural Computing and Applications 22 (1):71–84. doi:10.1007/s00521-011-0748-6.
Web of Science ®Google Scholar
Mesquita, D. P., J. Gomes, R. L. Rodrigues, and R. K. Galvao. 2015. Pruning extreme learning machines using the successive projections algorithm. IEEE Latin America Transactions 13 (12):3974–79. doi:10.1109/TLA.2015.7404935.
Web of Science ®Google Scholar
Ramasubramanian, P., and A. Kannan. 2006. A genetic-algorithm based neural network short-term forecasting framework for database intrusion prediction system. Soft Computation 10 (8):699–714. doi:10.1007/s00500-005-0513-9.
Web of Science ®Google Scholar
Reed, R. 1993. Pruning algorithms-A survey. IEEE Transactions on Neural Networks 4 (5):740–47. doi:10.1109/72.248452.
PubMed Web of Science ®Google Scholar
Sabo, D., and X. Yu. 2008. A new pruning algorithm for neural network dimension analysis. IEEE World Congress on computational Intelligence Proc. Hong Kong, China.
Google Scholar
Samanta, B., K. R. Al-Balushi, and S. A. Al-Araimi. 2006. Artificial neural networks and genetic algorithm for bearing fault detection. Soft Computation 10 (3):264–71. doi:10.1007/s00500-005-0481-0.
Web of Science ®Google Scholar
Sivanandam, S. N., and S. N. Deepa. 2007. Introduction to Genetic Algorithms. Berlin, Heidelberg: Springer Science & Business Media.
Google Scholar
Srivastava, A. K., K. K. Shukla, and S. K. Srivastava. 1998. Exploring neuro-genetic processing of electronic-noise data. Microelectronics Journal 29 (11):921–31. doi:10.1016/S0026-2692(98)00056-1.
Web of Science ®Google Scholar
Tan, F., X. Fu, Y. Zhang, and A. G. Bourgeois. 2008. A genetic algorithm-based method for feature subset selection. Soft Computation 12 (2):111–20. doi:10.1007/s00500-007-0193-8.
Web of Science ®Google Scholar
Tong, Z., and G. Tanaka. 2015. A pruning method based on weight variation information for feedforward neural networks. 4th IFAC Conference on Analysis and Control of Chaotic Systems 48(18): 221–226. Elsevier Science Publishers
Google Scholar
UCI machice learning repository http://archive.ics.uci.edu/ml/datasets
Google Scholar
Yang, S. H., and Y. P. Chen. 2012. An evolutionary constructive and pruning algorithm for artificial neural networks and its prediction applications. Neurocomputing 86 (1):140–49. doi:10.1016/j.neucom.2012.01.024.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Download PDF

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Your download is now in progress and you may close this window

Did you know that with a free Taylor & Francis Online account you can gain access to the following benefits?

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

Have an account?
Login now Don't have an account?
Register for free

Login or register to access this feature

Have an account?
Login now Don't have an account?
Register for free

Choose new content alerts to be informed about new research of interest to you
Easy remote access to your institution's subscriptions on any device, from any location
Save your searches and schedule alerts to send you new results
Export your search results into a .csv file to support your research

A Neuro-Genetic Technique for Pruning and Optimization of ANN Weights

ABSTRACT

Introduction

Related Work

Need of Improvement

Problem Formulation

The Proposed Methodology

Data Description

Table 1. Data description.

Implementation

The Weight Pruning Scheme

Genetic Optimization of Weights

Working of Genetic Algorithm

Why Genetic Algorithm?

The Strategy

Training the Pruned ANN

Results and Discussions

Convergence and Generalization

Table 2. Training phase performance of conventional ANN (traingdm).

Table 3. Training phase performance of GA-optimized ANN (traingdm).

Table 4. Training phase performance of conventional ANN (trainlm).

Table 5. Training phase performance of GA-optimized ANN (trainlm).

Table 6. Generalization performance of conventional ANN (traingdm).

Table 7. Generalization performance of GA Optimized ANN (traingdm).

Table 8. Generalization performance of conventional ANN (trainlm).

Table 9. Generalization performance of GA-optimized ANN (trainlm).

Computational Cost

Table 10. Computational operations required by the proposed method.

Table 11. Computational operations required by CAPE method (Medeiros and Barreto Citation2013).

Table 12. Percentage reduction in complexity by the proposed algorithm.

Table 13. Stage wise break-up of computations.

Conclusion

Disclosure Statement

References

Information for

Open access

Opportunities

Help and information

A Neuro-Genetic Technique for Pruning and Optimization of ANN Weights

ABSTRACT

Introduction

Related Work

Need of Improvement

Problem Formulation

The Proposed Methodology

Data Description

Table 1. Data description.

Implementation

The Weight Pruning Scheme

Genetic Optimization of Weights

Working of Genetic Algorithm

Why Genetic Algorithm?

The Strategy

Training the Pruned ANN

Results and Discussions

Convergence and Generalization

Table 2. Training phase performance of conventional ANN (traingdm).

Table 3. Training phase performance of GA-optimized ANN (traingdm).

Table 4. Training phase performance of conventional ANN (trainlm).

Table 5. Training phase performance of GA-optimized ANN (trainlm).

Table 6. Generalization performance of conventional ANN (traingdm).

Table 7. Generalization performance of GA Optimized ANN (traingdm).

Table 8. Generalization performance of conventional ANN (trainlm).

Table 9. Generalization performance of GA-optimized ANN (trainlm).

Computational Cost

Table 10. Computational operations required by the proposed method.

Table 11. Computational operations required by CAPE method (Medeiros and Barreto Citation2013).

Table 12. Percentage reduction in complexity by the proposed algorithm.

Table 13. Stage wise break-up of computations.

Conclusion

Disclosure Statement

Additional information

Funding

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date