756
Views
6
CrossRef citations to date
0
Altmetric
Articles

Analysis of the Ebola Outbreak in 2014 and 2018 in West Africa and Congo by Using Artificial Adaptive Systems

, , , , &

ABSTRACT

In this manuscript the Ebola outbreaks in 2014 and 2018 have been studied. On March 23, 2014, the World Health Organization announced the beginning of the Ebola outbreak in West Africa. The initial location was in a forested area in the south eastern portion of Guinea. We used three different methods to determine the origin of the outbreak. The first was a suite of artificial adaptive systems called Topological Weighted Centroid which located the outbreak origin at Longitude: −10.5337, Latitude: 8.1517. This area is 64 km from Guekedou, Guinea. We also used a Dynamic Naive Bayesian/Dynamic Networks Block Algorithm. The Bayesian algorithm shows the main source of the Ebola outbreak at Kissidougou. Both of these methods revealed the outbreak started in the forested area southeast of Guinea. The distance between Guekedou and Kissidougou is about 69 km. Furthermore, we used an artificial neural network (ANN) called Selfie to predict the outbreak diffusion. The Ebola outbreak in May 2018 in Democratic Republic of Congo was not as widespread as the outbreak in 2014. The outbreak was effecting the health zones of Bikoro and Iboko, and Wangata in Congo. We have used an ANN algorithm and predicted the origin of the outbreak at (Longitude: 18.3046, Latitude: −0.6865) in the Equator about 20 km at North-West of Bikoro.

Introduction

The initial WHO epidemiologic investigation revealed the first (index) case was a 2-year-old child from Meliandou (a village in Gueckedou prefecture in Nzerekore region of southern Guinea), Longitude: −10.061, Latitude: 8.616. According to the initial investigation the child died on December 6, 2013 but a secondary investigation confirmed the death of a child at the end of December. After the death of the index case, the sister, mother, grandmother of the child and a nurse, became victims 2–5, died on December 29, 2013, December 13, 2013, January 1, 2014, and January 29, 2014, respectively. Evidently, the fourteenth patient, a health-care worker from Gueckedou, caused the spread of the virus to Macenta, Nzerekoré, and Kissidougou in February 2014. (Sylvain et al. Citation2014)

The World Health Organization (WHO) was alerted of the Ebola outbreak in Guinea on March 23, 2014. The Ministry of Health (MoH) of Guinea notified the WHO of the rapidly spreading Ebola outbreak in the forested area south eastern of Guinea with a total of 49 cases including 29 deaths by March 22, 2014. The Ebola cases reported were in Guekedou (Longitude: −10.1336, Latitude: 8.5674), Macenta (Longitude: −9.4710, Latitude: 8.5435), Nzerekore (Longitude: −8.8238889, Latitude: 7.7472) and Kissidougou districts (Longitude:-10.1000, Latitude: 9.1833) (WHO Ebola Response Team Citation2014).

The recent (May 2018) Ebola outbreak was reported by WHO. According to the WHO reports the confirmed cases were in Bikoro and Iboka health zones, and in Wangata, one of the three health zones of Mbandaka (Ebola Virus Disease Citation2018; Ebola virus disease-Democratic Republic of the Congo, Disease outbreak news Citation2018; Update on the Ebola outbreak in Democratic Republic of Congo, Medicines Sans Frontiers, Doctor without Borders, May 2018).

Different approaches have been used to study the Ebola outbreak such as artificial adaptive systems, a fixable spatiotemporal growth, and deterministic model, a Markov model to estimate the weekly average number of secondary cases and using Mirador to find the most informative clinical factors related to the Ebola virus Diseases (Buscema Citation2015; Clubri et al. Citation2016; Santermans et al. Citation2016; Wayne et al. Citation2015).

Three algorithms in this effort have been used to validate the final results. All of these algorithms have been developed by the Semeion research center. We used the Topological Weighted Centroid (TWC) algorithm to find the possible location of the outbreak’s origin and analyze the dynamics of disease spread. The TWC suite of algorithms is based on statistical thermodynamics. They have been tested in other cases such as the E-coli outbreak in Germany (Buscema et al., Citation2009; Buscema et al. Citation2009, Citation2013; Grossi, Buscema, and Jefferson Citation2009). The details and the new development in TWC algorithm with some applications also have been discussed in (Buscema, Massini, and Sacco Citation2017). The Dynamic Naive Bayesian/Dynamic Networks Block (DNB) Algorithm was used to find the cause and effect of the Ebola outbreak and it is published here for the first time. The Selfie ANN has been applied also for the first time in paper. The main purpose of using different algorithms is to justify the results of using different methods and test them on the Ebola outbreak data. The core of our analysis is based on the TWC method. We have divided the data into eight sets and we have shown the method is stable. We have obtained the same prediction for the origin of the outbreak from all eight sets and the results were near the place that was reported by WHO as the origin of the Ebola outbreak. In summary, we validated our analyses by

  • comparing the results from three different algorithms (TWC, DNB, and ANN Selfie),

  • using different TWC algorithms (TWCα, TWCβ, and TWCϒ),

  • applying TWCα for eight different sets and

  • comparing with the results with the origin that was reported by WHO.

shows a summary of the outbreak over six West African countries. Our data were obtained from WHO, HealthMap, and The World Bank.

Table 1. Summary of outbreak over six West African countries.

According to information from the WHO, Ebola started in Guinea but spread out mostly to Liberia and Sierra Leone.

The Ebola dataset from the HealthMap included 83 locations () from western Africa monitored for a period of 32 weeks (from March 4 to October 4, 2014). The number of new Ebola-infected persons was recorded every week for each city. Notice that the number of confirmed cases in seven locations is zero. We consider any error in the data as noise that can occur with real data.

Table 2. The Ebola dataset from the HealthMap including 83 locations from western Africa monitored for a period of 32 weeks (from 4 March to 4 October 2014).

shows the locations and the number of cases of the recent Ebola outbreak in Democratic Republic of Congo in May 2018 (Ebola Virus Disease Citation2018). The recent outbreak was controlled quickly and it was not as widespread as the outbreak in 2014. This is the ninth Ebola outbreak in the last 40 years in Congo. The most recent outbreaks in Congo were in Likate district in May 2017 and in Boende in 2014. Most of the outbreaks occurred in isolated areas (Update on the Ebola outbreak in Democratic Republic of Congo, Medicines Sans Frontiers, Doctor without Borders, May 2018). The outbreak is reported in 10 locations. Some of those locations are very close to each other (Update on the Ebola outbreak in Democratic Republic of Congo, Medicines Sans Frontiers, Doctor without Borders, May 2018). We have used three coordinates that represent Wangata, Bikoro and Iboko heath zones according to WHO reports (Ebola virus disease-Democratic Republic of the Congo, Disease outbreak news Citation2018).

Table 3. The locations and the number of cases of the Ebola outbreak in Congo (May 2018).

The Model

The approach we used was based on artificial adaptive systems. For this study, we used the Topological Weighted Centroid (TWC) suite of algorithms (Buscema et al., Citation2009; Buscema et al. Citation2009, Citation2013; Buscema, Massini, and Sacco Citation2017; Grossi, Buscema, and Jefferson Citation2009), Dynamic Naive Bayesian (DNB) Algorithm, and Selfie ANN.

Topological Weighted Centroid Algorithm

The Topological Weighted Centroid, whose theory and successful applications have been discussed in detail in our previous publications (Buscema et al., Citation2009; Buscema et al. Citation2009, Citation2013; Buscema, Massini, and Sacco Citation2017; Grossi, Buscema, and Jefferson Citation2009), was used to predict the origin and to analyze the spread of the recent Ebola outbreak. In this paper, we discuss TWC briefly. The foundation of this artificial neural system algorithm is statistical mechanics. We define the possible states of the system in terms of the locations (longitude and latitude) of the outbreak. For each state, we use the Boltzmann factor to define the probability of the state and then we are able to define the free energy and entropy of the system. Using the free energy and entropy we can then configure the state space giving us the possible source(s). From the source we then predict the possible spread. We define the system similar to a two-dimensional physical system that is a combination of several point masses, charges, or heat sources. The system is a finite two-dimensional discrete space with some active points (the outbreak points similar to the heat sources or charges in a physical system). The effect of the outbreak on all points of the discrete space is given by some scalar fields similar to the electric potential or temperature field in a physical system. Here, in some of our algorithms, only the location of the outbreak points are used and in some in addition to the outbreak points, the coordinates of all points on the discrete space are used. We use the longitude and latitude of the points in the region of interest as the two-dimensional coordinates. Three different TWC algorithms, TWCα, TWCβ, and TWCγ were used. The main differences between them stem from different definitions of the state of the system.

TWCα

The average location of the outbreak in the TWCα algorithm is measured according to the distance of the outbreak points from each other. The TWCα calculates the optimum value of α parameter. This optimal value of α is used to predict the coordinate of the origin of the outbreak TWCxα,TWCyα. The coordinate is given in terms of latitude and longitude. The weighting factor is defined in a similar way as the Boltzmann factor in statistical thermodynamics and is given by

(1) Wiα=1N1j=1,jiNedi,jDα(1)

where N is the total number of outbreak points (states), di,j can be the Euclidean distance between point i and j or a function of the Euclidean distance, D=maxdi,j and α is a variable parameter 0α< that will be discussed later. In equation (1) because of ij, the summation is divided by N1. In this analysis we have used a particular function of Euclidean distance for di,j that is called meta distance or indirect distance (Buscema, Massini, and Sacco Citation2017). From a physics perspective di,j can be interpreted as the potential energy associated with sources i and j. The sum over all weighting factors is similar to the partition function in statistical thermodynamics and works as a normalizing factor of the probability. It is

(2) Zα=i=1NWiα(2)

The probability of being in state i is given by

(3) Piα=WiαZα(3)

The coordinates of TWCα are defined as

(4) TWCxα=i=1NxiPiα(4)

and

(5) TWCyα=i=1NyiPiα(5)

As α0 TWCα approaches the center of mass of the system. The free energy, Fα, and entropy (information) Sα, of the system are defined in a similar way as in statistical thermodynamics,

(6) Fα=lnZαα(6)

and

(7) Sα=i=1NPiαlnPiα(7)

The optimal value ofα=α is given by optimizing the free energy and entropy of the system. As α increases from zero to α the distance between the TWCα and the center of the mass increases and when α exceeds α the distance between TWCα and the center of mass starts decreasing. Therefore α=α is a turning point. As α changes from zero to α the TWCα point follows a path from the center of mass to TWCα. According to this algorithm the coordinate of the point of interest (the origin of the outbreak) is at TWCxα and TWCyα.

TWCβ

From a physical perspective, the TWCα calculates the interaction of sources with each other, similar to the interaction of masses or charges on each other. Now we want to find out the effect of the sources on the space. Each source of the outbreak would be an outbreak potential in the space. The potential (possibility) of outbreak in the near points to the source should be larger than the farther points. In the physics terminology, the source distribution would create a field in the space, similar to the gravitational field. The TWCβ algorithm is the foundation of the scalar field which is discussed later. This scalar field has a similar perspective as the electric potential due to an electric charge distribution. In the TWCβ algorithm, we include the distance of all points to themselves, di,j=0. Since the effect of a source on the neighboring region should be the largest effect thus the weight corresponding to di,j=0 should be the largest weight.

This algorithm is similar to TWCα. In that algorithm the distance of the point to itself was not included so, to include the distance of the point to itself, the weighting factor from equation (1) is replaced by

(8) Wiβ=1Nj=1Nedi,jDβ(8)

Equation (8) shows the weight increases as di,j deceases and the largest weight is for di,j=0 as we expected.

The partition function and probability are defined in a manner similar to that of (2) and (3):

(9) Zβ=i=1NWiβ(9)

and

(10) Piβ=WiβZβ(10)

The coordinates of TWCβ are given by the equations similar to that of (4) and (5),

(11) TWCxβ=i=1NxiPiβ(11)

and

(12) TWCyβ=i=1NyiPiβ(12)

We also define the free energy and entropy similar to equations (6) and (7). We will find the optimal value of β=β in a similar way as was shown for TWCα. The optimal value of β=β will be used to determine the topological weighted scalar β and γ fields that will be discussed shortly.

TWCγ

The TWCγit is an algorithm that obtains a set of points (trajectory) associated with each outbreak point at step t. The parameter γit is variable thus TWCγit is a vector associated with each outbreak point and gives it a trajectory that starts from the i-th outbreak point and continues to an optimal value. The optimal value will be discussed later. In this case, we have a weighting factor that associates two points to each other. We follow a logic similar to that of (1) without summation over j in the weighting factor.

(13) Wi,jγit=edi,jDγit(13)

The partition function is given by

(14) Ziγit=j=1NWi,jγit(14)

and probability function for the dependence of point i to point j is given by

(15) Pi,jγit=Wi,jγitZiγit(15)

The coordinates of the i-th TWCγit at step t are given by

(16) TWCi,xγit=i=1NxjPi,jγit(16)

and

(17) TWCi,yγit=i=1NyjPi,jγit(17)

The entropy (information) associated to the i-th outbreak point is given by

(18) Siγit=j=1NPi,jγitlnPi,jγit(18)

The optimal or final value of γit=γi is given by the following equilibrium condition

(19) Siγitγitγit=γi=0(19)

There are NTWCγ points, where N is the number of outbreak points. Basically TWCγ transforms the outbreak points to new TWCγ points. It maps the i-th outbreak point to point TWCiγit at step t. Since TWCiγit is the trajectory of the i-th outbreak point, we can say TWCγ maps the i-th point along a trajectory. The coordinates of TWCγ will be used for the topological weighted scalar γ field that will be discussed in the next section.

The Topological Weighted Scalar β and γ Fields

The effect of the outbreak on the space according to our approach is given by the scalar fields (Buscema et al., Citation2009; Buscema et al. Citation2009, Citation2013; Buscema, Massini, and Sacco Citation2017; Grossi, Buscema, and Jefferson Citation2009). The outbreak creates a scalar field similar to an electrical or gravitational scalar potential. We associate the field with the outbreak distribution and the distances of the points on the space to the outbreaks points. Suppose the two-dimensional discrete space is given by M points and there are N outbreak points. The Topological Weighted Scalar β Field, at a point k is defined as

(20) TWSFkβ=1Nj=1Neqk,jDβ(20)

where 1<k<M, D is the maximum Euclidean distance between the outbreaks points, and qk,j is the Euclidean distance between the k-th point on the discrete space and the j-th outbreak point. There are M values of TWSFkβ for each point on the two-dimensional space.

The Topological Weighted Scalar γ Field (TWSFγ), at a point k can be defined in a similar manner to that of the Topological Weighted Scalar β Field:

(21) TWSFγkβ=1Nj=1Nerk,jDβ(21)

where rk,j is the Euclidean distance between the k-th point and the j-th TWCjγjt point at step t.

Dynamic Naive Bayesian/Dynamic Networks Block Algorithm

Dynamic Naive Bayesian/Dynamic Networks Block (DNB) is an algorithm to cluster the complex cause–effect relationships in a discrete dataset representing a multivariate temporal process.

The DNB algorithm is composed of three parts:

  1. Equations to detect the local cause–effect relations between the variables in the temporal flow (see equation (22));

  2. Equations to establish the global and prevalent cause–effect relations between the variables for the entire temporal multivariate process (see equations from (23) to (25));

  3. Equations to select the most likelihood cause–effect relations between variables at different time scales (Multi-Scale Entropy – MSE) in order to cluster them and to project them in the direct graph, a sparse or connected graph (equations (26) and (27)).

The temporal correlation is given by

(22) Ik,k+1,i,jp=xk,ip.xk+1,jp(22)

where xk,ip is the k-th instance of the i-th variable from the p-th sample.

The average of the temporal correlation is given by

(23) xi,jp=1Mk=1M1Ik,k+1,i,jp(23)

where M is the number of time steps. The temporal correlation variance is

(24) σi,jp=1MkM1Ik,k+1,i,jpxi,jp2(24)

The temporal correlation strength according to the probability function of x, px, is given by

(25) si,jp=kM1Ik,k+1,i,jppIk,k+1,i,jp(25)

The global temporal correlation at different time scales is given by

(26) ti,j=pQsi,jpzNsz,jp(26)

where N is the number of variables and Q is the number of samples.

The maximum of global temporal correlation gives the greatest likelihood of “Cause j ” of “Effect i”:

(27) Cj=argmaxti,ji(27)

Selfie Artificial Neural Network

Selfie ANN is an algorithm able to estimate the perception that a set of entities have of space where they are located in. This estimation allows to predict the diffusion of a process, if the assigned entities represent a sample of the process itself.

Selfie algorithm learns the position of each Entity in relation to any other. In this way the position of each entity is learned in a collective and convergent modality by the other entities.

When a new point appears in a specific position, all the trained entities will try to modify the position of the new point in relation to the logic that they have previously learned (the Hyper Surface computed).

The data organization to implement (training and testing) Selfie ANN is:

Training:

The Dataset for training phase is given by

(28) DB=Input,Outputn=1N2(28)

Each input vector is any possible combination of two entities, i and j, with a total of N2.

(29) Inputn=xn,i,yn,i,xn,j,yn,j(29)

and each output vector is given by

(30) Outputn,i=x n,i,y n,i(30)

where i,j are assigned points indices (entities) and n is the index of the input and the output vectors. Each output vector defines how a second entity j estimates the position of the first entityi.

Testing:

input vector is given as following

(31) Inputpn=xn,p,yn,p,xn,j,yn,jn=1N(31)

where p the index of a generic point. Each input vector defines how a second entity j estimates the position of a generic point p.

The output for testing is given by

(32) Outputp=1Nn=1Nx n,p,1Nn=1Ny n,p(32)

Each output vector defines how all N entities estimate the position of a generic point p.

During the test phase we may have 3 typical cases:

Normal case 1: The new point has the same position as the trained entities: in this case the new position assigned to it will be similar to its original position (according to the training error).

Normal case 2: The new point has different coordinates from any of the trained entities: in this case the new position assigned to it will be different from its original position, in order to maximize the function learned.

Relevant case: The new point has different coordinates from any of the trained entities, but this time its position remains the same. Consequently, this new point belongs to the learned function more that the trained Entities.

All the new points with different coordinates from any of the trained entities and whose position remains unchanged during the test phase, have to be considered the hidden entities of the computed function.

These new points belong to the function more than the trained entities. Consequently, they have to be considered the kernel of the function itself. These points may consider the virtual future of the studied process.

Results

The main part of our analysis is related to 2014 Ebola outbreak. The recent outbreak (May 2018 in Congo) was not as widespread as the 2014 outbreak and it has a simpler dynamics.

Ebola Outbreak 2014

We have used the data provided by HealthMap. The longitude and latitude information for the outbreak location was not given. The longitude and latitude information was detained from Google Map. shows the map of Ebola outbreak as of September 1, 2014. Most of the Ebola cases are found in Liberia, Sierra Leone, and Guinea which are the neighboring countries. shows the scale of the Ebola outbreak over the countries with the most cases and shows an outbreak covering about 1000 miles (about 1600 km). We used “MapQuest Open” for mapping the results.

Figure 1. Ebola outbreak as September 1, 2014 from http://healthmap.org.

Figure 1. Ebola outbreak as September 1, 2014 from http://healthmap.org.

Figure 2. Estimated length over the area with the most outbreaks, Google. Map was used.

Figure 2. Estimated length over the area with the most outbreaks, Google. Map was used.

The outbreak is reported over 76 places (by removing seven places with zero outbreak from data given in ) from March 2014 until the first week of October 2014 (32 weeks). We divided the data into eight sets, each set being 4 weeks (almost a month). Each set includes the data from the beginning (the first week) to the end of that set. For example, the data for set 5 is the data from the first week to the end of 20th week. We computed the coordinates of TWCα for different sets. The TWCα algorithm predicts the origin of the outbreak. The results are approximately the same and they are near the place that was reported by WHO as the origin of the Ebola outbreak. This can be considered as a self-validation techniques for TWC and testing the TWC with the real observation.

shows the coordinates of TWCα for different sets. All of these results fall into the forested area in the south eastern part of Guinea. The distances between them are relatively small compared to the entire area of the Ebola outbreak in those countries. For example, the distance between TWCα of set 3 and set 7 is about 57 km which is the maximum distance between TWCα points. Guekedou (Longitude: −10.1336, Latitude: 8.5674) is the possible origin of the Ebola outbreak. The distances between the average TWCα and Guekedou is about 64 km. The distance has been calculated by using the “Latitude/Longitude Distance Calculator” that was obtained from the National Weather Service, National Hurricane Center.

Table 4. TWCα coordinates for dataset.

The TWCα algorithm estimates the original outbreak area. shows the estimated length corresponding to this area is about 34 miles (about 54 km). This result has been overlapped with the map of the region. The color map in represents the probability distribution for the possible origin of the outbreak through the region. The dark red shows the region with the highest probability of being the origin of the outbreak. It also shows the possible outbreak pattern in the region. The TWCβ and TWCγ give more information about the outbreak pattern.

Figure 3. The outbreak area predicted by the TWCα algorithm, open.mapquest was used.

Figure 3. The outbreak area predicted by the TWCα algorithm, open.mapquest was used.

The scalar field from TWSFβ algorithm shows the prediction of the Ebola outbreak spread. The outbreak spread follows the gradient of the field. The TWSFβ scalar field was mapped onto the region in . The hot spot is in the forested area in the south eastern portion of Guinea. The TWSFβ scalar field () not only shows the pattern of the outbreak around the origin similar to the results from the TWCα algorithm () but also captures two more regions (in the south of Nigeria and in the Democratic Republic of Congo). As we mentioned in the previous section, the weighting function in TWCβ includes the distance of a point from itself. This is the reason the additional regions have been captured in TWCβ algorithm. The main purpose of the TWCα algorithm is to find the location of the origin of the outbreak which was found at a distance of about 64 km from Guekedou. The new region(s) can be considered as the secondary region(s) of the outbreak and they can act independently in the outbreak distribution.

Figure 4. The TWSFβ scalar field for the Ebola outbreak. It captures three regions of outbreak, the main region which includes the origin of the outbreak (dark red) in the most west part of the map and two secondary regions in Nigeria and Democratic Republic of Congo (light blue). The red dots are the points of the dataset. Open.mapquest was used.

Figure 4. The TWSFβ scalar field for the Ebola outbreak. It captures three regions of outbreak, the main region which includes the origin of the outbreak (dark red) in the most west part of the map and two secondary regions in Nigeria and Democratic Republic of Congo (light blue). The red dots are the points of the dataset. Open.mapquest was used.

In the results of TWSFγ are shown. In this figure, we show the location of Guekedou, Macenta, Nzerekore, and Kissidougou districts. They are close to each other. Guekedou and Macenta are overlapping. According to the WHO, this is the origin of the Ebola outbreak (Sylvain et al. Citation2014; WHO Ebola Response Team Citation2014). The prediction of TWCγ matches the WHO report. The results from TWSFγ are very similar to the TWSFβ. The TWCγ algorithm follows the trajectory of the outbreak at each iteration step and finally gives the outbreak map at the optimal value of β. The TWSFγ results, , capture two regions (the main and one secondary region) but the TWSFβ, , captures three regions. It means that the region in Democratic Republic of Congo is not as important as in region the south of Nigeria in terms of the cause of the outbreak. This is a reasonable result since the region in Democratic Republic of Congo is farther from the main origin of the outbreak compared to the region in the south of Nigeria.

Figure 5. The results of TWSFγ. It captures two regions (the main region and one secondary region). The red dots are the points of the dataset. Open.mapquest was used.

Figure 5. The results of TWSFγ. It captures two regions (the main region and one secondary region). The red dots are the points of the dataset. Open.mapquest was used.

The DNB algorithm was applied to the dataset to detect the main cause-effect temporal relationship among the cities, in order to understand the spatial and the temporal evolution of the Ebola virus. In this application multi-scale sampling was not applied. We used the data in , and the DNB algorithm that was discussed before. In the DNB algorithm, we need the number of samples, variables, and time steps. In this case the number of samples is one, the variable is the number of new Ebola cases and the number of time steps is 32 (32 weeks). The DNB algorithm has generated a directed graph where the main cause–effect relations among cities are shown ().

Figure 6. DNB algorithm results, it shows the cause of the ebola outbreak is at Kissidougou.

Figure 6. DNB algorithm results, it shows the cause of the ebola outbreak is at Kissidougou.

shows the main cause of the Ebola outbreak is at Kissidougou and is located in the forested area in the south eastern of Guinea this was confirmed by WHO and MoH of Guinea report as the origin of the outbreak.

We use a new algorithm called Selfie and show the overlap of the results on Google earth (). The same data () have been used for this analysis.

Figure 7. The overlap of the Selfie result on Google earth. Red dots are the points of the dataset.

Figure 7. The overlap of the Selfie result on Google earth. Red dots are the points of the dataset.

The result from Selfie shows the Ebola outbreak is spreading toward the south of Guinea and mostly Sierra Leone and Liberia and it might spread over other countries if it is not be controlled. The prediction pattern overall formed a ring over a large scale that includes the infected countries.

Ebola Outbreak 2018

The Ebola outbreak in Congo in 2018 was controlled quickly and it did not spread out in the large region. For this case, we have used only TWCα. The Ebola outbreak was reported in 10 places (see ). The largest distance between the outbreak points is the distance between Wangata and Loango which is less than 150 Km. The Ebola outbreak spread out over a triangular region (between Bikoro, Iboko, and Wengata heath zones) with the area of less than 5000 km2. Most of the outbreaks were distributed in the regions of Bikoro and Iboko health zones. We have used three set of coordinates corresponding to Bikoro, Iboko, and Wengata heath zones as shown in (Ebola virus disease-Democratic Republic of the Congo, Disease outbreak news Citation2018). The results of the TWCα algorithm predicted the origin of the outbreak at (Longitude: 18.3046, Latitude: −0.6865) in the Equator located in the suburb of Bikoro. This location is at the north west of Bikoro at the distance less than 20 km to Bikoro.

Table 5. The Ebola outbreak dataset that has been used for the TWCα algorithm to predict the origin of the outbreak in Congo (May 2018).

Conclusion

In this paper Topological Weighted Centroid (TWC), Dynamic Naive Bayesian (DNB) Algorithm, and Selfie ANN were used.

In this paper three different Topological Weighted centroid, TWC, algorithms, TWCα, TWCβ, and TWCγ were applied to the Ebola outbreak data that were extracted from HealthMap. Our results show the main origin of the outbreak is in the forested area in south eastern Guinea and that has been confirmed by WHO and MoH of Guinea as being the source of Ebola. The TWCβ algorithm captures two secondary regions in addition to the main region of the outbreak but TWCγ algorithm in addition to the main region captures only one secondary region. This indicates that the secondary region captured by both the TWCβ and TWCγ algorithms is more important than the region that was captured only by TWCβ. The secondary regions can be considered as the new cause of the outbreak.

The DNB algorithm has elicited the cause-effect logic of the virus scattering and these findings also seem to be reasonable. The DNB algorithm predicted the main cause of the Ebola outbreak is at Kissidougou and is located in the forested area in the south eastern of Guinea.

The Selfie ANN was used to predict the future diffusion of the virus from September 2014. The updated data from WHO has confirmed this predictive scenario (see ). Only Nigeria was overestimated, but the Nigerian authorities worked a lot to stop the virus diffusion after the first contamination (Stephen and Sabeti Citation2014). Further, Selfie ANN suggests a circular area which is specific to an Ebola diffusion in West Africa. But according to the Selfie prediction, the center of this area should be completely safe (around Burkina Faso), and the future data coming from WHO has confirmed this estimation. At this point a question arises: why? Analysis of the flora and fauna of this area would be useful in order to know if some natural inhibitor of the virus was active in that area.

Table 6. New cases from September 2014 (Source: http://healthmap.org/ebola/#).

We have also used the TWCα algorithm to predict the origin of the Ebola outbreak in DR Congo in 2018. The algorithm predicts the origin is in the suburb of Bikoro that is matched with WHO’s reports.

Acknowledgments

We would like to thank “Quantic Nivi Group” for their great service for superimposing our results on the regional maps.

Disclosure statement

The authors declare that there is no conflict of interest regarding the publication of this paper.

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.