775
Views
2
CrossRef citations to date
0
Altmetric
Research Article

Computational modeling of consistent observation of asynchronous distributed computation on N–manifold

| (Reviewing editor)
Article: 1528029 | Received 18 Jul 2018, Accepted 20 Sep 2018, Published online: 12 Oct 2018

Abstract

The present-day distributed computing systems are asynchronous in nature, and they cover heterogeneous nodes as well as networks having geographic distribution scale. These distributed systems are prone to unpredictable network partitioning, communication delay, and node failures. The realizations of consistent observation in such systems are challenging. The traditional models of distributed computation are not fully adequate to incorporate characteristics arising in the new paradigm. Alternatively, the homology and topology based distributed computing models are formulated to gain a new perspective. This paper proposes a computational model of consistent observation of asynchronous distributed computation on N-manifold. The proposed model offers control of granularity of fineness of observation of computation to varying degree. The discrete geometric simulations of computational structures on 3-manifold offer different analytical insights. The extracted lattice chains of distributed computation on manifold illustrate execution dynamics of the system under consideration.

PUBLIC INTEREST STATEMENT

The fault detection and analysis of observable distributed computing systems are central to system design. The manifold structural model of observable distributed computing systems would benefit the system designers to carry out fault modeling and analysis in a deterministic system. This paper proposes the analytical formulation of model of observable distributed computation on N-manifold. The analytical model incorporates the varying granularity of observation of computation.

1. Introduction

The modern applications of distributed computing systems are pervasive encompassing traditional cluster or grid systems, cloud computing systems, and mobile distributed systems (Akinwunmi, Olajubu, & Aderounmu, Citation2016; Babaoglu & Marzullo, Citation1993; Ranga, Dave, & Verma, Citation2016). The large-scale distributed computing systems are difficult to deploy maintaining stability and consistency without employing any monitor. The observation of distributed computation is an important factor in ensuring stability of computation (Fidge, Citation1996; Parlangeli & Notarstefano, Citation2012). Furthermore, correctness of distributed computation can be determined by incorporating consistency detection mechanism in a sequence of observations of computation (Babaoglu & Marzullo, Citation1993; Goldberg, Gopal, Lowry, & Strom, Citation1991). In large-scale mobile distributed systems involving wireless sensor networks, the network partitioning is unpredictable, and it results in random failures of distributed computation on aggregated data. Thus, the applications of monitor to enhance observability and consistency of distributed computation are required in order to generate stable output. The traditional models of distributed computations are constructed by using elements of graph theory and computational logic based formalisms (Garg, Agarwal, & Ogale, Citation2014; Schwarz & Mattern, Citation1994). However, a relatively new approach to model distributed computing systems is to employ concepts of algebraic topology and homology (Bauer, Kerber, & Reininghaus, Citation2014; Conde & Rajsbaum, Citation2012; Goubault, Citation2003). The topological models of distributed computing offer analytical insight to complex systems in new perspectives. The topological and homological models help in analyzing distributed systems having large event space, which results in formation of very large space of combinatorial executions. The manifold is a multidimensional space, which can be represented in geometric forms having certain characteristics. The construction of models of consistent observability of asynchronous distributed systems on a manifold would offer analytical insight to the sequences of executions represented as snapshots. Moreover, granularity of fineness of observation can be controlled in such asynchronous systems appropriately.

1.1. Motivation

The consistent observation of sequence of computation in a distributed system is critical to determine correctness of execution (Babaoglu & Marzullo, Citation1993). The determination of ordering of events in a distributed system is difficult due to enlarged combinatorial execution space of computation. As a result, the observability of distributed systems becomes difficult (Fidge, Citation1996). The recovery of distributed computation is often required if a fault is detected in the execution sequence. The mechanism of restoring consistent states in a large-scale distributed system involves persistent logging of multiple checkpoints for recovery (Goldberg et al., Citation1991). However, this mechanism enhances space complexity to a large extent if the combinatorial execution state space is large. Furthermore, determining correct combinatorial execution checkpoint is computationally expensive in a distributed system having a large state space. The realization of monitor of distributed computation is not trivial due to the existence of causal relations between any two pair of events (Schwarz & Mattern, Citation1994).

The formal modeling of computing systems enables to gain an analytical insight to the complex systems, which facilitates robust designs. For example, a formal specification of dependability of computation in large-scale pervasive systems is constructed (Ayara & Najjar, Citation2008). The stable predicate detection in distributed computation having infinite run is formulated by restricting the run within finite graph model (Garg et al., Citation2014). However, constraining the infinite computation on finite structure may invite oscillatory execution pattern incorporating indeterminate convergence. A comparatively new approach to model distributed computing systems involves elements of algebraic topology, lattice theory, and homological algebra offering new insights (Bagchi, Citation2018; Bauer et al., Citation2014; Conde & Rajsbaum, Citation2012; Herlihy & Rajsbaum, Citation1999). This paper proposes a formal model of consistent observation of a distributed computation on N-manifold. The proposed model does not impose any condition of finiteness of computation. The variable filter function can control the granularity of observation of states in a sequence of computation while maintaining consistency conditions. The main contributions of this paper are as follows.

  • Construction of a formal model of observation of asynchronous distributed computing on N-manifold structure

  • Integrating filtering method to generate consistent observation of distributed computation

  • Incorporation of variability of filter to prepare consistent observation of computation with varying granularity

  • Evaluation of resulting manifold structure of distributed computation in 3-D and lattice chain embedding on it

The rest of the paper is organized as follows. Section 2 presents related work. Section 3 describes preliminary concepts. Sections 4 and 5 present construction of distributed computing on N-manifold structure and formulation of consistent observation model, respectively. Section 6 presents analytical properties. Section 7 and 8 describe computational evaluation results and application aspects, respectively. Finally, section 9 concludes the paper.

2. Related work

In general, the formulation of analytical model of distributed computing is performed by using modular graph structures and relational algebraic operators (Rhode, Presicce, Simeoni, & Taentzer, Citation1999; Sitohang, Citation2002). The observations of a distributed computation in multiple phases are required in order to determine the controllability of the system. In theory of distributed computing, consistent observation of sequences of computation is a critical factor (Babaoglu & Marzullo, Citation1993). In general, observability of a distributed system is hard to realize due to combinatorial arrival order of events (Fidge, Citation1996). A distributed computation can be modeled as a cyclic graph, where nodes represent distributed processes and edges represent networks between processes. The graph structures can be dynamic, and hybrid distributed systems are modeled by using dynamic graphs structures (Kuhn, Lynch, & Oshman, Citation2010). It is proposed that number theoretic approaches can be successfully applied to determine observability and reachability of paths as well as cycles in an arbitrary graph (Parlangeli & Notarstefano, Citation2012). Furthermore, the unsolvable systems can be recognized by Eigen analysis. The main challenge of this model is that, detection of a prime number is necessary to realize observability of cycles in the graph, which is computationally expensive. This is because the prime number detection is computationally hard.

The debugging of distributed computation requires rollback recovery in the presence of faults. Researchers have proposed a rollback and reply mechanism to realize restoration of consistent global states in a distributed system (Goldberg et al., Citation1991). The design requires records of persistent logs and stable checkpoints, which enhances space complexity to a large extent if the combinatorial execution state space is large. Moreover, the determination of causal relationship between events in the event space of a distributed computing is necessary in order to implement computation monitor (Schwarz & Mattern, Citation1994). The topological models of distributed computing offer an analytical insight to the complex systems. In recent times, distributed computing systems are modeled by employing combinatorial topological structures (Armstrong, Citation1983; Edelsbrunner & Harer, Citation2010; Herlihy & Rajsbaum, Citation1999). The shared memory based distributed computing model is formulated by employing algebraic topology (Conde & Rajsbaum, Citation2012). The conceptual framework of persistent homology is utilized in modeling distributed computation (Bauer et al., Citation2014; Zomorodian & Carlsson, Citation2005). The homotopy theory and topological spaces are employed in modeling the mutual exclusion as well as its complexity in concurrent computing systems (Carson & Reynolds, Citation1987; Fajstrup, Rauben, & Goubault, Citation2006; Goubault, Citation2003; Gunawardena, Citation1994). However, the homotopy theory needs adaptation while formulating distributed systems due to directional property of homotopy theory.

3. Preliminary concepts

Let X be a point set and τxΩ(X) where Ω(X) is a power set of X. If the 1-D space represented by X is Euclidean, then the space can be metrized to generate a metric space (X,dX) by equipping it with a real-valued function, dX:X2[0,+). The resulting space (X,dX) is a 1-D finite metric space having size n=X if following axioms are maintained:

(1) n(1,+),a,b,cX,dX(a,b)=dX(b,a),[a=b][dX(a,b)=0],dX(a,c)dX(a,b)+dX(b,c)(1)

If τx represents a topology on X, then the resulting topological space induced in the metric space is (X,dX,τx), where Aiτx,(Ai,dX) constructs metric subspace. The metric space having topological structure (X,dX,τx) maintains following properties:

(2) mZ+,m(1,+),ϕ,Xτx,[Ai,Akτx][AiAkτx],Aiτx,i=1mAiτx(2)

The properties of generalized topological space state that the space is closed under arbitrary union as well as intersection operations. However, the intersection operation should be finite. Let, NZ+1,fi:BiN be the homeomorphic and invertible function operated on the topological space BiX. If X is Hausdorff, then the resulting structure (Bi,fi) is called a chart on N-manifold (Lee, Citation2013). If (BiX,fi) and (BkX,fk) are two charts on N-manifold, then a transition map can be constructed as,

(3) BiBk=V,Vϕ,fkfi1:fi(V)fk(V)(3)

The charts (BiX,fi) and (BkX,fk) are smoothly compatible if V=ϕ. An N-manifold is called C- class if the following property is maintained by it,

(4) rN,<limu+Du(fkfi1)|r<+(4)

It can be possible that, an N-manifold is smooth to a degree v<+. Thus, different classes of manifolds can be formed depending on respective natures.

4. Distributed computing on N–manifold

Let P=pi:iZ+iN be a set of distributed processes and Z0+=Z+0. The set of events local to a distributed process piP is denoted by Ei, where ϕEi. Let EP be N-dimensional given by EP=E1×E2×...×EN. If piP,C:EiZ0+ is a local bijection within individual distributed processes, then C(TE)Z0+, where TE=i=1NEi and C(ϕ)=0. Let, CP:EPC(TE)N be such that,

(5) βEP:CP(β)(Z0+)N(5)

If piP,hi:EiCP(EP) is invertible, then a N-manifold MN is given as,

(6) HP=hi:piP,MN=(TE,HP)(6)

In the proposed model of distributed computation on N-manifold, the underlying manifold structure is not differentiable due to discrete nature of computing space represented by TE.

4.1. Hausdorff property of computing space

The distributed computing space can be metrized to form metric space by equipping it with a suitable function (Bagchi, Citation2018). Let a discrete metric dS:TE2[0,+) be defined on distributed computing space TE as,

(7) a,bTE,[a=b][dS(a,b)=0],[ab][dS(a,b)=1](7)

The resulting computing space (TE,dS) is Hausdorff because the following axiom is maintained by it,

(8) ε(0,1),ab:Bε(a)={x:xTEdS(a,x)<ε},Bε(a)Bε(b)=ϕ(8)

This validates that the distributed computing space TE is metrizable Hausdorff space, which can be mapped on an N-manifold.

4.2. Locally Euclidean space of computation

The set of events local to individual distributed processes can be counted by employing locally applicable monotone logical clock function defined over respective intervals as,

(9) piP,IiZ0+:C(Ei)Ii(9)

The logical clock function C(.) is a local bijection. Hence, the local event spaces of distributed processes can be converted into respective metric spaces (Ei,dE) if it is equipped with a function, ATE,dE:A2[0,+). This leads to the restriction to local formulation as, piP,AEi,dE:A2[0,+). The resulting metrization function is defined as,

(10) a,bEi,xyIi:x=C(a),y=C(b),dE(a,b)=xy(10)

This indicates that (Ei,dE) is a Euclidean metric space of the asynchronous distributed computation under consideration.

5. Computation observation on manifold

The observability of distributed computation is an important parameter in order to determine stability of systems and to measure control dynamics. The observability of a distributed system enables implementation of consistency detection in execution sequence as well as fault recognition. If a distributed system is designed with possibility of faults other than Fail-Stop mode, then consistent observation of distributed computation is required to ensure error-free execution. This section presents the model of consistent observation of distributed computation on N-manifold having varying granularity of observations.

5.1. The εk-fine cut of computation

Given a distributed computation on MN, the computation is not consistent everywhere on MN. The consistency of a computation can be verified by considering computational cuts on MN preserving snapshot of execution status of distributed processes. A set of N-dimensional cuts on MN is given by ΛCi=1Nhi(Ei). The εk-cut of ΛC is a restriction on taking snapshots of execution on MN preserving consistency of observation of distributed computation. The εk-cut (Λk) is defined by following axioms for εk,kZ+,

(11) ΛkΛC:εk(0,+),xi=C(aiEi),Λk=(x1,x2,...,xN):xiZ0+dE(ai,akEk)εk(11)

Evidently, εk>0, the Λk may not provide meaningful information about execution states of computation. This is due to the fact that, a widely dispersed cut on MN for a sufficiently large εk would violate consistency of observation of computation considering varying execution states of a group of processes in asynchronous distributed computing systems. If Sε=εk:kZ+ represents a set of choices to generate εk-cut on MN, then an acceptable value to perform consistent observation in finite form is given by

(12) εd=inf(Sε)(12)

Thus, the finest possible observation Λd of an asynchronous distributed computation can be performed by maintaining the following axioms,

(13) ΛdΛk:εd=1,Λd=(x1,x2,...,xN):xiZ0+dE(ai,akEk)εd(13)

The coarse grained observation can be performed by considering εe=sup(Sε) and dE(ai,akEk)εe. Hence, the variation of fineness of observation of asynchronous distributed computation can be performed on N-manifold.

5.2. Consistency and filter model

The manifold MN generated by distributed computing is not consistent everywhere within the space. Thus, an arbitrary function sequence on N-manifold of asynchronous distributed computation, given by FN=(hi)i=1i=N, may not be computationally consistent everywhere. The consistency can be maintained by employing a filter function on the N-manifold. Let all possible cuts of execution on N-manifold be given by ΛM=εkSεΛk. A predicate Γ(β)0,1 determines consistency of cut βEP by evaluating the predicate in corresponding event space given as,

(14) EΓ=β:CP(β)ΛM(14)

Any generalized and arbitrary cuts of distributed computation are not observably consistent. Hence, a filter is required to generate consistent cuts out of a set of cuts of computation. Let us assume that Γ(β)=1 signifies consistent cut on N-manifold. Let a function be given as,

(15) A(Z0+)N,BA,g:ABϕ,g(aA)=g(bB)a=b(15)

The function g(.) is a filter function on cuts on execution manifold if it satisfies the following axioms,

(16) AΛM,βEΓ:[Γ(β)=1][(gCP)(β)B],[Γ(β)=0][(gCP)(β)B](16)

The filter function g(.) checks the validity of stable predicate while filtering out inconsistent cuts on manifold. The granularity of observable consistent cuts is determined by restricting the variations of clocks within cuts having finite limits. This indicates the non-commutative composition, hiHP,ghi(.), can be further restricted to generate a set of consistent observation on N-manifold at appropriate granularity having εk-fineness by maintaining following conditions,

(17) BkB:εkSε,[(gCP)(β)Bk][(gCP)(β)Λk](17)

In another view, it can be considered as an additional refinement to control the granularity of observation of computation.

5.3. Lattice embedding on N-manifold

Let the εk-fine and consistent snapshot of observation of a distributed computation on MN be Bk. Let a partial ordering relation <⊂Bk2 be constructed on MN maintaining lattice properties. If LBk, then a lattice chain (L,<) can be formed by maintaining the following condition,

(18) lx,lyL:[(lx,ly)∈<][(ly,lx)∈<](18)

It is important to note that (L,<) is an execution lattice chain in N-manifold. Evidently, an execution lattice chain represents a consistent sequence of observations of asynchronous distributed computation on the respective manifold.

6. Analytical properties

6.1 Theorem: The space (X, dX) is not Hausdorff, where X=i=1NCEi and dXxX,yX= xy.

Proof: Let ATE such that aEi,bEk,a,bA. However, there is no globally consistent clock in any distributed systems and C(TE)Z0+. Thus, the following axioms can be satisfied by a distributed computing system,

(19) C(Ei)C(Ek)=D,Dϕ,[C(a)DC(b)D][dX(C(a),C(b))=0](19)

Hence, ε>0 such that, if Bε(C(a))={x:dX(C(a),x)<ε} and Bε(C(b))={y:dX(C(b),y)<ε}, then the following axiom is satisfied,

(20) ab,Bε(C(a))Bε(C(b))ϕ(20)

Hence, the space (X,dX) is not Hausdorff under local logical clock.

6.2 Theorem: If CEiIi and IiIk=ϕ for ik, then (X, dX) is Hausdorff.

Proof: As before, let be X=i=1NC(Ei) in a distributed computing system. However, for any two intervals IiIk=ϕ in the system if ik. Moreover, any distributed computing system maintains following axiom due to monotone property of logical clock,

(21) piP:[a,bEiC(a),C(b)Ii][C(a)C(b)](21)

Hence, the global computation in the respective distributed system will maintain the following property,

(22) x,yX,dX(x,y)>0(22)

Thus, (X,dX) is a metric space. As every metric space is Hausdorff, hence (X,dX) is Hausdorff.

7. Computational evaluation

The computational evaluation is performed through simulation considering 3-manifold (M3), which is made up of execution event spaces of three distributed processes. The simulation is performed through Discrete Computational Geometry (DCG) modeling technique considering 3-D shapes. The shapes are constructed based on sets containing discrete data points using mesh topology. The data grid lines are spaced in equal distribution (50:50) to maintain topological mesh without any skew. The metric (norm) is fixed to constant positive integer value that equals to 02 while constructing the grid lines. The distributed computational structures represented as manifold shapes are simulated by considering four different network communication models. The simulation considers a closed group of three processes representing a distributed system on 3-manifold. The group communication models between processes considered in experimentation are: (1) unreliable cyclic unicast in monotone (UCU), (2) reliable cyclic unicast (RCU), (3) unreliable cyclic broadcast (UCB), and (4) reliable cyclic broadcast (RCB). The execution spaces of distributed processes are consisting of local event spaces of individual processes mapped under monotone logical clock function in positive integer range including origin (Z+0). The M3 manifold structure is constructed by following the consistent execution sequences. The sets of inconsistent sequences of distributed computation are discarded by applying filter function. The snapshots of distribution profiles of points in sets in 3-D in UCU mode and RCU mode are illustrated in Figures and , respectively.

Figure 1. Distribution profile of points in 3-D in UCU mode.

Figure 1. Distribution profile of points in 3-D in UCU mode.

Figure 2. Distribution profile of points in 3-D in RCU mode.

Figure 2. Distribution profile of points in 3-D in RCU mode.

The corresponding snapshots of distribution profiles of points in sets in UCB and RCB modes are presented in Figures and , respectively.

Figure 3. Distribution profile of points in 3-D in UCB mode.

Figure 3. Distribution profile of points in 3-D in UCB mode.

Figure 4. Distribution profile of points in 3-D in RCB mode.

Figure 4. Distribution profile of points in 3-D in RCB mode.

The detailed description about resulting manifold structures and embedded lattice chains are presented below considering reliable and unreliable communication models. The process execution spaces on 3-D axes in manifold structures represent execution events-spaces of distributed processes, which are dynamic in nature having combinatorial forms.

7.1. Experiment I: unreliable network communication

The 3-manifold structure of distributed computation for UCU communication model is presented in Figure . In this case, the group of processes synchronizes computation involving a shared variable by using unreliable unicasts in repeated cycles between a pair of processes in the group.

Figure 5. Distributed computing manifold using cyclic unreliable unicast.

Figure 5. Distributed computing manifold using cyclic unreliable unicast.

The 3-manifold structure illustrates existence of multiple local supremum and infimum points. However, a global supremum and infimum can be computed representing global state of computation. The majority of surface areas are relatively smoother during computation indicating finite bound in computation. In this model of computation, the overall message complexity is minimum in stable network condition because the communication is cyclic unicast and no acknowledgment is transacted between processes. The corresponding embedded lattice chain on 3-manifold representing a consistent distributed computing sequence is presented in Figure .

Figure 6. Execution lattice chain on 3-manifold for unreliable unicast based computing.

Figure 6. Execution lattice chain on 3-manifold for unreliable unicast based computing.

The lattice chain illustrates the presence of an apparently periodic structure, which is inline to the cyclic nature of synchronization between processes in the closed group. In the next experiment step, the unicast communication model is converted to broadcast network communication model (UCB) in the closed group. The resulting 3-manifold structure generated by distributed computation is presented in Figure .

Figure 7. Distributed computing manifold using cyclic unreliable broadcast.

Figure 7. Distributed computing manifold using cyclic unreliable broadcast.

In this case, the processes in a closed group perform broadcasts in cycles without considering reliability of messages. It is considered in simulation that no two messages can be simultaneously sent or received by any process at a single time instant. The 3-manifold structure illustrates that distance between different local supremum and infimum points are reduced if the network is relatively stable. The overall smoothness of the surface is maintained in the majority of places with localized convex and concave deformations depending on the nature of process states. The corresponding embedded lattice chain of consistent distributed computing sequence is presented in Figure .

Figure 8. Execution lattice chain on 3-manifold for unreliable broadcast based computing.

Figure 8. Execution lattice chain on 3-manifold for unreliable broadcast based computing.

The lattice chain embedded on 3-manifold of UCB model represents that apparent periodicity of computation exists with relative dispersion. The dispersion effect is due to the time dilation required for completing a broadcast in a closed group.

7.2. Experiment II: reliable network communication

In this experimental set, the unreliable network communication models are changed by incorporating reliability in communication. The simulation model implements First-In-First-Out (FIFO) model of network communication. In the first experimentation, the UCU model is converted into RCU by incorporating reliability in the FIFO network communication. The resulting 3-manifold structure generated by distributed computation is presented in Figure .

Figure 9. Distributed computing manifold using cyclic reliable unicast.

Figure 9. Distributed computing manifold using cyclic reliable unicast.

In this case, the number of local supremum and infimum points are reduced, and surface appears to be relatively uniform locally (but not strictly uniform globally). The overall smoothness of the surface is moderately enhanced as compared to unreliable systems. The reason is that transitions are highly synchronized in reliable systems having bidirectional communication. The overall message complexity is higher in this 3-manifold as compared to unreliable unicast model. The corresponding embedded lattice chain of consistent distributed computing is presented in Figure .

Figure 10. Execution lattice chain on 3-manifold for reliable unicast based computing.

Figure 10. Execution lattice chain on 3-manifold for reliable unicast based computing.

Evidently, the apparent presence of any periodicity is reduced, and the computation proceeds in phases. There are several discrete transitions between phases of distributed computation. The reason is that the cyclic synchronization forms several closed-loop graphs in RCU model of distributed computation. The 3-manifold structure generated by RCB model of network communication is presented in Figure .

Figure 11. Distributed computing manifold using cyclic reliable broadcast.

Figure 11. Distributed computing manifold using cyclic reliable broadcast.

In this case, the processes in the closed group perform synchronization by using reliable broadcast in repeated cycles. However, no two messages can be simultaneously sent or received by any process at a single time instant, which is inline to real-life implementation of data communication. The global smoothness of the surface is further enhanced due to complete synchronization between every process in the system with reliable communications. The message complexity is highest in this case. The surface of RCB 3-manifold appears to be locally uniform covering a larger section indicating communication reliability and complete covering of the group of processes in each cycle. The corresponding lattice chain embedded on RCB 3-manifold is presented in Figure .

Figure 12. Execution lattice chain on 3-manifold for reliable broadcast based computing.

Figure 12. Execution lattice chain on 3-manifold for reliable broadcast based computing.

The lattice chain of RCB communication based distributed computation illustrates reduced frequency of discrete transitions in the sequence of consistent computation. This signifies the existence of highest reliability of data and distributed synchronization in a process group. The dynamics of controlled filter values for generating consistent execution sequence of distributed computation is presented in Figure .

Figure 13. Variations for infimum/supremum of fineness boundaries of observation filter.

Figure 13. Variations for infimum/supremum of fineness boundaries of observation filter.

The infimum of filter is kept fixed at lowest value (unity), and the supremum is varied while maintaining consistency of observation of distributed computation.

8. Application aspects—faults detection in computation

The modern approaches to analyze distributed computation employ various geometric analysis techniques. These techniques effectively result in shape analysis in order to detect consistency of computation in structural forms. For example, the simplicial complexes and topological models of distributed computation form a set of dynamic graph like shapes, which helps in determining consistency of computation (Conde & Rajsbaum, Citation2012; Herlihy & Rajsbaum, Citation1999). The manifold is a geometric structure, and its application helps in determining consistency of observation of distributed computation in N-dimensional space. The manifold structure employed in this paper facilitates computational shape analysis in order to detect inconsistency in observable distributed computation. The two main faults that occur in a distributed computation are: Fail-Stop fault and Byzantine fault. The faults are simulated by distribution of points in events space of processes with skews. In case of Fail-Stop fault, the points are clustered within event space of a faulty process. However, in case of Byzantine fault, the points are distributed arbitrarily within a sub-space of topological execution space of faulty process having nonconvergent nature. The application of manifold structure to detect fail-stop fault in distributed computing (UCU mode) is illustrated in Figure .

Figure 14. Deformation in distributed computing manifold due to fail-stop fault.

Figure 14. Deformation in distributed computing manifold due to fail-stop fault.

The first (top) surface of Figure represents distributed computing having fail-stop fault within execution space of process 2. The local deformations are indicated on the surfaces representing fail-stop and regular computations in UCU mode. The manifold of a faulty distributed computation appears to be smoother with respect to regular manifold structure due to partial blocking of global execution space eliminating transitions. The deformation on manifold due to Byzantine fault in process 2 execution space in RCU mode of computation is illustrated in Figure .

Figure 15. Detecting Byzantine fault in distributed computing manifold (RCU mode).

Figure 15. Detecting Byzantine fault in distributed computing manifold (RCU mode).

The Byzantine fault induces a highly localized distortion in a bounded region. The distortion is severe due to Byzantine nature of fault in computation enhancing unpredictability. The manifold outside this region of deformation appears to be relatively smoother than regular distributed computation. This is due to the partial blocking of other processes due to existence of a Byzantine (faulty) process in the system. These examples illustrate the application of manifold model of distributed computation to monitor a system by detecting the existence of different types of failures of computation in a deterministic system. It would help the distributed systems designers to formulate and analyze possible fault models a priori along with detection of locality of faults in a system.

9. Conclusions

The observation of an asynchronous distributed computation is required in order to maintain stability in an execution sequence. The consistency of computation is evaluated for generating correct output of computation and to detect faults. The consistent observation of large-scale asynchronous distributed systems is challenging due to enlarged events space. The construction of computational model on manifold structure helps in gaining analytical insights to complex systems. The formulation proposed in this paper offers a model and analysis of consistent observation of asynchronous distributed computation on N-manifold. The fineness of consistent observation can be controlled by employing filter. The simulations of resulting structures offer better understanding of the dynamics of the computation in view of analysis.

Additional information

Funding

The author received no direct funding for this research.

Notes on contributors

Susmit Bagchi

Susmit Bagchi has received BSc (Honors) in 1993 from Calcutta University, B.E. in Electronics Engineering in 1997 from Nagpur University, M.E. in Electronics and Telecommunication Engineering in 1999 from Bengal Engineering and Science University (presently IIEST). He obtained PhD (Engineering) in Information Technology in 2008 from IIEST. Currently, he is Associate Professor in Department of Aerospace and Software Engineering (Informatics), Gyeongsang National University, Jinju, South Korea. His research interests are in Distributed Computing and Systems.

References

  • Akinwunmi, A. O. , Olajubu, E. A. , & Aderounmu, G. A. (2016). A multi-agent system approach for trustworthy cloud service discovery. Cogent Engineering , 3, Taylor & Francis, UK. doi: 10.1080/23311916.2016.1256084
  • Armstrong, M. A. (1983). Basic topology . New York, NY: Springer-Verlag. ISBN 978-1-4757-1793-8.
  • Ayara, A. , & Najjar, F. , A formal specification model of survivability for pervasive systems, In Proceedings of 2008 International Symposium on Parallel and Distributed Processing with Applications, Sydney, Australia, IEEE, 2008.
  • Babaoglu, O. , & Marzullo, K. (1993).Consistent global states of distributed systems: Fundamental concepts and mechanisms. Book: Distributed systems , 2nd ). pp. 55–96 ACM Press/Addison-Wesley Publishing: New York, NY ISBN 0-201-62427-3.
  • Bagchi, S. (2018). Formal analysis of control and termination of distributed computation in weaker spaces, Cogent Engineering , 5. U.K.: Taylor & Francis.
  • Bauer, U. , Kerber, M. , & Reininghaus, J. (2014). Distributed computation of persistent homology, In proceedings of Meeting on Algorithm Engineering and Experiments, SIAM, Oregon, USA, pp. 31–38. doi: 10.1002/jbm.b.32976
  • Carson, S. D. , & Reynolds, P. F., Jr. (1987). The geometry of semaphore programs. ACM Transaction Programming Languages Systems, ACM , 9(1), 25–53. doi:10.1145/9758.9759
  • Conde, R. , & Rajsbaum, S. (2012). An introduction to topological theory of distributed computing with safe-consensus. Electronic Notes in Theoretical Computer Science, Elsevier , 283, 29–51. doi:10.1016/j.entcs.2012.05.004
  • Edelsbrunner, H. , & Harer, J. (2010). Computational topology: An introduction . USA: American Mathematical Society.
  • Fajstrup, L. , Rauben, M. , & Goubault, E. 2006. Algebraic topology and concurrency. Theoretical Computer Science 357(1–3): 241–278. Elsevier. doi:10.1016/j.tcs.2006.03.022.
  • Fidge, C. (1996). Fundamentals of distributed system observation. IEEE Software , 13(6), 77–83. doi:10.1109/52.542297
  • Garg, V. K. , Agarwal, A. , & Ogale, V. (2014). Modeling. Analyzing and Slicing Periodic Distributed Computations, Information and Computation , 234(Elsevier), 26–43.
  • Goldberg, A. P. , Gopal, A. , Lowry, A. , & Strom, R. (1991). Restoring consistent global states of distributed computations, In Proceedings of the 1991 ACM/ONR Workshop on Parallel and Distributed Debugging (PADD 1991), ACM SIGPLAN, California, USA, pp. 144–154.
  • Goubault, E. (2003). Some geometric perspectives in concurrency theory. Homology Homotopy Applications , 5(2), 95–136. doi:10.4310/HHA.2003.v5.n2.a5
  • Gunawardena, J. (1994). Homotopy and concurrency. Bulletin of the EATCS , 54, 184–193.
  • Herlihy, M. , & Rajsbaum, S. , New perspectives in distributed computing. In Proceedings of the 24th International Symp. on Mathematical Foundations of Computer Science, LNCS, Vol. 1672, Springer, Poland, 1999, pp. 170–186.
  • Kuhn, F. , Lynch, N. , & Oshman, R. , Distributed computation in dynamic networks, Proceedings of the forty-second ACM symposium on Theory of computing (STOC’10), ACM, Massachusetts, USA, 2010, pp. 513–522. doi: 10.1177/1753193409357372
  • Lee, J. M. (2013). Introduction to smooth manifolds. Graduate texts in mathematics 218 . New York, NY: Springer. doi: 10.1007/978-1-4419-9982-5_1
  • Parlangeli, G. , & Notarstefano, G. (2012). On the reachability and observability of path and cycle graphs. IEEE Transactions on Automatic Control , 57(3), 743–748. doi:10.1109/TAC.2011.2168912
  • Ranga, V. , Dave, M. , & Verma, A. K. (2016). Restoration of lost connectivity of partitioned wireless sensor networks. Cogent Engineering , 3, Taylor & Francis, UK. doi: 10.1080/23311916.2016.1186263
  • Rhode, M. G. , Presicce, F. P. , Simeoni, M. , & Taentzer, G. ; Modeling distributed systems by modular graph transformation based on refinement via rule expressions, International Workshop on Applications of Graph Transformations with Industrial Relevance (AGTIVE), ACM, Netherlands, 1999, pp. 31–45.
  • Schwarz, R. , & Mattern, F. 1994. Detecting causal relationships in distributed computations: In search of the holy grail. Distributed Computing 7(3): 149–174. Springer. doi:10.1007/BF02277859.
  • Sitohang, B. , Parallel execution of relational algebra operator under distributed database systems, 2002 International Conference on Information Technology: Coding and Computing, IEEE, Las Vegas, USA, 2002, pp. 207–211.
  • Zomorodian, A. , & Carlsson, G. (2005). Computing persistent homology. Discrete and Computational Geometry , 33(2), 249–274. doi:10.1007/s00454-004-1146-y