1,525
Views
2
CrossRef citations to date
0
Altmetric
Research Articles

On the isolatitude property of the rHEALPix Discrete Global Grid System

&
Pages 362-377 | Received 28 Mar 2019, Accepted 12 Aug 2019, Published online: 12 Sep 2019

ABSTRACT

Digital Earth frameworks provide a way to integrate, analyze, and visualize large volumes of geospatial data, and the foundation of such frameworks is the Discrete Global Grid System (DGGS). One approach in particular, the rHEALPix DGGS, has the rare property of distribution of cell nuclei along rings of constant latitude (or isolatitude rings). However, this property is yet to be explored. In this paper, we extend existing work on the rHEALPix DGGS by proposing a method to determine the isolatitude ring on which the nucleus of a given cell falls by converting a cell identifier to isolatitude ring without recourse to geodetic coordinates. In addition, we present an efficient method to calculate the geodetic latitude of a cell’s nucleus via its associated isolatitude ring. Lastly, we use the proposed methods to demonstrate how the isolatitude property of the rHEALPix DGGS can be utilized to facilitate latitudinal data analysis at multiple resolutions.

1. Introduction

In recent years, the volume and variety of geospatial data have exploded and we are firmly in the era of geospatial big data. Our challenge now is how to store, integrate, analyze, and visualize this massive amount of information on a global scale. Digital Earth frameworks have emerged to meet this challenge and the foundation of such frameworks is the Discrete Global Grid System (DGGS) (Mahdavi-Amiri, Alderson, & Samavati, Citation2015). A DGGS can be thought of as a discrete, multiresolution, digital framework for referencing geospatial information to the earth. For the last two decades, DGGSs have played an important role in Digital Earth research (Adams, Citation2017; Amiri, Bhojani, & Samavati, Citation2013; Amiri, Samavati, & Peterson, Citation2015; Goodchild, Citation2000; Lin, Zhou, Xu, Zhu, & Lu, Citation2018; Sherlock, Citation2017; Vince & Zheng, Citation2009) but they are now gaining attention in the big data community and have recently been suggested as a solution to IoT (Internet of Things) data integration (Purss et al., Citation2017), big spatial vector data management (Yao & Li, Citation2018), and as the de facto global reference system for geospatial big data (OGC, Citation2017a). In addition, they have recently been utilized in the design of a scalable crowdsensing platform for geospatial data (Birk, Citation2018), and in the study of massive point cloud handling (Sirdeshmukh, Citation2018).

DGGSs represent a class of spatial data structures that consist of a hierarchy of discrete global grids at multiple resolutions (Sahr & White, Citation1998). Although several methods exist to create a DGGS, the most popular approach is to partition the faces of a platonic solid into equal area cells (hexagons, triangles, or quadrilaterals) and then inversely project the result to the surface of the sphere (or ellipsoid) using an equal area projection (Sahr, White, & Kimerling, Citation2003). This approach is preferable because a uniform grid of equal area cells simplifies data sampling and analysis (OGC, Citation2017b; Sahr et al., Citation2003; White, Kimerling, & Overton, Citation1992), as well as providing a way to store, manage, and integrate geospatial big data (OGC, Citation2017a). In fact, this approach is the basis of the Open Geospatial Consortium (OGC) DGGS Abstract Specification (OGC, Citation2017b) which aims to standardize the DGGS model, increase usability of DGGSs, and increase interoperability between DGGSs.

The rHEALPix DGGS (Gibb, Raichev, & Speth, Citation2016) is a promising OGC conformant quadrilateral-based approach with several interesting properties that distinguish it from other DGGSs. Arguably, the most unique property is the distribution of cell nuclei along rings of constant latitude (often referred to as isolatitude distribution), whereby at each resolution, k nuclei lie on Ok isolatitude rings (or rings of constant latitude) (Gibb et al., Citation2016). This property is inherited from the DGGS on which rHEALPix is based, namely the Hierarchical Equal Area isoLatitude Pixelization (HEALPix) DGGS (Gorski et al., Citation2005) and to our knowledge, no other DGGSs share this property. Gorski et al. (Citation2005) developed the HEALPix DGGS to enable efficient processing and analysis of large volumes of astronomical data distributed on a sphere, and the isolatitude property was specifically incorporated to support this. However, the isolatitude property on the rHEALPix DGGS is yet to be explored (Bowater & Stefanakis, Citation2019a).

In this paper, we explore the isolatitude property and extend existing theoretical work on the rHEALPix DGGS by proposing a method to determine the isolatitude ring on which the nucleus of a given cell falls. More specifically, we adopt the same indexing method, cell identifier (ID) notation, row notation, and column notation defined in Gibb et al. (Citation2016) and Gibb (Citation2016), to efficiently convert a unique cell ID to isolatitude ring without recourse to geodetic coordinates. In addition, we present a method to calculate the geodetic latitude of a cell’s nucleus via its associated isolatitude ring. Lastly, we consider latitudinal data analysis and move away from the traditional approach of using the geographic grid, by applying the proposed methods to demonstrate how we can utilize the isolatitude property of the rHEALPix DGGS at multiple resolutions.

The content of this paper is organized as follows. Section 2 provides relevant background information on the rHEALPix DGGS. Section 3 presents methods to (i) convert a unique cell ID to isolatitude ring and (ii) calculate the geodetic latitude associated with an isolatitude ring. Section 4 explores the application of the proposed methods for latitudinal data analysis on the rHEALPix DGGS. Section 5 concludes the paper and provides directions for future work.

2. The rHEALPix DGGS

The rHEALPix DGGS (Gibb et al., Citation2016) is a promising OGC conformant quadrilateral-based approach with several interesting properties that distinguish it from other DGGSs. For example, it has a hierarchical and congruent cell structure, an aligned cell structure for odd factors of refinement, equal area cells that completely cover the globe at every resolution, distribution of cell nuclei along rings of constant latitude, a planar projection consisting of horizontal-vertical aligned square grids with low average angular and linear distortion, and it is compatible with ellipsoids of revolution (Gibb, Citation2016). Arguably, the most unique property is the distribution of cell nuclei along rings of constant latitude (often referred to as isolatitude distribution), whereby at each resolution, k nuclei lie on Ok isolatitude rings (or rings of constant latitude) (Gibb et al., Citation2016). This property is inherited from the DGGS on which rHEALPix is based, namely the Hierarchical Equal Area isoLatitude Pixelization (HEALPix) DGGS (Gorski et al., Citation2005) and to our knowledge, no other DGGSs share this property.

The rHEALPix DGGS is constructed by projecting an ellipsoid of revolution (such as WGS84) onto the faces of a cube, subdividing each face into a grid of square cells, and then inversely projecting the result back onto the ellipsoid using the equal area rHEALPix projection. The definition of the rHEALPix DGGS describes a general class of DGGSs for any integer Nside2, where planar squares are divided into Nside×Nside sub-squares at successive resolutions. shows the planar grid and ellipsoidal grid of the rHEALPix DGGS with Nside=3 at resolution i=0 and i=1.

Figure 1. The planar grid and ellipsoidal grid of the rHEALPix DGGS with Nside=3 at (a) resolution i=0 and (b) resolution i=1 (based on the (0, 0)-rHEALPix projection) (Gibb, Citation2016).

Figure 1. The planar grid and ellipsoidal grid of the rHEALPix DGGS with Nside=3 at (a) resolution i=0 and (b) resolution i=1 (based on the (0, 0)-rHEALPix projection) (Gibb, Citation2016).

Each cell of the rHEALPix DGGS has a unique identifier (ID). Gibb et al. (Citation2016) define a cell ID to be a string beginning with one of the letters N,S,O,P,Q,R followed by a sequence of zero or more of the integers 0,1,2,,Nside21. The process of assigning a unique ID to each cell is as follows. First, assign the planar cells at resolution i=0 the IDs N,O,P,Q,R,S from top to bottom and left to right. Then, for each resolution i cell with ID t assign its Nside2 resolution i+1 sub-cells the IDs t0,t1,,tNside21 using a Z space filling curve from top to bottom and left to right (). As a result of this approach, the resolution of a cell can easily be determined from its ID by ID1. Note that IDs of planar cells are shared with their corresponding ellipsoidal cells.

Each planar cell of the rHEALPix DGGS has a nucleus which is defined as its centroid. The nucleus of an ellipsoidal cell is simply the inverse projection of the nucleus of its corresponding planar cell. Given that parallels are projected to horizontal lines in the equatorial region and concentric squares in the polar region (), it is clear that planar cell nuclei coincide with projected parallels ( shows an example of the rHEALPix DGGS with Nside=3 at resolution i=1). Consequently, ellipsoidal cell nuclei must also lie on rings of constant latitude. In the equatorial region, 4Nside2i ellipsoidal cells lie on Nsidei isolatitude rings. In the north polar region, there are Nside2i ellipsoidal cells if Nside is even, and Nside2i1 ellipsoidal cells if Nside is odd (excluding the cap cell whose nucleus coincides with the pole), which lie on Nsidei/2 isolatitude rings. The same can be said of the south polar region. In total, the nuclei of k=6Nside2i ellipsoidal cells lie on 2Nsidei1 (non-pole) isolatitude rings if Nside is odd, or 2Nsidei isolatitude rings if Nside is even. Therefore, k cells lie on Ok isolatitude rings (Gibb et al., Citation2016).

Figure 2. (a) The 0,0-rHEALPix projection, grid lines represent parallels and meridians (derived from Gibb, Citation2016). (b) Cell nuclei distributed along the planar projection of parallels.

Figure 2. (a) The 0,0-rHEALPix projection, grid lines represent parallels and meridians (derived from Gibb, Citation2016). (b) Cell nuclei distributed along the planar projection of parallels.

3. Extending the rHEALPix DGGS

In this section, we introduce a method to determine the isolatitude ring on which the nucleus of a given cell falls using only the cell ID. Importantly, this method is only applicable to the rHEALPix DGGS with Nside=2 and Nside=3. We focus on these two values because (i) the Nside=3 approach is the smallest integer that produces aligned hierarchies and is the preferred approach in existing work (Bowater & Stefanakis, Citation2018, Citation2019b; Gibb, Citation2016) and (ii) the Nside=2 approach maximizes the number of resolution levels under a fixed number of cells (i.e. provides a smooth transition) and it can be exactly encoded using 2 bits.

It is important to mention that during our attempts to generalize the method for any Nside value, it became clear that a fundamental problem exists with the definition of the rHEALPix DGGS when Nside>3. This is easily demonstrated by considering the case when Nside=4. At resolution i=1, each base cell with ID t will have 16 sub-cells with IDs t0,t1,t2,t3,t4,,t12,t13,t14,t15. Because six of the cells have IDs consisting of two numerical digits, it is no longer possible to relate cell ID to resolution using the string length of the ID. Note that hexadecimal encoding could potentially solve this problem (when Nside=4) but such an approach is not considered in the current rHEALPix DGGS definition. More importantly however, cell IDs will no longer be unique across all resolutions. For example, at resolution i=2, a cell with ID t1 will have 16 sub-cells with IDs t10,t11,t12,t13,t14,t15,,t113,t114,t115 (recall that for each resolution i cell with ID t, its Nside2 resolution i+1 sub-cells have IDs t0,t1,,tNside21). Clearly, there are now cells at resolution i=1 and i=2 with the same ID. For example, cell ID t15 represents a resolution i=1 sub-cell of cell t, but it is also represents a resolution i=2 sub-cell of cell t1. This is a major issue because a fundamental requirement for any OGC conformant DGGS is the uniqueness of cell IDs across all resolutions (OGC, Citation2017b). Therefore, the definition of the rHEALPix DGGS should be amended to account for the issues that arise when Nside>3, or be constrained to the valid and perhaps most pertinent cases when Nside=2 and Nside=3.

In this method, isolatitude rings are expressed as integer values increasing from north to south and ranging from 123i1 (excluding the poles) when Nside=3, and 122i when Nside=2, where i is the resolution of the grid. To ensure this method is compatible with previous work, we utilize the same cell ID notation, indexing method, row notation, and column notation defined in Gibb et al. (Citation2016) and Gibb (Citation2016). In addition, we present a method to convert an isolatitude ring value to its associated geodetic latitude, which ultimately provides a way to calculate the geodetic latitude of a cell’s nucleus via its associated isolatitude ring. We end this section by comparing the computation speed of the proposed methods when Nside=3 against the method described in Section 6 of Gibb et al. (Citation2016) for calculating the geodetic latitude of a cell’s nucleus for all cell IDs at various resolutions.

To determine the isolatitude ring value associated with a given cell ID, we utilize the planar grid representation of the rHEALPix DGGS rather than the ellipsoidal grid representation because it has the advantage that all cells are squares uniformly positioned in rows and columns at every resolution. That being said, using the planar grid is not without challenges. The main one relates to how lines of constant latitude are represented. In the equatorial region, lines of constant latitude are projected as horizontal lines whereas in the polar region, lines of constant latitude are projected as concentric squares. As a result, this method addresses the equatorial region and polar region separately. Because the conversion process is simplest in the equatorial region, we will start there and then build on this approach for the polar region.

3.1. Equatorial region

First, let us define MNPi to be the number of non-pole isolatitude rings in the north polar region at resolution i, where MNPi=Nsidei/2. Let MEi be the number of isolatitude rings in the equatorial region at resolution i, where MEi=Nsidei. Also, given the ID of a cell, let the resolution i of that cell be determined by i=ID1.

Now let us consider cells in the equatorial region when Nside=3 at resolution i=1. If we convert the ID of every cell ()) into row,column notation (as described in Section 4 of Gibb (Citation2016)) and record just the rows ()), it is clear that every column is identical in this notation. If we take only the digits in row notation to the right of the decimal point, let r3 be the base3 integer denoted by these digits ()), and let r be the base10 integer equivalent to r3 ()), then as we move down the rows from top to bottom, values of r increase from 02. For cells at resolution i=2, values of r increase from 08 as we move down the rows from top to bottom. In general, we observe that for cells at any resolution i, values of r increase from 0MEi1. Knowing this, we can simply offset the values of r by the number of non-pole isolatitude rings in the north polar region MNPi to determine the isolatitude ring associated with any cell in the equatorial region.

Figure 3. Equatorial region cells at resolution i=1 when Nside=3 expressed using (a) cell IDs, (b) row notation, (c) r3 notation, and (d) r notation.

Figure 3. Equatorial region cells at resolution i=1 when Nside=3 expressed using (a) cell IDs, (b) row notation, (c) r3 notation, and (d) r notation.

We can adopt a similar method for cells in the equatorial region when Nside=2. However, rather than using base3 in the construction of row,column notation, we use base2. For example, at resolution i=1 the row notation of cells P0,P1,P2,P3, would be 13.02,13.02,13.12,13.12, respectively. We then let r2 be the base2 integer denoted by the digits to the right of the decimal point in row notation, and let r be the base10 integer equivalent to r2. Then, as we move down the rows from top to bottom, values of r increase from 0MEi1. Therefore, we can generalize the method for either Nside value if we use the relevant baseNside value in the construction of row,column notation and let r be the base10 integer equivalent to rNside. Thus, given the ID of an equatorial region cell, we can determine the isolatitude ring on which the nucleus of the cell falls, denoted IE, using IE=MNPi+r+1.

3.2. Polar region

Let us first consider the case when Nside=3. In the north polar region at resolution i=1, base cell N has nine sub-cells with IDs N0,N1,N2,,N8. Knowing that the north pole is projected to the center of base cell N and thus coincides with the nucleus of cell N4, it is clear that the eight remaining cells lie on the only isolatitude ring. In general, at resolution i there are MNPi non-pole isolatitude rings so it makes sense to assign the value of zero to the cell that covers the north pole. shows this scenario at resolution i=1 and i=2.

Figure 4. IDs and corresponding isolatitude ring values for (a) resolution i=1 cells and (b) resolution i=2 cells in cell N when Nside=3.

Figure 4. IDs and corresponding isolatitude ring values for (a) resolution i=1 cells and (b) resolution i=2 cells in cell N when Nside=3.

At this point, let us define c3 to be the base3 integer denoted by the digits to the right of the decimal point of the column in row,column notation (as described in Section 4 of Gibb (Citation2016)), and c to be the base10 integer equivalent to c3. Furthermore, let us assign the values r0 and c0 to the cell covering the north pole at resolution i, where r0=c0=3i/2=MNPi. Then, given the ID of a cell in the north polar region when Nside=3 at resolution i, we can determine the isolatitude ring on which the nucleus of the cell falls, denoted IN3, using IN3=maxabsrr0,abscc0.

In the south polar region, the process to determine the isolatitude ring associated with a given cell is similar to the north polar region, with a slight difference. First, let us recall that in total, there are 2Nsidei1 (non-pole) isolatitude rings if Nside is odd. Therefore, if we define ISPi to be the isolatitude ring value assigned to the cell that covers the south pole at resolution i where ISPi=23i, then given the ID of any cell in the south polar region at resolution i, we can determine the isolatitude ring on which the nucleus of the cell falls, denoted IS3, using IS3=ISPimaxabsrr0,abscc0.

With a slight modification, the methods for the north and south polar regions can be extended for the case when Nside=2. In a similar manner to r, let us generalize c by using the relevant baseNside value in the construction of row, column notation and let c be the base10 integer equivalent to cNside. Furthermore, if we let r0=c0=2i/2=MNPi, then given the ID of a cell in the north polar region when Nside=2 at resolution i, we can determine the isolatitude ring on which the nucleus of the cell falls, denoted IN2, using Equation (1).

(1) IN2ID=maxabsrr0,abscc0,         r+c<2imaxabsrr0,abscc0+1,   else(1)

For cells in the south polar region, recall there are a total of 2Nsidei isolatitude rings if Nside is even. In addition, there is no sub-cell in base cell S that covers the south pole. Therefore, if we use 2Nsidei rather than ISPi, then given the ID of a cell in the south polar region when Nside=2 at resolution i, we can determine the isolatitude ring on which the nucleus of the cell falls, denoted IS2, using Equation (2).

(2) IS2ID=2Nsideimaxabsrr0,abscc0+1,   r+c<2i2Nsideimaxabsrr0,abscc0,  else(2)

3.3. Isolatitude ring to geodetic latitude conversion

Here we present a method to determine the geodetic latitude associated with an isolatitude ring value I at resolution i when Nside=2 and Nside=3.

Let us consider the 0,0-rHEALPix projection of the unit sphere as a simple rearrangement of the 1,2-rHEALPix projection presented in Gibb et al. (Citation2016). In this projection, the x,y coordinates of the north pole and south pole are 3π/4,π/2 and 3π/4,π/2, respectively. In addition, we know from Gibb et al. (Citation2016) that the width of a planar cell w at resolution i is π/2Nsidei. Therefore, if we draw a vertical line between the north pole and south pole coordinates, we can conclude that cell nuclei are evenly spaced at intervals of w along this line.

Now, if we consider the case where Nside=3, then for a given resolution i there are 23i1 non-pole isolatitude rings which means there are an equivalent number of cell nuclei on the line between the north pole and south pole. Knowing that cell nuclei coincide with isolatitude rings, we can easily deduce that the first cell nucleus on the line represents the first isolatitude ring I=1, the second cell nucleus represents the second isolatitude ring I=2, and so on until the last cell nucleus represents isolatitude ring I=23i1. shows this scenario at resolution i=1. Therefore, given a non-pole isolatitude ring value I and resolution i, we can determine the y coordinate of I by y=π/2wI where w=π/23i.

The case where Nside=2 follows a similar pattern except that for a given resolution i there are 22i isolatitude rings, and the distance between the first isolatitude ring and the north pole is half the cell width w (similarly for the last isolatitude ring and the south pole). Therefore, we simply need to add an offset to the formula presented above for Nside=3 to get y=π/2+w/2wI where w=π/22i.

Figure 5. Determining the y coordinate of non-pole isolatitude rings when Nside=3 at resolution i=1.

Figure 5. Determining the y coordinate of non-pole isolatitude rings when Nside=3 at resolution i=1.

Lastly, we can use formulae from Appendix A in Gibb et al. (Citation2016) to convert the y coordinate to authalic latitude and then to geodetic latitude. Evidently, by using the methods described above, it is possible to calculate the geodetic latitude of a cell’s nucleus via its associated isolatitude ring by simply converting cell ID to isolatitude ring to geodetic latitude.

3.4. Performance comparison

To verify the simplicity of the proposed methods and to gain a better understanding of their efficiency, we compared the computation speed of the proposed methods for Nside=3 against the method described in Section 6 of Gibb et al. (Citation2016) for calculating the geodetic latitude of a cell’s nucleus for all cell IDs at various resolutions. Tests were performed using the Python programming language on a 64-bit Windows 10 machine with an Intel Core i7-4800MQ CPU and 8 GB of RAM. and show the average computation speeds in the equatorial region and polar region, respectively.

Table 1. Cell ID to geodetic latitude computation speeds in the equatorial region when Nside=3.

Table 2. Cell ID to geodetic latitude computation speeds in the polar region when Nside=3.

As shown in , the average computation speed of the proposed method is approximately three times faster in the polar region. The difference in computation speeds is predominantly because the Gibb et al. (Citation2016) method converts a cell ID to x,y coordinates in the rHEALPix projection followed by a costly matrix-based conversion to x,y coordinates in the HEALPix projection. In contrast, the proposed method converts a cell ID to isolatitude ring and then to a y coordinate in the HEALPix projection using basic arithmetic. As a result, the proposed method avoids the costly conversion, is less computationally complex, and takes less time as a result. Note that both methods use the same formulae to convert the y coordinate in the HEALPix projection to geodetic latitude.

In the equatorial region, both methods have similar computation speeds, as shown in . Although the proposed method follows the same computational process as for the polar region, the Gibb et al. (Citation2016) method is computationally simpler for the equatorial region. This is evident by noticing that computation speeds in the equatorial region are significantly less than the polar region even though the equatorial region has double the amount of cells at each resolution. The main reason for this is because x,y coordinates are equivalent in both rHEALPix and HEALPix projections which means costly conversions are avoided. In addition, similar to the proposed approach, the x coordinate can be ignored in the computations (unlike in polar region formulae). As a result, both methods have similar complexity and therefore similar speeds.

4. Application of proposed methods for latitudinal data analysis

As mentioned earlier, a DGGS can be thought of as a discrete, multiresolution, digital framework that allows us to store, integrate, analyze, and visualize geospatial big data on a global scale. In this section, we consider latitudinal data analysis and move away from the traditional approach of using the geographic grid, by applying the methods proposed thus far to demonstrate how we can utilize the isolatitude property of the rHEALPix DGGS.

The benefits of analyzing data using a latitudinal approach are clearly emphasized in Kummu and Varis (Citation2011) which to our knowledge, is the most recent and comprehensive paper dedicated to latitudinal data analysis. In this paper, the authors state that although it is common to analyze population, environment, social, and economic development data using administrative boundaries, it can be argued that distance to the equator is one of the key factors affecting the living conditions of human societies. As a result, they use a latitudinal approach based on equal degree latitudinal bands to analyze such data on a global scale to provide an alternative viewpoint that complements administrative boundary-based analyses. In fact, the authors point out that their analyses reveal interesting latitudinal patterns relating to economic development and CO2 emissions for example, that are not evident in cross-country analyses.

Recall that in the DGGS model, geospatial data (be it vector, raster, or point cloud data) is quantized into cells which can subsequently be retrieved using a cell’s unique ID. In addition, the OGC DGGS Abstract Specification states that a DGGS must include encoding methods to perform algebraic operations on cells and data assigned to them (OGC, Citation2017b). In fact, this prompted Gibb (Citation2016) to show how cell adjacency and topological relationships (such as contains and touches) can be efficiently computed on the rHEALPix DGGS using cell IDs directly rather than geodetic coordinates. Therefore, once data is embedded into the rHEALPix DGGS, the proposed method of converting a cell ID directly to isolatitude ring without recourse to geodetic coordinates, enables the isolatitude property to be utilized for latitudinal data analysis in such a way that it builds on the work by Gibb (Citation2016) and supports the OGC specification.

To demonstrate the feasibility of the proposed approach, we use the rHEALPix DGGS when Nside=3 to explore how population changes over Canada in 2015 at different resolutions. The dataset that we use is published by NASA Socioeconomic Data and Applications Center (Center for International Earth Science Information Network-CIESIN-Columbia University, Citation2018) and includes population estimates (among many other attributes) for administrative units across Canada from 2000 to 2020 (at 5-year intervals). Administrative units are represented as point locations using WGS84 centroid coordinates. In total, the vector dataset includes information for 493193 locations across Canada.

Despite the large volume of points in the dataset, which we found GIS software has difficulty opening, processing the data using the proposed methods is straightforward. First, we need to determine which cell (at a desired resolution) each point location falls in. This can be achieved using formulae described in Gibb et al. (Citation2016) that converts WGS84 geodetic coordinates to cell ID. To accomplish this task, we utilized the coordinate conversion functionality of the open-source web service described in Bowater and Stefanakis (Citation2019b). Second, we must convert the cell ID associated with each point location to isolatitude ring using the proposed methods. This allows us to then aggregate the data based on a simple integer attribute (i.e. isolatitude ring) rather than using costly spatial functions on point geometries. This task was programmed in Python and made use of the pandas Python package because it is fast, flexible, and well suited for performing data analysis on large datasets (see https://pypi.org/project/pandas/).

shows how population changes with isolatitude ring over Canada in 2015 at two different resolutions. Note that a resolution 4 grid (114 km ellipsoidal cell width) corresponds to roughly a 1 geographic grid (111 km at equator), and a resolution 6 grid (13 km ellipsoidal cell width) corresponds to roughly a 6 geographic grid (11 km at equator). Both plots show the same general trend of having the majority of the population in the south (corresponding to high isolatitude ring values). This is expected because the bulk of the population live near the US border. Furthermore, on both plots we can identify peaks in the population and the corresponding isolatitude rings at which they occur. The resolution 4 grid provides this information at a small scale (114 km isolatitude ring width), whereas the resolution 6 grid provides a larger scale view (13 km isolatitude ring width). The benefit of the larger scale view is a more fine-grained representation of the change in population which allows us to better identify the main peaks, but also sub-peaks that are lost in the small scale view.

Figure 6. Population change with respect to isolatitude ring at resolution 4 (top) and resolution 6 (bottom).

Figure 6. Population change with respect to isolatitude ring at resolution 4 (top) and resolution 6 (bottom).

Even though it is clear to see in how the Canadian population changes along the north-south axis, isolatitude ring values by themselves provide no geospatial reference. Therefore, if there is a need to extract geospatial information from the data, for instance the geodetic latitudes associated with population peaks, we could use the simple conversion method proposed in Section 3.3 to convert an isolatitude ring value to geodetic latitude for the peaks of interest. Alternatively, we could convert every isolatitude ring value to geodetic latitude and plot these values against population instead. For example, shows the latitudinal change in the 2015 population at resolution 4 where the two largest peaks can be seen at approximately 49 N and 44 N. To go further, we could take advantage of the inherent nature of a DGGS and visualize the data directly on the earth’s surface using a virtual globe such as ArcGIS Earth or Google Earth. This approach would allow us to rotate the globe and zoom in/out on different areas, thereby providing a more interactive environment in which to explore the data compared to that of a static map visualization. Thus, shows the latitudinal change in the 2015 population at resolution 4 in Google Earth, where isolatitude rings are colored by population ranging from blue (lowest) to red (highest). To create , we first utilized the open-source web service from Bowater and Stefanakis (Citation2019b) along with polygon data from Natural Earth (https://www.naturalearthdata.com/) to create a resolution 4 grid over Canada. Then, we used the GeoPandas Python package (http://geopandas.org/) to join the vector grid data with isolatitude ring and population attribute data. Lastly, we used GIS software to color the cells based on population and to create a Keyhole Markup Language (KML) file suitable for Google Earth.

Figure 7. Latitudinal change in population at resolution 4.

Figure 7. Latitudinal change in population at resolution 4.

Figure 8. A globe-based visualization showing latitudinal change in population at resolution 4 (isolatitude rings are colored by population ranging from blue (lowest) to red (highest)) (©2018 Google).

Figure 8. A globe-based visualization showing latitudinal change in population at resolution 4 (isolatitude rings are colored by population ranging from blue (lowest) to red (highest)) (©2018 Google).

As we have shown, the isolatitude property of the rHEALPix DGGS can be utilized to facilitate latitudinal data analysis at multiple resolutions. Although we have only considered Canadian population data in our examples, the process could easily be applied to other data sets and at a global scale. Of course, massive global data sets would require more memory and more time to process which may be problematic using current software. However, it is possible to envisage a mature implementation of the rHEALPix DGGS that is capable of efficiently handling data sets of any size and any spatial extent. In such a computing environment, isolatitude ring values could easily be stored as integer attributes for all cell IDs at every resolution. Thus, utilizing the isolatitude property to perform latitudinal data analysis at regional or global scale would be straightforward.

5. Conclusion

Arguably, the most unique property of the rHEALPix DGGS is distribution of k cell nuclei along Ok rings of constant latitude. In this paper, we have explored this isolatitude property. Firstly, we presented a method to determine the isolatitude ring on which the nucleus of a given cell falls. Importantly, the method extends existing work on the rHEALPix DGGS to convert any cell ID to isolatitude ring without recourse to geodetic coordinates. Secondly, we presented a method to convert isolatitude ring to geodetic latitude. Combining these two methods provides a way to determine the geodetic latitude associated with the nucleus of any cell ID. In the polar region especially, this approach shows advantages in terms of computation speed. Lastly, we used the proposed methods to demonstrate how the isolatitude property of the rHEALPix DGGS can be utilized to facilitate latitudinal data analysis at multiple resolutions.

The methods introduced in this paper focus on the rHEALPix DGGS when Nside=3 and Nside=2. We focused on these two values because the Nside=3 approach is the smallest integer that produces aligned hierarchies and is the preferred approach in existing work, while the Nside=2 approach maximizes the number of resolution levels under a fixed number of cells and can be exactly encoded using 2 bits. Moreover, a fundamental problem exists with the definition of the rHEALPix DGGS when Nside>3, namely cell IDs are not unique across all resolutions. Therefore, a solution to this issue should be explored in future work.

Finally, the application of the isolatitude property toward harmonic analysis remains to be investigated (Bowater & Stefanakis, Citation2019a). In their work on the HEALPix DGGS, Gorski et al. (Citation2005) state that the isolatitude property is essential for computational speed in operations involving spherical harmonics, which suggests the rHEALPix DGGS is a good choice for geospatial applications (such as gravitational field modelling) involving harmonic analysis (Gibb et al., Citation2016). Although the methods presented here could facilitate such work, a thorough investigation on the potential advantages of this approach is yet to be carried out.

Acknowledgments

We would like to thank the anonymous reviewers for their insightful comments that helped to improve the manuscript. This work was funded by the Natural Sciences and Engineering Research Council of Canada (NSERC).

Data availability statement

The data that support the findings of this study are openly available at https://doi.org/10.7927/h4bc3wmt.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was supported by the Natural Sciences and Engineering Research Council of Canada.

References

  • Adams, B. (2017). Wāhi, a discrete global grid gazetteer built using linked open data. International Journal of Digital Earth, 10(5), 490–503.
  • Amiri, A., Samavati, F., & Peterson, P. (2015). Categorization and conversions for indexing methods of Discrete Global Grid Systems. ISPRS International Journal of Geo-Information, 4(1), 320–336.
  • Amiri, A. M., Bhojani, F., & Samavati, F. (2013). One-to-two digital earth. In G. Bebis et al. (Ed.), Advances in visual computing. ISVC 2013. Lecture notes in computer science (Vol. 8034, pp. 681–692). Berlin, Heidelberg: Springer. doi:10.1007/978-3-642-41939-3_67
  • Birk, F. (2018). Design and Implementation of a Scalable Crowdsensing Platform for Geospatial Data ( Master’s Thesis). Ulm University.
  • Bowater, D., & Stefanakis, E. (2018). The rHEALPix Discrete Global Grid System: considerations for Canada. Geomatica, 72(1), 27–37.
  • Bowater, D., & Stefanakis, E. (2019a, February 22–23). Research directions for the rHEALPix Discrete Global Grid System. Proceedings of the Conference on Spatial Knowledge and Information Canada (SKI 2019), (Vol. 2323). Banff, Alberta, Canada: CEUR-WS.org. [Online]. Retrieved from http://ceur-ws.org/Vol-323/SKI-Canada-2019-7-6-2.pdf.
  • Bowater, D., & Stefanakis, E. (2019b). An open-source web service for creating quadrilateral grids based on the rHEALPix Discrete Global Grid System. International Journal of Digital Earth. doi:10.1080/17538947.2019.1645893
  • Center for International Earth Science Information Network-CIESIN-Columbia University. (2018). Gridded Population of the World, Version 4 (GPWv4): Administrative Unit Center Points with Population Estimates, Revision 11 [Data set]. Palisades, NY: NASA Socioeconomic Data and Applications Center (SEDAC). doi:10.7927/h4bc3wmt.
  • Gibb, R., Raichev, A., & Speth, M. (2016). The rHEALPix Discrete Global Grid System. Retrieved from https://datastore.landcareresearch.co.nz/dataset/rhealpix-discrete-global-grid-system
  • Gibb, R. G. (2016). The rHEALPix Discrete Global Grid System. IOP Conference Series: Earth and Environmental Science, 34. doi:10.1088/1755-1315/34/1/012012
  • Goodchild, M. F. (2000, March 26–28). Discrete Global Grids for Digital Earth. Proceedings of the 1st International Conference on Discrete Global Grids, Santa Barbara, CA, USA.
  • Gorski, K. M., Hivon, E., Banday, A. J., Wandelt, B. D., Hansen, F. K., Reinecke, M., & Bartelman, M. (2005). HEALPix – A framework for high resolution discretization, and fast analysis of data distributed on the sphere. The Astrophysical Journal, 622(2), 759–771.
  • Kummu, M., & Varis, O. (2011). The world by latitudes: A global analysis of human population, development level and environment across the north-south axis over the past half century. Applied Geography, 31(2), 495–507.
  • Lin, B., Zhou, L., Xu, D., Zhu, A. X., & Lu, G. (2018). A discrete global grid system for earth system modelling. International Journal of Geographical Information Science, 32(4), 711–737.
  • Mahdavi-Amiri, A., Alderson, T., & Samavati, F. (2015). A survey of Digital Earth. Computers and Graphics, 53, 95–117.
  • OGC (2017a). OGC announces a new standard that improves the way information is referenced to the earth. Retrieved from http://www.opengeospatial.org/pressroom/pressreleases/2656
  • OGC (2017b). Topic 21: Discrete Global Grid Systems abstract specification. Retrieved from http://docs.opengeospatial.org/as/15-104r5/15-104r5.html
  • Purss, M. B. J., Liang, S., Gibb, R., Samavati, F., Peterson, P., Ben, J., … Saeedi, S. (2017). Applying discrete global grid systems to sensor networks and the Internet of things. IEEE International Geoscience and Remote Sensing Symposium (IGARSS) (pp. 5581–5583). Fort Worth, TX.doi:10.1109/IGARSS.2017.8128269
  • Sahr, K., & White, D. (1998). Discrete Global Grid Systems. Proceedings of the 30th Symposium on the Interface, Computing Science and Statistics (pp. 269–278), Minneapolis, Minnesota.
  • Sahr, K., White, D., & Kimerling, A. J. (2003). Geodesic Discrete Global Grid Systems. Cartography and Geographic Information Science, 30(2), 121–134.
  • Sherlock, M. (2017). Visualisations on a Web-Based View-Aware Digital Earth ( Master’s Thesis). University of Calgary.
  • Sirdeshmukh, N. (2018). Utilizing a Discrete Global Grid System For Handling Point Clouds With Varying Locations, Times, and Levels of Detail ( Master’s Thesis). Delft University of Technology.
  • Vince, A., & Zheng, X. (2009). Arithmetic and fourier transform for the PYXIS multi-resolution digital earth model. International Journal of Digital Earth, 2(1), 59–79.
  • White, D., Kimerling, J. A., & Overton, S. W. (1992). Cartographic and geometric components of a global sampling design for environmental monitoring. Cartography and Geographic Information Systems, 19(1), 5–22.
  • Yao, X., & Li, G. (2018). Big spatial vector data management: a review. Big Earth Data, 2(1), 108–129.