International Journal of Geographical Information Science Volume 35, 2021 - Issue 11

Research Articles

Spatially–encouraged spectral clustering: a technique for blending map typologies and regionalization

Levi John Wolf
School of Geographical Sciences, University of Bristol, Bristol, UK
Correspondence: [email protected]
ORCID: https://orcid.org/0000-0003-0274-599X

Pages 2356-2373 | Received 20 Apr 2018, Accepted 21 May 2021, Published online: 05 Jul 2021

https://doi.org/10.1080/13658816.2021.1934475

ABSTRACT

Clustering is a central concern in geographic data science and reflects a large, active domain of research. In spatial clustering, it is often challenging to balance two kinds of ‘goodness of fit:’ clusters should have ‘feature’ homogeneity, in that they aim to represent one ‘type’ of observation, and also ‘geographic’ coherence, in that they aim to represent some detected geographical ‘place’. This divides ‘map typologization’ studies, common in geodemographics, from ‘regionalization’ studies, common in spatial optimization and statistics. Recent attempts to simultaneously typologize and regionalize data into clusters with both feature homogeneity and geographic coherence have faced conceptual and computational challenges. Fortunately, new work on spectral clustering can address both regionalization and typologization tasks within the same framework. This research develops a novel kernel combination method for use within spectral clustering that allows analysts to blend smoothly between feature homogeneity and geographic coherence. I explore the formal properties of two kernel combination methods and recommend multiplicative kernel combination with spectral clustering. Altogether, spatially encouraged spectral clustering is shown as a novel kernel combination clustering method that can address both regionalization and typologization tasks in order to reveal the geographies latent in spatially structured data.

KEYWORDS:

Clustering
geodemographics
spatial analysis
spectral clustering

Data and codes availability statement

All code and documentation for the plots and algorithms in this paper are made available on the Open Science Framework (https://doi.org/10.17605/OSF.IO/FCS5X). Furthermore, a generalized spatially encouraged spectral clustering algorithm has been made available in the PySAL package (Rey and Anselin 2007) as part of the spopt subpackage. The algorithm depends primarily on NumPy (van der Walt et al. 2011) and scikit-learn (Pedregosa et al. 2011).
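As a rough illustration of the multiplicative kernel combination recommended in the abstract, the sketch below multiplies an attribute kernel elementwise by a spatial kernel using NumPy. It is a minimal sketch under stated assumptions, not the implementation distributed with PySAL: the function names, the `gamma` bandwidth, the toy data, and the use of a binary contiguity matrix as the spatial kernel are all hypothetical choices made for illustration.

```python
import numpy as np


def negative_exponential_kernel(X, gamma=1.0):
    """Attribute affinity exp(-gamma * d_ij), where d_ij is Euclidean distance.

    Because d_ij >= 0, every value already lies in (0, 1].
    `gamma` is an illustrative bandwidth, not a value taken from the paper.
    """
    diffs = X[:, None, :] - X[None, :, :]
    dists = np.sqrt((diffs ** 2).sum(axis=-1))
    return np.exp(-gamma * dists)


def multiplicative_combination(attribute_kernel, spatial_kernel):
    """Elementwise product: affinity survives only where both kernels are nonzero."""
    return attribute_kernel * spatial_kernel


# Toy data (hypothetical): four observations arranged on a path 0-1-2-3.
X = np.array([[0.0], [0.1], [5.0], [5.1]])
W = np.array(
    [
        [0, 1, 0, 0],
        [1, 0, 1, 0],
        [0, 1, 0, 1],
        [0, 0, 1, 0],
    ],
    dtype=float,
)

K_attr = negative_exponential_kernel(X)
K = multiplicative_combination(K_attr, W)
```

In this toy case, observations 0 and 1 are both neighbors and similar, so they keep a strong affinity, while observations 1 and 2 are neighbors but dissimilar, so their affinity is close to zero. A matrix like `K` could then be supplied to a spectral clustering routine that accepts precomputed affinities, such as scikit-learn's `SpectralClustering(affinity="precomputed")`.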

Disclosure Statement

No potential conflict of interest was reported by the author(s).

Notes

1. Further types of ‘core detection’ (Aldstadt and Getis 2006, Murray et al. 2014, Kim et al. 2017) or ‘boundary detection’ (Jacquez et al. 2008, Dean et al. 2018, Dong et al. 2018) allow for ‘non-exhaustive’ partitions, where observations can evade cluster assignments. These are not of interest here; in the terminology of Kim et al. (2017), this means only ‘districting’ methods are considered.

2. Numerically, it is common to use a kernel function, such as the negative exponential kernel, and standardize the resulting values to lie between $0$ and $1$.

3. Although the minimum size or shape regularity are not parameterized directly as in other methods (Duque et al. 2012, Li et al. 2014).

4. In their specific case, Yuan et al. (2015) cluster principal components derived from many mean-centered and unit-deviation standardized covariates. However, $\tau^{2}$ is not intended to stand in for the empirical variance of $X$ generally, since $X$ may be $N \times P$ with a different variance for each feature, whereas $\tau^{2}$ is a scalar used for all $P$ features.

5. This algorithm is made available post-publication in PySAL, the Python spatial analysis library (Rey and Anselin 2007), and is built primarily using NumPy (van der Walt et al. 2011) and scikit-learn (Pedregosa et al. 2011).

6. The binarized contiguity kernel is used here for simplicity. Each row of uses the $A_{\eta}$ connectivity matrix, connecting observations with maximum path order $\eta$, since the non-binary exponential kernel behaves substantively similarly to $\eta = 0$.

7. Further, this has similar semantics to the Queen contiguity matrix used in the previous example: the Delaunay triangulation is the dual graph of a Voronoi diagram for the Airbnbs, as the Queen contiguity graph is a kind of dual graph for the Texas counties. Their order statistics, both at first order and higher, are also similar. Alternative spatial kernels, like $k$-nearest neighbor or distance-weighted kernels, could also be used.

8. Precisely, I set aside 25% of the listings, compute their nearest geographic cluster, and predict their price using the mean cluster price. I am grateful to an anonymous reviewer for proposing this method.
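The hold-out check described in note 8 can be sketched as follows. This is a minimal sketch under stated assumptions rather than the author's code: the note does not say how the 'nearest geographic cluster' is measured, so the example assumes distance to cluster centroids, and the function names, seed, and toy data are hypothetical.

```python
import numpy as np


def split_holdout(n, fraction=0.25, seed=0):
    """Randomly set aside `fraction` of n observations; returns (train_idx, test_idx)."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n)
    n_test = int(round(fraction * n))
    return order[n_test:], order[:n_test]


def predict_from_nearest_cluster(train_coords, train_prices, train_labels, test_coords):
    """Predict each held-out price as the mean price of its nearest cluster.

    Assumption: 'nearest' means smallest Euclidean distance to a cluster centroid.
    """
    cluster_ids = np.unique(train_labels)
    centroids = np.array(
        [train_coords[train_labels == c].mean(axis=0) for c in cluster_ids]
    )
    mean_prices = np.array(
        [train_prices[train_labels == c].mean() for c in cluster_ids]
    )
    dists = np.linalg.norm(
        test_coords[:, None, :] - centroids[None, :, :], axis=-1
    )
    return mean_prices[dists.argmin(axis=1)]


# Toy data (hypothetical): two well-separated clusters of listings.
train_coords = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]])
train_prices = np.array([90.0, 110.0, 190.0, 210.0])
train_labels = np.array([0, 0, 1, 1])
test_coords = np.array([[1.0, 0.0], [9.0, 10.0]])

predicted = predict_from_nearest_cluster(
    train_coords, train_prices, train_labels, test_coords
)

# Index split for a dataset of 8 listings: 6 kept for clustering, 2 held out.
train_idx, test_idx = split_holdout(8, fraction=0.25, seed=0)
```

Comparing `predicted` against the observed held-out prices with any error measure (the note does not name one) would then summarize how well the clusters capture price.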

Aldstadt, J. and Getis, A., 2006. Using AMOEBA to create a spatial weights matrix and identify spatial clusters. Geographical Analysis, 38 (4), 327–343. doi:https://doi.org/10.1111/j.1538-4632.2006.00689.x.

Murray, A.T., Grubesic, T.H., and Wei, R., 2014. Spatially significant cluster detection. Spatial Statistics, 10, 103–116. doi:https://doi.org/10.1016/j.spasta.2014.03.001.

Kim, K., Chun, Y., and Kim, H., 2017. P-functional clusters location problem for detecting spatial clusters with covering approach. Geographical Analysis, 49 (1), 101–121. doi:https://doi.org/10.1111/gean.12109.

Jacquez, G.M., Kaufmann, A., and Goovaerts, P., 2008. Boundaries, links and clusters: a new paradigm in spatial analysis? Environmental and Ecological Statistics, 15 (4), 403–419. doi:https://doi.org/10.1007/s10651-007-0066-4.

Dean, N., et al., 2018. Frontiers in residential segregation: understanding neighbourhood boundaries and their impacts. Tijdschrift Voor Economische En Sociale Geografie, 110 (3), 289–302. doi:https://doi.org/10.1111/tesg.12307.

Dong, G., et al., 2018. Inferring neighbourhood quality with property transaction records by using a locally adaptive spatial multi-level model. Computers, Environment and Urban Systems, 73, 118–125.

Duque, J.C., Anselin, L., and Rey, S.J., 2012. The max-p-regions problem. Journal of Regional Science, 52 (3), 397–419. doi:https://doi.org/10.1111/j.1467-9787.2011.00743.x.

Li, W., Church, R.L., and Goodchild, M.F., 2014. The p-compact-regions problem. Geographical Analysis, 46 (3), 250–273. doi:https://doi.org/10.1111/gean.12038.

Yuan, S., et al., 2015. Constrained spectral clustering for regionalization: exploring the trade-off between spatial contiguity and landscape homogeneity. In: 2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA). Paris, France: IEEE, 1–10.

Rey, S.J. and Anselin, L., 2007. PySAL: a python library of spatial analytical methods. The Review of Regional Studies, 37 (1), 5–27.

van der Walt, S., Colbert, S.C., and Varoquaux, G., 2011. The NumPy array: a structure for efficient numerical computation. Computing in Science & Engineering, 13 (2), 22–30. doi:https://doi.org/10.1109/MCSE.2011.37.

Pedregosa, F., et al., 2011. Scikit-learn: machine learning in python. Journal of Machine Learning Research, 12, 2825–2830.

Additional information

Funding

This material is based upon work supported by the National Science Foundation under Grant No. 1733705, as well as by the Alan Turing Institute. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author and do not necessarily reflect the views of the National Science Foundation and/or the Alan Turing Institute.

Notes on contributors

Levi John Wolf

Levi John Wolf is a Senior Lecturer at the University of Bristol and a Fellow with the Alan Turing Institute. He develops new concepts, methods, and measures to analyse and understand inequality and segregation in cities. He is also a maintainer of many open source spatial analysis software projects.
