1,489
Views
0
CrossRef citations to date
0
Altmetric
Research Articles

Revealing intra-urban hierarchical spatial structure through representation learning by combining road network abstraction model and taxi trajectory data

, , , , , & show all
Pages 499-516 | Received 24 Aug 2022, Accepted 23 Jul 2023, Published online: 30 Jul 2023

ABSTRACT

The unprecedented urbanization in China has dramatically changed the urban spatial structure of cities. With the proliferation of individual-level geospatial big data, previous studies have widely used the network abstraction model to reveal the underlying urban spatial structure. However, the construction of network abstraction models primarily focuses on the topology of the road network without considering individual travel flows along with the road networks. Individual travel flows reflect the urban dynamics, which can further help understand the underlying spatial structure. This study therefore aims to reveal the intra-urban hierarchical spatial structure by integrating the road network abstraction model and individual travel flows. To achieve this goal, we 1) quantify the spatial interaction relatedness of road segments based on the Word2Vec model using large volumes of taxi trip data, then 2) characterize the road abstraction network model according to the identified spatial interaction relatedness, and 3) implement a community detection algorithm to reveal sub-regions of a city. Our results reveal three levels of hierarchical spatial structures in the Wuhan metropolitan area. This study provides a data-driven approach to the investigation of urban spatial structure via identifying traffic interaction patterns on the road network, offering insights to urban planning practice and transportation management.

1. Introduction

The unprecedented global urbanization in China has increasingly attracted the attention of urban planners and geographers who seek to explore the intra-city structure through spatial interactions (e.g. daily commuting) (Barbosa et al. Citation2018; He et al. Citation2020; Liu et al. Citation2015; Martin and Schuurman Citation2020). Urban spatial structure is defined as the abstract of geographical space distribution in cities (Chen et al. Citation2019; Foley Citation1964). A city acts as a dynamic and connected system with the spatial interaction flows of people between sub-regions (e.g. places and parcels) (Batty Citation2009). With different travel demands and people’s behaviours, the resources and infrastructures of a city are allocated, resulting in the formation of complex and multi-dimensional city structures (Liu et al. Citation2015; Jusup et al. Citation2022). Meanwhile, the rapid development of urban spatial structure has significantly impacted on urban dynamics and intra-city travel patterns of a city’s residents (Burger, van der Knaap, and Wall Citation2014; van Meeteren et al. Citation2016; Wu, Smith, and Wang Citation2021). Over the years, research efforts have been conducted to explore the impacts of city structure on the dynamic urban system (Yue et al. Citation2014). Revealing the underlying spatial structure can help derive interesting insights on the organization and distribution of the urban system, which is important in terms of urban morphology, transportation, and economy (Dokuz Citation2022; He et al. Citation2020; Liang and Kang Citation2021; Luo and MacEachren Citation2014; Schläpfer et al. Citation2021; Zhang et al. Citation2017).

With the proliferation of individual-level mobility data within cities, numerous complex network approaches have been implemented to reveal the underlying spatial structures within a city (Batty Citation2013; Estrada Citation2012; Lee and Kang Citation2015; Zhang et al. Citation2014; Zhong et al. Citation2014). For example, the aggregated analysis models an entire city in a spatially embedded graph based on individual travel behaviours (De Montis, Caschili, and Chessa Citation2013; Liu et al. Citation2015; Yu Citation2018). However, the regional units (e.g. administrative districts and grid cells) in the aggregated analysis have been proven to be large and abstract, and detailed human movement information may be ignored without considering interaction relations between fine-grained spatial units, such as urban roads (Liu et al. Citation2017; Zhu et al. Citation2017). Recently, network abstraction models from the urban road network perspective are frequently applied for urban functional zone identification and spatial structure analysis (Hong and Yao Citation2019; Rodrigue, Comtois, and Slack Citation2016). In the network abstraction model, the road intersections are usually represented as nodes and the road segments are represented as edges. The construction of a network abstraction model, however, primarily depends on the topology of the road network, which does not consider the spatial interaction derived from individual travel flows. The integration of individual travel flows into the road network abstraction model can take advantage of the fine-grained spatial flow information among urban roads and provides new insights for exploring the underlying urban spatial structure.

Therefore, we proposed a framework to reveal the underlying spatial structure within a city by integrating the topology-based network abstraction model and individual travel flows derived from the taxi trajectory data. The remainder of this paper is organized as follows. Section 2 reviews the related literature on investigation of both urban spatial structure and road network abstraction model. Section 3 introduces the proposed framework and methods. In Section 4 and 5, we conducted a case study in the city of Wuhan and presented the results with discussions. Section 6 concludes this study and presents our vision on the future work.

2. Literature review

2.1. Revealing city structure based on the spatially embedded graph

The rapid growth of map services and mobile positioning technology has provided a massive amount of emerging geo-tagged data (e.g. taxi trajectory data and mobile phone record data) in the past decade. Those data sources lay the foundation to analyse the intra-urban flows to address many research issues, such as detecting human mobility patterns (Liu et al. Citation2012; Luo, Gao, and Cassels Citation2018; Siła-Nowicka et al. Citation2016; Zhang et al. Citation2018; Zhu et al. Citation2018), assessing traffic indicators of the urban road network (Brauer, Mäkinen, and Oksanen Citation2021; Cui et al. Citation2016; Hu et al. Citation2021), revealing multi-level city structure (Gao et al. Citation2013; Liu et al. Citation2015; Liu, Gao, and Lu Citation2019; Zhong et al. Citation2014), and understanding urban land use (Liu et al. Citation2012, Citation2016; Hu et al. Citation2021; Zhang et al. Citation2020)

Meanwhile, a spatially embedded graph (also called spatially embedded network) has been widely used to detect city structure with those human mobility data. Each geographical region stands for a node and the travel flow derived from the human mobility data stands for the edge between regions in a spatially embedded graph. Then, the community detection method is used to divide the entire graph into sub-graphs, namely communities or modules (Chen, Xu, and Xu Citation2015; Fortunato and Hric Citation2016; Hric, Darst, and Fortunato Citation2014). Communities are sets of nodes that have strong inner connectivity and are sparsely connected to nodes of other communities. For example, Gao et al. (Citation2013) empirically discovered the clustering structures of spatial-interaction communities according to the construction of two intertwined embedded networks, e.g. the network of phone-call interaction and the network of phone-users’ movements. Liu et al. (Citation2015) found a two-level polycentric community structure in Shanghai with grid cell-based embedded graph using a taxi trip data. Therefore, significant community structures that represent the underlying sub-regions with strong internal travel flows can be revealed via spatially embedded graph (Hong and Yao Citation2019; Liu et al. Citation2015).

2.2. Network abstraction model based on urban road network

Recently, numerous in-depth researches have been conducted to reveal city structures via network abstraction model based on the urban road network. A road network, an artificial corridor in the urban areas, plays an important role in shaping the city’s traffic and functional structure (Burghardt et al. Citation2021; Hong and Yao Citation2019; Zhu et al. Citation2017). Inspired by the spatially embedded graph, the network abstraction model regards the urban road network as an embedded graph, in which the significant road nodes (e.g. intersections) are represented as nodes and the road segments are represented as edges. Hong and Yao (Citation2019) empirically found a hierarchical structure of communities according to the network abstraction model based on OpenStreetMap (OSM) road network.

Spatially embedded graph models an entire city to explore the underlying city structure using individual travel flows, but the regional units are large and abstract. Network abstraction model primarily focus on the inner topology of the road network, but do not consider the role of human movements along with the road network. Previous research argued that an appropriate integration between urban spatial structure and human mobility patterns can provide great potential for investigating urban morphology and transport geography (de Andrade et al. Citation2021; Guo et al. Citation2021; Liu et al. Citation2016; Xu et al. Citation2016). However, previous studies tend to employ individual travel behaviours and road network abstraction model, respectively, the applications of the integration between individual travel behaviours and road network abstraction model needs further studies. This work, therefore, aims to reveal the intra-urban spatial structure by combining road network abstraction model and individual travel flows.

2.3. A critical analysis

In sum, many efforts have been made for revealing the urban spatial structure, but they have the following shortcomings:

  • The units to explore the underlying urban spatial structure using individual travel flows on spatially embedded graph models are aggregated to areal units that may suffer from the modified areal unit problem (MAUP).

  • Network abstraction model primarily focuses on the inner topology of the road network, but does not consider the role of human movements along with the road network. Previous studies tend to employ individual travel behaviours and road network abstraction model separately.

This research, therefore, aims to reveal the intra-urban spatial structure by combining road network abstraction model and individual travel flows. We proposed an integrated framework to reveal the underlying spatial structure within a city based on individual travel flows derived from the taxi trajectory data. The contributions of this work are as follows:

  • We proposed an integrated framework for sensing the underlying hierarchical urban spatial structure, which can benefit from both network abstraction model and detailed human movement information at a fine-grained scale.

  • We investigated the integration of the quantified spatial interaction relatedness of road segments into the urban road network to enrich urban road network-based GIS research.

3. Methodology

In this study, we proposed a framework to explore hierarchical urban spatial structure using urban road network and taxi trip data (). First of all, we used a representation embedding method-Word2Vec to quantitively measure the spatial interaction relatedness of road segments using taxi trajectory data. Then, we constructed a road network abstraction model based on the OSM road network, and further weighted the network using the quantified interaction relatedness. Finally, we employed the Infomap community detection method to reveal significant community patterns that represent sub-regional patterns of a city. In addition, we used point-of-interest (POI) data to evaluate the spatial distribution of the detected regional structure of this city.

Figure 1. The flowchart of the proposed representation learning framework for extracting hierarchical intra-urban spatial structure.

Figure 1. The flowchart of the proposed representation learning framework for extracting hierarchical intra-urban spatial structure.

3.1. Measurement of spatial interaction relatedness of road segments based on the word2Vec model

In this section, we introduced the Word2Vec representation learning model to quantify the spatial interaction relatedness of road segments on the urban road network using GPS-enabled taxi trajectory data.

3.1.1. Representing taxi trajectories by road nodes on road network

Taxis operate along the urban road network and taxi trip routes contain valuable human activity information about people’s movement and traffic flow. Here, we presented a map-matching-based approach that represents taxi trajectories along a road network using the interaction node as a spatial assembly and analytic unit. In this study, a road node is defined as a point that represents connectivity between two road segments, including the starting point, the ending point, and the intersection of the road segments. Specifically, we started by mapping taxi GPS records to urban roads via a fast map matching algorithm (Yang and Gidófalvi Citation2018), and we represented each route as the sequence of consecutive matched road nodes ().

Figure 2. An example of mapping a raw taxi route to intersection nodes sequence along the road network.

Figure 2. An example of mapping a raw taxi route to intersection nodes sequence along the road network.

3.1.2. Building traffic analytic semantic corpus

In recent years, geo-semantic analysis framework derived from the natural language processing (NLP) field has shown potential as a promising tool to exploit spatial relationships in urban areas based on big data (Cai Citation2021; Crivellari and Ristea Citation2021; Liu, Gao, and Lu Citation2019; Luo et al. Citation2019; Yao et al. Citation2017). By analogizing traffic elements to NLP terms, geo-semantic analysis builds high-dimensional embedding vectors to quantitatively describe the traffic components, exploring potential information in geographical data (Bengio, Courville, and Vincent Citation2013; Hu et al. Citation2021). Researchers can effectively train a language-based representation learning model and evaluate semantic representations or relationships based on a large semantic corpus in geo-semantic analysis.

In this study, based on the assumption that the traffic interaction indicates the travel activities of urban people and to be intimately connected with the urban spatial structure, the entire study area was represented as a corpus of traffic analysis. In this corpus, the taxi movement trajectories were analogized to NLP documents, while road nodes on the road network were words in documents. The objective of building the corpus is to capture fine-scale traffic interaction patterns along with the urban road network and measure the spatial co-occurrence relationships between traffic nodes. A higher degree of traffic relatedness between two intersection nodes indicates that they are likely to co-occur in the taxi movement trajectories on the road network.

3.1.3. Training word embedding representation model

Word embedding is one of the most popular representation learning techniques for representing the vocabulary of a document, originating from the NLP domain. For machine learning tasks, words must be represented meaningfully, which requires numerical representation. Word embedding techniques are used to solve this problem, with algorithms, such as Word2Vec enabling words to be expressed mathematically (Cai Citation2021; Liang and Kang Citation2021). The core principle behind word embedding techniques is that each word is represented by a D-dimensional real-valued vector, which reflects the multidimensional semantics of the word. The distance between two vectors quantifies their semantic similarity and relatedness (Mai et al. Citation2022). The word embedding technique represents each distinct word with a numerical vector via a self-supervised neural network. It captures the context of vocabulary words in a document and measures their semantic similarity or relatedness with other words.

Word2Vec is an open-source, state-of-the-art tool proposed by Mikolov et al (Mikolov et al. Citation2013). for word embedding with ease of operation and high scalability. Word2Vec trains a two-layer neural network and reconstructs linguistic contexts of words to convert each word to a unique vector. In this study, the skip-gram-based Wrod2vec model (Levy and Goldberg Citation2014) was introduced to obtain the numerical representation of intersections (as nodes) along the road network. Given a sequence of road nodes (a trajectory), the skip-gram-based Word2Vec model’s goal is to predict the context of road nodes for a given target node in a sliding window. The maximum likelihood function can be estimated as:

(1) Jθ=1nj=1nwjwlogp(vj+w|vj)(1)

where w is the training context window size, n is the number of road nodes, and vj+w denotes the context road nodes of the target one vj. The conditional probability can be estimated as:

(2) pvo|vt=expxvoTyvtc=1CexpxcTyvt(2)

where xvo and yvt are context and target vector representations of road nodes and C is size of the semantic corpus.

We trained the Word2Vec model using the traffic analytic semantic corpus built from taxi trajectory data, reconstructed spatial contexts of traffic elements, and finally obtained the symbolic vector representation with real values for each road node.

3.1.4. Measuring spatial interaction relatedness

The urban road network facilitates the majority of human movement, and vehicle routes provide valuable information about human activity and traffic flow in the urban road network. The measurement of relatedness (Liu et al. Citation2017), cosine similarity, between two road nodes aims to quantitatively investigate the inherent spatial interactions on a road network and reveal intra-urban spatial structure. If two nodes in the road network have a strong traffic relatedness, they are more likely to occur in close proximity along taxi trajectories. A cosine similarity measurement is employed to quantify the traffic relatedness of the road nodes based on their embedding vectors, denoted as (Xu et al. Citation2021):

(3) simwi,wj=wi  wjwi∥∥wj(3)

where simwi,wj means cosine similarity between embedding vectors wi and wj and changes between 0 and 1. A value of simwi,wj approaching 1 indicates that road node i has strong correlations with road node j. It is worth noting that the obtained similarity denotes the topological measurement of urban roads and quantitates the functional relations of road interactions (Hong and Yao Citation2019).

3.2. Identification of hierarchical spatial structure using the infomap algorithm

3.2.1. Road network abstraction model

Road network architectures in urban settings have been studied with the analogy of urban road networks to graphs in complex network disciplines (Gao et al. Citation2013; Liu et al. Citation2015; Yao et al. Citation2021). With the emergence of OSM, researchers are able to better explore urban road networks and access urban spatial interaction patterns. In this study, the road network abstraction model was constructed using OSM road data. Specifically, the road network was symbolized to a weighted directed graph GV,E,W, where the road node is referred to as graph vertex, V, the road linking two adjacent nodes is referred to as an edge, E, the road driving direction is used to describe the edge direction, and the spatial interaction relatedness of the road segment is used to describe the weight of edge, W. The construction of the road network abstraction model was implemented with an open-sourced tool-OSMnx (Boeing Citation2017).

3.2.2. Community detection using the infomap algorithm

The community detection method attempts to group or divide graph vertices into a few subsets based on their interaction patterns. The divided sets of vertices, commonly called communities or modules, are strongly linked together and sparsely connected to the rest of the graph (Xie, Kelley, and Szymanski Citation2013). Based on the spatial interaction relatedness and topological connectivity of road nodes, the Infomap community detection algorithm is employed to reveal significant community patterns that represent sub-regional patterns of a city. Among community detection algorithms, the Infomap method with its superior performance and high availability has emerged as a popular method in a wide range of applications (Liu et al. Citation2015; Yao et al. Citation2021).

The InfoMap algorithm is an approach for network partitioning built on the minimum entropy principle and random walks strategy. Network partition, given a graph, refers to a specific division of the nodes into modules with an objective function. Infomap optimizes the objective function known as the map equation (Edler, Bohlin, and Rosvall Citation2017). Infomap uses Huffman coding (Huffman Citation1952) to describe each node in the network with a two-level description and minimizes the description length of a random walker’s movements over all possible network partitions on a network. Thus, the partition with the shortest description length best describes the community structure of the network in terms of network dynamics. More details about the Infomap algorithm can be found in Rosvall and Bergstrom (Citation2008).

In this study, the Infomap algorithm was used to partition the urban road network into multiple layers. To detect hierarchical communities, the Infomap algorithm starts by partitioning the urban road network into modules (or communities) based on the flow of random walks. Each module is treated as a new network, and the process is repeated recursively until no further meaningful partitions can be made. The resulting hierarchy of modules represents a nested set of communities at different scales or resolutions. The input of the Infomap algorithm is the weighted directed graph Accessisdenied, and the output is a graph G divided into hierarchical modules/subgroups.

3.2.3. Mixed land used indices

Inspired by Yue et al. (Citation2017), three mixed-use indices, including the Margalef Species Richness index, Shannon’s entropy index, and the Simpson’s index of diversity, were introduced to measure the extent of mix/evenness in the distributions of land use types. The Margalef Species Richness index (Gotelli and Colwell Citation2011), namely Richness index, is a species diversity index in the ecology domain and is used to estimate the degree of diversity in the land use and POI context. The Richness index for the specific region i is defined as Ri:

(4) Ri=S1ln ni(4)

where S is the number of POI types and ni is the number of POI within the region i. Note that a lower value of Ri indicates the less diversity of POI types, suggesting a higher purity urban function of the division.

The Shannon’s entropy index (Brown et al. Citation2009), namely Shannon index, reflects the degree of orderliness of the distribution of POI types, which is defined as Hi:

(5) Hi=j=1Spjlnpj(5)

where pj is the percentage of the number of POI type j in total number of POIs of region i. Note that A lower value of Hi indicates the greater orderliness and less random of the distribution of POIs.

The Simpson’s index of diversity (Hunter and Gaston Citation1988; Simpson Citation1949) measures the concentration of the distribution of POIs, which is defined as Di:

(6) Di=1j=1Smjmj1nini1(6)

where mj is the total number of POIs of a particular type j within the region i. Note that a higher value of Diindicates a greater degree of concentration of region i. The above three indices evaluate mixed land use in the POIs context from different aspects, and therefore can effectively verify the advantage of our proposed method in delineating urban functional areas.

4. Case study

4.1. Study area and data collection

A case study was carried out in the city of Wuhan, China which is known for its complex urban morphology and high rates of mixed land use. Because of Wuhan’s rapid urbanization, new requirements for exploring the urban spatial structure and planning have been emerging in recent years. The study area was selected from a downtown area of Wuhan city, covering an area of 748.39 km2 (). The study area is characterized by a wide variety of growth and population density levels, wherein the landscapes are exceedingly diverse.

Figure 3. Our study area in Wuhan, which covers the downtown area of this city (within the third ring road). (a) Administrative districts of Wuhan city; (b) Satellite remote sensing image of the study area. The bottom images show examples of diverse landscapes: (1) industrial area, (2) residential area, (3) commercial area and (4) educational area.

Figure 3. Our study area in Wuhan, which covers the downtown area of this city (within the third ring road). (a) Administrative districts of Wuhan city; (b) Satellite remote sensing image of the study area. The bottom images show examples of diverse landscapes: (1) industrial area, (2) residential area, (3) commercial area and (4) educational area.

In this work, we used three kinds of geographic data including urban road network, taxi GPS trajectories, and POI data to conduct the experiments. Note that the coordinate system of all geospatial data was unified to the WGS geographic coordinate system.

  • Urban road network. The primary road network data was obtained from OSM, which was acquired in January 2018. OSM is an open-source map service that gives users free and easy-to-access digital map data, and it is now the most popular and successful volunteered geographic information provider (Xu et al. Citation2019). The road data contains essential attribute information, including road name, road type, coordinate location (longitude and latitude) of road nodes, and topological connection information. Extraordinarily high precision in the position and topological relationship is found in the study area. 22115 road segments and 14,715 nodes were extracted after pre-processing procedures such as simplification and topological checking ().

    Figure 4. Data schema of the road network in the study area. (a) road segments and (b) road nodes.

    Figure 4. Data schema of the road network in the study area. (a) road segments and (b) road nodes.

  • GPS-enabled taxi trajectories. As an important form of transportation, GPS-enabled taxis are not constrained by routes or time and provide a kind of flexible and wide-ranging trajectory data in urban regions with high accuracy and fewer privacy concerns (Cui, Wu, and He Citation2022; Zheng et al. Citation2011). Taxi GPS record data used in this study was collected from May 9, 2015-May 15, 2015 in Wuhan, which contains movement tracks of more than 10,000 taxis. The original taxi trajectory dataset is essentially a collection of GPS track points. Each point includes basic driving data such as taxi number, timestamp, coordinate location, speed, direction, and status (vacant or occupied). The sampling frequency of the GPS track is about 50 s. With the attribute of occupied status, passenger-carrying trajectories are retrieved from the original GPS track points. A consecutive taxi movement route is composed of the pick-up location, the drop-off location, and several intermediate GPS records.

  • POIs: The POI data was collected from an open-source data platform, Peking University Research Data (State Information Center Citation2017). This data covers the period from July 1, 2017 to September 30, 2018. A total of 404,721 POIs is extracted in the study area (). A POI consists of essential attributes: POI identity number, name, hierarchical types, address, and coordinate location. Hierarchical types are grouped into three categories, i.e. top-level, second-level, and third-level. From the top level to third level, the POIs descriptions are provided in greater detail. In this study, the second-level category of POIs types is utilized to identify urban functions and evaluate the effectivity in order to balance the trade-off between the mixture of urban land use and the complexity of semantic computation. POI data is mainly used to calculate the land use mixed indices and identify the urban functions in hierarchical city structure analysis.

    Figure 5. (a) kernel density estimation of pois (unit: count per km2); (b) spatial distribution of POIs near Jianghan road, a famous commercial area within the downtown of Wuhan. Note that different colours indicate types of POIs.

    Figure 5. (a) kernel density estimation of pois (unit: count per km2); (b) spatial distribution of POIs near Jianghan road, a famous commercial area within the downtown of Wuhan. Note that different colours indicate types of POIs.

4.2. Results

4.2.1. Embedding representation and spatial interaction relatedness of road nodes

The road network in the study area was abstracted onto a weighted directed graph. Taking the road network graph as input, a skip-gram-based Word2Vec model was trained to obtain embedding representations. In view of the volume of graph structure and the computational costs, most parameters were set to recommended or default values. Specifically, for model training, the dimension of the road node representation was set to 128, the number of training epochs was set to 10, and the window size was set to 10. The Word2Vec model was trained using the representation learning tool gensim in Python (Rehurek and Sojka Citation2010).

Cosine similarity metrics were then utilized to obtain the relatedness between any two road nodes of a road segment using the embedding vectors. demonstrates the relationship between the average similarity and the distance between pairwise two node embeddings. For each road node, we randomly selected 10% of all road nodes to calculate the average similarity and road distance between two nodes. We found that the topological correlation generally follows Tobler’s first law of geography (Tobler Citation1970): near nodes are more related than distant nodes. Meanwhile, shows an example of the spatial distribution of the similarity of road nodes in a local area. The red node denotes selected centre node. As the blue colour deepens, so does the similarity between the centre node and its neighbours, implying a larger topological correlation. The nodes on roads with the same direction are more related than roads in the opposite direction, which indicates that the similarity of road nodes can be influenced by traffic interactions.

Figure 6. Relationship between the average similarity of node embeddings and the distance between two nodes.

Figure 6. Relationship between the average similarity of node embeddings and the distance between two nodes.

Figure 7. An example of embedding vectors in a road network graph: (a) road networks; (b) embedding vectors of road nodes; (c) spatial distribution of cosine similarities.

Figure 7. An example of embedding vectors in a road network graph: (a) road networks; (b) embedding vectors of road nodes; (c) spatial distribution of cosine similarities.

4.2.2. Hierarchical urban spatial structure

Based on spatial interaction relatedness and topological connectivity, the Infomap algorithm was further employed to implement multilayered detection of communities, and hierarchical urban spatial structure was further explored. shows the statistical results of three-level communities. At the top level, the entire urban road network in the study area is divided into three aggregated sub-regions. As depicted in , the division result is very consistent with the previous urban spatial pattern, namely Three Towns of Wuhan, which formed in the last century before the administrative regionalization reform in China (Wuhan Citation2021). Three towns of Wuhan consist of Wuchang, Hankou, and Hanyang, which are three independent towns and located in three disjoint parts in the downtown area of Wuhan. Each town has its own development characteristics and patterns with respective specialities in economy, culture and industry. This result indicates that the spatial structure of the Three Towns of Wuhan still has far-reaching effects on the contemporary urban spatial structure.

Figure 8. The spatial distribution of hierarchical communities. Note that the solid black lines indicate the administrative boundaries. (a) top-level communities; (b) second-level communities; (c) third-level communities; (d) and (e) indicate the second-level divisions and third-level divisions of Jianghan district.

Figure 8. The spatial distribution of hierarchical communities. Note that the solid black lines indicate the administrative boundaries. (a) top-level communities; (b) second-level communities; (c) third-level communities; (d) and (e) indicate the second-level divisions and third-level divisions of Jianghan district.

Table 1. Comparison of different communities.

As shown in , more detailed structural information is detected at the second-level communities. The study area is divided into 22 communities or sub-regions with an average edge weight of 0.859. Most studies usually demonstrate the sub-regional structure of a city by referring to its administrative divisions (De Montis, Caschili, and Chessa Citation2013; Liu et al. Citation2015; Ratti et al. Citation2010). In the study area, most sub-regions are inconsistent with the district-level administrative division, indicating the different connectivity between the districts and the underlying urban spatial structure via intra-city interactions in the study area. For example, Jianghan district, located in the downtown area of Wuhan, is mainly divided into three different regional areas, corresponding to three different functional areas (Hankou railway station transportation area, Wanda plaza commercial area, and Jianghan Road shopping area) (as shown in ). These three areas with different functions have distinct spatial interactions among each other. Administrative division structure is a static artefact which is partially arbitrary. Cities function as dynamic systems where traffic flows play a significant role in connecting the discrete resources of a city into an integrated system. Our results embrace the same idea that spatial interaction ties sub-regions within a city.

To further validate our results and understand the spatial interaction patterns between the second-level communities, the chord diagram was used to visualize the actual traffic flows using the origins and destinations of taxi trips. As depicted in , we found that the frequency of intra-flows is greater than inter-flows between sub-regions, indicating that most taxi movements happen within the same sub-regions. This result shows that the proposed methods can effectively reveal regional urban spatial structure and identify the intra-city spatial interaction patterns.

Figure 9. The chord diagram of traffic flows using origins and destinations of taxi movements. noting that the numbers in the left diagram and right map is consistent and indicate the identifier of the sub-regions. (a) the chord diagram; (b) the spatial distribution of sub-regions. the blue lines indicate the sub-regions boundary generated by Thiessen polygon algorithm.

Figure 9. The chord diagram of traffic flows using origins and destinations of taxi movements. noting that the numbers in the left diagram and right map is consistent and indicate the identifier of the sub-regions. (a) the chord diagram; (b) the spatial distribution of sub-regions. the blue lines indicate the sub-regions boundary generated by Thiessen polygon algorithm.

maps a more elaborate regional division pattern with 127 communities and an average edge weight of 0.864. These results reveal a fine-scale urban spatial structure, and each sub-region can reflect specific urban functional areas. We employed the Thiessen polygon to estimate the influence area of each road node and determine the coverage area of each division. To further explore the functional area characteristics, the term frequency-inverse document frequency (TF-IDF) method was used to identify the urban functions using POI types. By weighting different POI types within each sub-region, the TF-IDF method stresses the uniqueness of POI types in each sub-region and decreases the weight of common POI types, therefore effectively identifying functional characteristics of each sub-region. A similar approach has been used in previous works (Gao, Janowicz, and Couclelis Citation2017; Hu et al. Citation2021; Liu et al. Citation2020); detailed descriptions about TF-IDF algorithm can be found in Beel et al. (Citation2016).

shows the spatial distribution of urban functional areas, and shows the function descriptions. Cluster 1 mainly includes educational areas such as many universities. Cluster 2 consists of many kinds of comprehensive marketplaces. Cluster 3 includes most industrial areas located alone with 3rd Ring Road. Cluster 4 includes business and commercial areas located near Wuhan Central Business Unit (CBD). Cluster 5 is heavily mixed areas of residential blocks and commercial areas. Transportation, few business and industrial areas dominate Cluster 6.

Figure 10. The spatial distribution of urban functional areas based on third-level division.

Figure 10. The spatial distribution of urban functional areas based on third-level division.

Table 2. Urban functions areas and example locations.

4.2.3. Result verification

To evaluate the effectiveness of our proposed method, we implement two comparison experiments with different weights of each road segment within the urban road graph:

  • O-Infomap: Original Infomap method with all weight of edges set to 1;

  • D-Infomap: Distance-weighted Infomap method with weight set as Euclidean length of each road segment;

  • Our proposed method: Relatedness-weighted Infomap method with weight set as spatial interaction relatedness between traffic nodes.

Three hierarchical spatial division patterns were detected using the comparison experiments in a quantitative way. As suggested by Brown and Holmes (Citation1971) and Noronha and Goodchild (Citation1992), one classic approach to measure the performance of delineating urban functional regions is maximizing the interactions within the same region while minimizing the interactions between different regions. Inspired by Liu et al. (Citation2019), we calculated the percentage of average frequency (PAF) of taxi flows between the second-level divisions within the same division over 7 days. A greater value of PAF suggests that people are more inclined to travel within the same division or sub-region, indicating a more significant spatial division and better capability of identifying spatial interaction patterns. As shown in the values of PAF are higher than that of the other methods on most days, proving that our proposed method performs significantly better than other methods in identifying spatial interaction patterns.

Table 3. The percentage of average frequency of taxi flows between the second-level divisions within the same division of seven days.

Moreover, we calculated multiply indices for mixed land use of the third-level divisions using POI types. We hypothesize that the mixing degree of urban functions in a specific sub-region reflects regional urban functional structure, thus verifying the validity of the division results. A lower mixing degree of urban functions suggests a more pronounced spatial division and is better capable of delineating urban functional areas.

shows statistical information of three mixed-use indices using different methods. Our proposed method achieves good fitting results with a lower average Richness index and average Shannon index, indicating the advantage of our proposed method in keeping the purity of urban functions and orderliness of the distribution of land-use types. However, compared to other methods, Simpson’s index of diversity has no significant difference. In summary, the quantitative comparison results from two different perspectives indicate the effectiveness of our proposed approach in revealing hierarchical spatial structure in a city.

Table 4. Indices for mixed land use of the third-level divisions.

5. Discussion

5.1. Hierarchical urban spatial structure

Urban spatial structure is closely related to the urban travel patterns generated from their daily lives of urban dwellers. The distribution of various urban elements (or infrastructure) significantly impacts people’s travel, and their movement flows connect discrete parcels or areas into an integrated system. Benefited from the spatial interaction analysis, urban parcels or areas, such as grids, communities, or traffic analysis zones could be joint via the similarity of functionality to obtain spatial interactions among aggregated regions (i.e. such urban functional region). With multilayer aggregating at different scales, hierarchical urban spatial structure could be revealed (Hong and Yao Citation2019; Wu, Smith, and Wang Citation2021). For example, Liu et al. (Citation2015) revealed a two-level hierarchical urban spatial structure by dividing taxi trips into short- and long-distance trips based on the different travel demand of local dwellers and their mobility patterns. In this hierarchical urban spatial structure, local clusters are generated with short-distance trips (dominate local spatial interactions), whereas long-distance trips play the role of connecting these local clusters. Our results resonate with Liu et al. (Citation2015) ‘s previous work, and particularly address the spatial hierarchy at a fine-grained scale.

Exploring the hierarchical urban spatial structure have great meanings in multiple application domains. Hierarchical urban spatial structure highlights the multilayer spatial interactions, therefore could bring new insights for extracting the co-location patterns and underlying spatial semantics of urban functional regions. What’s more, different hierarchical levels of urban regions associated with their functions could be dynamically drawn on a digital map (i.e. tourist map or urban planning map). Such hierarchical semantical map could be beneficial for various map users (Gao Citation2017). For city planners, it is critical to answer questions such as ‘where are the most dynamic regions in the city’ and ‘where are the neighbourhoods that are changing the most’? Moreover, hierarchical urban spatial structure could reveal the regions that attract a lot of attention from the general public in an urban area. As a result, when limited resources are available for urban planning initiatives, these regions may be given a higher priority.

5.2. The spatial interaction relatedness and human mobility patterns within a city

There has been continued and sustained interest in extracting hidden information from human mobility patterns in urban studies. Previous studies generally extract temporal or spatial statistical characteristics (i.e. frequencies statistics of origins and destinations of the individual-level trips) to determine an urban region within a city. Origins and destinations represent the travel purposes of people, but the intermediate trajectory information between origins and destinations is rarely used (Hu et al. Citation2021; Zheng et al. Citation2014; Zhu et al. Citation2017). Inspired by Hu et al. (Citation2021), an analogy strategy from urban elements to NLP field was introduced, where the urban area was regarded as a corpus, a travel trip was deemed as a sentence, road node was used as word. Further a Word2Vec model was employed to measure the spatial interaction relatedness between road nodes.

Embracing the analogy strategy from GIS elements to the NLP field, considerable research in recent years has demonstrated the advantages of word embedding technologies in GIScience from different perspectives. For example, Crivellari and Ristea (Citation2021) firstly introduced the CrimeVec approach based on word embedding technology to understand the criminology of urban place. Word2Vec model was used to creating dense vectors of crime types based on spatial-temporal distribution. Li et al. (Citation2019) proposed a regionalization method for clustering and partitioning based on semantic trajectories extracted from call detail record (CDR) of mobile phone signalling data. Zhang et al. (Citation2020) proposed a Traj2Vec model to quantify trip trajectories as high-dimensional semantic vectors using mobile phone positioning data. They proved that cell towers with similar vectors are spatially closer to one another. Attempts have also been made to measure traffic interactions (Liu et al. Citation2017; Wu et al. Citation2020; Xu et al. Citation2020), delineate urban functional use (Huang et al. Citation2022; Niu and Silva Citation2021; Sun et al. Citation2021) and so on using word embedding models (Mai et al. Citation2022).

The spatial interaction relatedness has the unique advantage in extracting the underlying human mobility patterns. Compared with traditional origins and destinations information, the spatial interaction relatedness thoroughly extracts refined characteristics by integrating fixed urban road network structure and dynamic human movement trips. In this regard, the spatial interaction relatedness could be used to reveal the movement flows and traffic states. For example, how upstream or downstream road traffic flows impact the neighbourhood roads? We could employ the embedding representation and cosine-based similarity measurement to quantify the impact.

6. Conclusions

It is one of the central themes for transport geographers to reveal the underlying urban spatial structure from the intra-urban spatial interaction patterns. In this study, we employed the Word2Vec representation learning model to quantify fine-scale interaction relations of roads segments in a road network using taxi trajectory data. We then utilized the Infomap community detection method to reveal significant sub-regional patterns in the Wuhan metropolitan area. We found the three-level hierarchical structures that indicate the division of urban space with the integration of a large volume of real taxi trajectories into the urban road network. Urban space is divided into three aggregated sub-regions at the top level, which shows great consistency with a previous urban spatial pattern known as the Three Towns of Wuhan. The second level reflects sub-regions with more intra-flow than inter-flow connections. The third level reveals a fine-scale spatial distribution of urban functions. We further assessed the advantage of our proposed method via two comparison experiments. Our work considers the entire city as a connected system using human travel flows on the urban road network. Our method and results can provide traffic managers and urban planners with a better understanding of traffic patterns on the road network and the regional structure of a city.

Furthermore, we would like to regard this research as a beginning to detect the spatial interaction communities based on trajectory data and road network abstraction model. Further research can be conducted to improve our proposed framework and compare with state-of-the-art methods if more detailed urban geographical data and open data sources are available. Future work is also anticipated to collect multimodal travel data (e.g. subway, biking, bus trips) to support a more comprehensive spatial interaction investigation in cities.

Acknowledgements

This work was supported by the National Natural Science Foundation of China [grant number 41871311]; Guangdong Science and Technology Strategic Innovation Fund (the Guangdong-Hong Kong-Macau Joint Laboratory Program, Project No. 2020B1212030009); and the Fundamental Research Funds for National Universities, China University of Geosciences (Wuhan); China Scholarship Council (CSC) during a visit of Sheng Hu to National University of Singapore.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

The data and codes that support the findings of this study are available from the corresponding author upon reasonable request.

Additional information

Funding

The work was supported by the National Natural Science Foundation of China [41871311]; Guangdong-Hong Kong-Macau Joint Laboratory Program [2020B1212030009]; China Scholarship Council (CSC) the Fundamental Research Funds for National Universities, China University of Geosciences (Wuhan).

References

  • Barbosa, H., M. Barthelemy, G. Ghoshal, C. R. James, M. Lenormand, T. Louail, R. Menezes, J. J. Ramasco, F. Simini, and M. Tomasini. 2018. “Human Mobility: Models and Applications.” Physics Reports 734:1–74. https://doi.org/10.1016/j.physrep.2018.01.001.
  • Batty, M. 2009. Cities as Complex Systems: Scaling, Interaction, Networks, Dynamics and Urban Morphologies.
  • Batty, M. 2013. The New Science of Cities. MIT press. https://doi.org/10.7551/mitpress/9399.001.0001.
  • Beel, J., B. Gipp, S. Langer, and C. Breitinger. 2016. “Research-Paper Recommender Systems: A Literature Survey.” International Journal on Digital Libraries 17 (4): 305–338. https://doi.org/10.1007/s00799-015-0156-0.
  • Bengio, Y., A. Courville, and P. Vincent. 2013. “Representation Learning: A Review and New Perspectives.” IEEE Transactions on Pattern Analysis and Machine Intelligence 35 (8): 1798–1828. https://doi.org/10.1109/TPAMI.2013.50.
  • Boeing, G. 2017. “OSMnx: New Methods for Acquiring, Constructing, Analyzing, and Visualizing Complex Street Networks.” Computers, Environment and Urban Systems 65:126–139. https://doi.org/10.1016/j.compenvurbsys.2017.05.004.
  • Brauer, A., V. Mäkinen, and J. Oksanen. 2021. “Characterizing Cycling Traffic Fluency Using Big Mobile Activity Tracking Data.” Computers, Environment and Urban Systems 85:101553. https://doi.org/10.1016/j.compenvurbsys.2020.101553.
  • Brown, L. A., and J. Holmes. 1971. “The Delimitation of Functional Regions, Nodal Regions, and Hierarchies by Functional Distance approaches.” Ekistics 32 (192): 387–391. JSTOR.
  • Brown, B. B., I. Yamada, K. R. Smith, C. D. Zick, L. Kowaleski-Jones, and J. X. Fan. 2009. “Mixed Land Use and Walkability: Variations in Land Use Measures and Relationships with BMI, Overweight, and Obesity.” Health & Place 15 (4): 1130–1141. https://doi.org/10.1016/j.healthplace.2009.06.008.
  • Burger, M. J., B. van der Knaap, and R. S. Wall. 2014. “Polycentricity and the Multiplexity of Urban Networks.” European Planning Studies 22 (4): 816–840. https://doi.org/10.1080/09654313.2013.771619.
  • Burghardt, K., J. Uhl, K. Lerman, and S. Leyk. 2021. Road Network Evolution in the Urban and Rural United States Since 1900. ArXiv:2108.13407 [Nlin, Physics:Physics]. http://arxiv.org/abs/2108.13407.
  • Cai, M. 2021. “Natural Language Processing for Urban Research: A Systematic Review.” Heliyon 7 (3): e06322. https://doi.org/10.1016/j.heliyon.2021.e06322.
  • Chen, T., E. C. M. Hui, J. Wu, W. Lang, and X. Li. 2019. “Identifying Urban Spatial Structure and Urban Vibrancy in Highly Dense Cities Using Georeferenced Social Media Data.” Habitat International 89:102005. https://doi.org/10.1016/j.habitatint.2019.102005.
  • Chen, Y., J. Xu, and M. Xu. 2015. “Finding Community Structure in Spatially Constrained Complex Networks.” International Journal of Geographical Information Science 29 (6): 889–911. https://doi.org/10.1080/13658816.2014.999244.
  • Crivellari, A., and A. Ristea. 2021. “CrimeVec—Exploring Spatial-Temporal Based Vector Representations of Urban Crime Types and Crime-Related Urban Regions.” ISPRS International Journal of Geo-Information 10 (4): 210. Article 4. https://doi.org/10.3390/ijgi10040210.
  • Cui, J., F. Liu, D. Janssens, S. An, G. Wets, and M. Cools. 2016. “Detecting Urban Road Network Accessibility Problems Using Taxi GPS Data.” Journal of Transport Geography 51:147–157. https://doi.org/10.1016/j.jtrangeo.2015.12.007.
  • Cui, H., L. Wu, and Z. He. 2022. “Optimization Framework for Spatiotemporal Analysis Units Based on Floating Car Data.” Remote Sensing 14 (10): 2376. Article 10. https://doi.org/10.3390/rs14102376.
  • de Andrade, S. C., J. Porto de Albuquerque, C. Restrepo-Estrada, R. Westerholt, C. A. M. Rodriguez, E. M. Mendiondo, and A. C. B. Delbem. 2021. “The Effect of Intra-Urban Mobility Flows on the Spatial Heterogeneity of Social Media Activity: Investigating the Response to Rainfall Events.” International Journal of Geographical Information Science 36 (6): 1140–1165. https://doi.org/10.1080/13658816.2021.1957898.
  • De Montis, A., S. Caschili, and A. Chessa. 2013. “Commuter Networks and Community Detection: A Method for Planning Sub Regional Areas.” The European Physical Journal Special Topics 215 (1): 75–91. https://doi.org/10.1140/epjst/e2013-01716-4.
  • Dokuz, A. S. 2022. “Weighted Spatio-Temporal Taxi Trajectory Big Data Mining for Regional Traffic Estimation.” Physica A: Statistical Mechanics and Its Applications 589:126645. https://doi.org/10.1016/j.physa.2021.126645.
  • Edler, D., L. Bohlin, and M. Rosvall. 2017. “Mapping Higher-Order Network Flows in Memory and Multilayer Networks with Infomap.” Algorithms 10 (4): 112. https://doi.org/10.3390/a10040112.
  • Estrada, E. 2012. The Structure of Complex Networks: Theory and Applications. Oxford University Press. https://doi.org/10.1093/acprof:oso/9780199591756.001.0001.
  • Foley, D. L. 1964. ”An Approach to Metropolitan Spatial Structure”. Explorations into Urban Structure. edited by. Melvin M. Webber 21–78 , Philadelphia: University of Pennsylvania Press. https://doi.org/10.9783/9781512808063-004.
  • Fortunato, S., and D. Hric. 2016. “Community Detection in Networks: A User Guide.” Physics Reports 659:1–44. https://doi.org/10.1016/j.physrep.2016.09.002.
  • Gao, S. 2017. Extracting Computational Representations of Place with Social Sensing [UC Santa Barbara Extracting Computational Representations of Place with Social Sensing]. https://escholarship.org/uc/item/9303m2hj.
  • Gao, S., K. Janowicz, and H. Couclelis. 2017. “Extracting Urban Functional Regions from Points of Interest and Human Activities on Location-Based Social Networks.” Transactions in GIS 21 (3): 446–467. https://doi.org/10.1111/tgis.12289.
  • Gao, S., Y. Liu, Y. Wang, and X. Ma. 2013. “Discovering Spatial Interaction Communities from Mobile Phone Data.” Transactions in GIS 17 (3): 463–481. https://doi.org/10.1111/tgis.12042.
  • Gotelli, N. J., and R. K. Colwell. 2011. “Estimating Species Richness.” Biological Diversity: Frontiers in Measurement and Assessment 12 (39–54): 35.
  • Guo, S., T. Pei, S. Xie, C. Song, J. Chen, Y. Liu, H. Shu, X. Wang, and L. Yin. 2021. “Fractal Dimension of Job-Housing Flows: A Comparison Between Beijing and Shenzhen.” Cities 112:103120. https://doi.org/10.1016/j.cities.2021.103120.
  • He, Z., M. Deng, J. Cai, Z. Xie, Q. Guan, and C. Yang. 2020. “Mining Spatiotemporal Association Patterns from Complex Geographic Phenomena.” International Journal of Geographical Information Science 34 (6): 1162–1187. https://doi.org/10.1080/13658816.2019.1566549.
  • Hong, Y., and Y. Yao. 2019. “Hierarchical Community Detection and Functional Area Identification with OSM Roads and Complex Graph Theory.” International Journal of Geographical Information Science 33 (8): 1569–1587. https://doi.org/10.1080/13658816.2019.1584806.
  • Hric, D., R. K. Darst, and S. Fortunato. 2014. “Community Detection in Networks: Structural Communities versus Ground Truth.” Physical Review E 90 (6): 062805. https://doi.org/10.1103/PhysRevE.90.062805.
  • Huang, W., L. Cui, M. Chen, D. Zhang, and Y. Yao. 2022. “Estimating Urban Functional Distributions with Semantics Preserved POI Embedding.” International Journal of Geographical Information Science: 36(10):1–26. https://doi.org/10.1080/13658816.2022.2040510.
  • Huffman, D. A. 1952. “A Method for the Construction of Minimum-Redundancy Codes.” Proceedings of the IRE 40 (9): 1098–1101. https://doi.org/10.1109/JRPROC.1952.273898.
  • Hu, S., S. Gao, L. Wu, Y. Xu, Z. Zhang, H. Cui, and X. Gong. 2021. “Urban Function Classification at Road Segment Level Using Taxi Trajectory Data: A Graph Convolutional Neural Network Approach.” Computers, Environment and Urban Systems 87:101619. https://doi.org/10.1016/j.compenvurbsys.2021.101619.
  • Hunter, P. R., and M. A. Gaston. 1988. “Numerical Index of the Discriminatory Ability of Typing Systems: An Application of Simpson’s Index of Diversity.” Journal of Clinical Microbiology 26 (11): 2465–2466. https://doi.org/10.1128/jcm.26.11.2465-2466.1988.
  • Hu, S., Y. Xu, L. Wu, X. Wu, R. Wang, Z. Zhang, R. Lu, and W. Mao. 2021. “A Framework to Detect and Understand Thematic Places of a City Using Geospatial Data.” Cities 109:103012. https://doi.org/10.1016/j.cities.2020.103012.
  • Jusup, M., P. Holme, K. Kanazawa, M. Takayasu, I. Romić, Z. Wang, S. Geček, et al. 2022. “Social Physics.” Physics Reports 948:1–148. https://doi.org/10.1016/j.physrep.2021.10.005.
  • Lee, J.-G., and M. Kang. 2015. “Geospatial Big Data: Challenges and Opportunities.” Big Data Research 2 (2): 74–81. https://doi.org/10.1016/j.bdr.2015.01.003.
  • Levy, O., and Y. Goldberg 2014. “Dependency-Based Word Embeddings”. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Baltimore, Maryland, 302–308.
  • Liang, X., and Y. Kang. 2021. “A Review of Spatial Network Insights and Methods in the Context of Planning: Applications, Challenges, and Opportunities.” In Urban Informatics and Future Cities, edited by S. C. M. Geertman, C. Pettit, R. Goodspeed, and A. Staffans, 71–91. Springer International Publishing. https://doi.org/10.1007/978-3-030-76059-5_5.
  • Li, Y., T. Fei, and F. Zhang. 2019. “A Regionalization Method for Clustering and Partitioning Based on Trajectories from NLP Perspective.” International Journal of Geographical Information Science 33 (12): 2385–2405. https://doi.org/10.1080/13658816.2019.1643025.
  • Liu, K., S. Gao, and F. Lu. 2019. “Identifying Spatial Interaction Patterns of Vehicle Movements on Urban Road Networks by Topic Modelling.” Computers, Environment and Urban Systems 74:50–61. https://doi.org/10.1016/j.compenvurbsys.2018.12.001.
  • Liu, K., S. Gao, P. Qiu, X. Liu, B. Yan, F. Lu, K. Liu, et al. 2017. “Road2Vec: Measuring Traffic Interactions in Urban Road System from Massive Travel Routes.” ISPRS International Journal of Geo-Information 6 (11): 321. https://doi.org/10.3390/ijgi6110321.
  • Liu, X., L. Gong, Y. Gong, and Y. Liu. 2015. “Revealing Travel Patterns and City Structure with Taxi Trip Data.” Journal of Transport Geography 43:78–90. https://doi.org/10.1016/j.jtrangeo.2015.01.016.
  • Liu, Y., C. Kang, S. Gao, Y. Xiao, and Y. Tian. 2012. “Understanding Intra-Urban Trip Patterns from Taxi Trajectory Data.” Journal of Geographical Systems 14 (4): 463–483. https://doi.org/10.1007/s10109-012-0166-z.
  • Liu, X., C. Kang, L. Gong, and Y. Liu. 2016. “Incorporating Spatial Interaction Patterns in Classifying and Understanding Urban Land Use.” International Journal of Geographical Information Science 30 (2): 334–350. https://doi.org/10.1080/13658816.2015.1086923.
  • Liu, Y., X. Liu, S. Gao, L. Gong, C. Kang, Y. Zhi, G. Chi, and L. Shi. 2015. “Social Sensing: A New Approach to Understanding Our Socioeconomic Environments.” Annals of the Association of American Geographers 105 (3): 512–530. https://doi.org/10.1080/00045608.2015.1018773.
  • Liu, K., P. Qiu, S. Gao, F. Lu, J. Jiang, and L. Yin. 2020. “Investigating Urban Metro Stations as Cognitive Places in Cities Using Points of Interest.” Cities 97:102561. https://doi.org/10.1016/j.cities.2019.102561.
  • Liu, Y., F. Wang, Y. Xiao, and S. Gao. 2012. “Urban Land Uses and Traffic ‘Source-Sink areas’: Evidence from GPS-Enabled Taxi Data in Shanghai.” Landscape and Urban Planning 106 (1): 73–87. https://doi.org/10.1016/j.landurbplan.2012.02.012.
  • Luo, W., P. Gao, and S. Cassels. 2018. “A Large-Scale Location-Based Social Network to Understanding the Impact of Human Geo-Social Interaction Patterns on Vaccination Strategies in an Urbanized Area.” Computers, Environment and Urban Systems 72:78–87. https://doi.org/10.1016/j.compenvurbsys.2018.06.008.
  • Luo, W., and A. M. MacEachren. 2014. “Geo-Social Visual Analytics.” Journal of Spatial Information Science 2014 (8): 27–66. https://doi.org/10.5311/JOSIS.2014.8.139.
  • Luo, W., Y. Wang, X. Liu, and S. Gao. 2019. “Cities as Spatial and Social Networks: Towards a Spatio-Socio-Semantic Analysis Framework.” In Cities as Spatial and Social Networks, 21–37. Springer International Publishing. https://doi.org/10.1007/978-3-319-95351-9_3.
  • Mai, G., K. Janowicz, Y. Hu, S. Gao, B. Yan, R. Zhu, L. Cai, and N. Lao. 2022. “A Review of Location Encoding for GeoAi: Methods and Applications.” International Journal of Geographical Information Science : 1–35. https://doi.org/10.1080/13658816.2021.2004602.
  • Martin, M. E., and N. Schuurman. 2020. “Social Media Big Data Acquisition and Analysis for Qualitative GIScience: Challenges and Opportunities.” Annals of the American Association of Geographers 110 (5): 1335–1352. https://doi.org/10.1080/24694452.2019.1696664.
  • Mikolov, T., I. Sutskever, K. Chen, G. Corrado, and J. Dean 2013. “Distributed Representations of Words and Phrases and Their Compositionality”. Proceedings of the 26th International Conference on Neural Information Processing Systems - Volume 2, Red Hook, NY, USA, 3111–3119.
  • Niu, H., and E. A. Silva. 2021. “Delineating Urban Functional Use from Points of Interest Data with Neural Network Embedding: A Case Study in Greater London.” Computers, Environment and Urban Systems 88:101651. https://doi.org/10.1016/j.compenvurbsys.2021.101651.
  • Noronha, V. T., and M. F. Goodchild. 1992. “Modeling Interregional Interaction: Implications for Defining Functional Regions.” Annals of the Association of American Geographers 82 (1): 86–102. JSTOR. https://doi.org/10.1111/j.1467-8306.1992.tb01899.x.
  • Ratti, C., S. Sobolevsky, F. Calabrese, C. Andris, J. Reades, M. Martino, R. Claxton, S. H. Strogatz, and O. Sporns. 2010. “Redrawing the Map of Great Britain from a Network of Human Interactions.” PloS One 5 (12): e14248. https://doi.org/10.1371/journal.pone.0014248.
  • Rehurek, R., and P. Sojka 2010. “Software Framework for Topic Modelling with Large Corpora”. In Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks, Valletta, Malta.
  • Rodrigue, J.-P., C. Comtois, and B. Slack. 2016. The Geography of Transport Systems. Routledge.
  • Rosvall, M., and C. T. Bergstrom. 2008. “Maps of Random Walks on Complex Networks Reveal Community Structure.” Proceedings of the National Academy of Sciences 105 (4): 1118–1123. https://doi.org/10.1073/pnas.0706851105.
  • Schläpfer, M., L. Dong, K. O’Keeffe, P. Santi, M. Szell, H. Salat, S. Anklesaria, M. Vazifeh, C. Ratti, and G. B. West. 2021. “The Universal Visitation Law of Human Mobility.” Nature 593 (7860): 522–527. https://doi.org/10.1038/s41586-021-03480-9.
  • Siła-Nowicka, K., J. Vandrol, T. Oshan, J. A. Long, U. Demšar, and A. S. Fotheringham. 2016. “Analysis of Human Mobility Patterns from GPS Trajectories and Contextual Information.” International Journal of Geographical Information Science 30 (5): 881–906. https://doi.org/10.1080/13658816.2015.1100731.
  • Simpson, E. H. 1949. “Measurement of Diversity.” Nature 163 (4148): 688–688. https://doi.org/10.1038/163688a0.
  • State Information Center. 2017. Map POI (Point of Interest) Data, V2 ed. Peking University Open Research Data Platform. https://doi.org/10.18170/DVN/WSXCNM.
  • Sun, Z., H. Jiao, H. Wu, Z. Peng, and L. Liu. 2021. “Block2vec: An Approach for Identifying Urban Functional Regions by Integrating Sentence Embedding Model and Points of Interest.” ISPRS International Journal of Geo-Information 10 (5): 339. Article 5. https://doi.org/10.3390/ijgi10050339.
  • Tobler, W. R. 1970. “A Computer Movie Simulating Urban Growth in the Detroit Region.” Economic Geography 46:234–240. https://doi.org/10.2307/143141.
  • van Meeteren, M., A. Poorthuis, B. Derudder, and F. Witlox. 2016. “Pacifying Babel’s Tower: A Scientometric Analysis of Polycentricity in Urban Research.” Urban Studies 53 (6): 1278–1298. https://doi.org/10.1177/0042098015573455.
  • Wuhan. (2021). In Wikipedia. https://en.wikipedia.org/w/index.php?title=Wuhan&oldid=1035568696.
  • Wu, C., D. Smith, and M. Wang. 2021. “Simulating the Urban Spatial Structure with Spatial Interaction: A Case Study of Urban Polycentricity Under Different Scenarios.” Computers, Environment and Urban Systems 89:101677. https://doi.org/10.1016/j.compenvurbsys.2021.101677.
  • Wu, N., X. W. Zhao, J. Wang, and D. Pan 2020. “Learning Effective Road Network Representation with Hierarchical Graph Neural Networks”. Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 6–14. https://doi.org/10.1145/3394486.3403043.
  • Xie, J., S. Kelley, and B. K. Szymanski. 2013. “Overlapping Community Detection in Networks: The State-Of-The-Art and Comparative Study.” ACM Computing Surveys (CSUR) 45 (4): 1–35. https://doi.org/10.1145/2501654.2501657.
  • Xu, Y., S.-L. Shaw, Z. Zhao, L. Yin, F. Lu, J. Chen, Z. Fang, and Q. Li. 2016. “Another Tale of Two Cities: Understanding Human Activity Space Using Actively Tracked Cellphone Location Data.” Annals of the American Association of Geographers 106 (2): 489–502. https://doi.org/10.1080/00045608.2015.1120147.
  • Xu, D., C. Wei, P. Peng, Q. Xuan, and H. Guo. 2020. “GE-GAN: A Novel Deep Learning Framework for Road Traffic State Estimation.” Transportation Research Part C: Emerging Technologies 117:102635. https://doi.org/10.1016/j.trc.2020.102635.
  • Xu, Y., Z. Xie, Z. Chen, and M. Xie. 2021. “Measuring the Similarity Between Multipolygons Using Convex Hulls and Position Graphs.” International Journal of Geographical Information Science 35 (5): 847–868. https://doi.org/10.1080/13658816.2020.1800016.
  • Xu, Y., Z. Xie, L. Wu, and Z. Chen. 2019. “Multilane Roads Extracted from the OpenStreetmap Urban Road Network Using Random Forests.” Transactions in GIS 23 (2): 224–240. https://doi.org/10.1111/tgis.12514.
  • Yang, C., and G. Gidófalvi. 2018. “Fast Map Matching, an Algorithm Integrating Hidden Markov Model with Precomputation.” International Journal of Geographical Information Science 32 (3): 547–570. https://doi.org/10.1080/13658816.2017.1400548.
  • Yao, Y., X. Li, X. Liu, P. Liu, Z. Liang, J. Zhang, and K. Mai. 2017. “Sensing Spatial Distribution of Urban Land Use by Integrating Points-Of-Interest and Google Word2Vec Model.” International Journal of Geographical Information Science 31 (4): 825–848. https://doi.org/10.1080/13658816.2016.1244608.
  • Yao, Y., J. Wang, Y. Hong, C. Qian, Q. Guan, X. Liang, L. Dai, and J. Zhang. 2021. “Discovering the Homogeneous Geographic Domain of Human Perceptions from Street View Images.” Landscape and Urban Planning 212:104125. https://doi.org/10.1016/j.landurbplan.2021.104125.
  • Yu, W. 2018. “Discovering Frequent Movement Paths from Taxi Trajectory Data Using Spatially Embedded Networks and Association Rules.” IEEE Transactions on Intelligent Transportation Systems 20 (3): 855–866. https://doi.org/10.1109/TITS.2018.2834573.
  • Yue, Y., T. Lan, A. G. Yeh, and Q.-Q. Li. 2014. “Zooming into Individuals to Understand the Collective: A Review of Trajectory-Based Travel Behaviour Studies.” Travel Behaviour and Society 1 (2): 69–78. https://doi.org/10.1016/j.tbs.2013.12.002.
  • Yue, Y., Y. Zhuang, A. G. O. Yeh, J.-Y. Xie, C.-L. Ma, and Q.-Q. Li. 2017. “Measurements of POI-Based Mixed Use and Their Relationships with Neighbourhood Vibrancy.” International Journal of Geographical Information Science 31 (4): 658–675. https://doi.org/10.1080/13658816.2016.1220561.
  • Zhang, J., X. Li, Y. Yao, Y. Hong, J. He, Z. Jiang, and J. Sun. 2020. “The Traj2Vec Model to Quantify residents’ Spatial Trajectories and Estimate the Proportions of Urban Land-Use Types.” International Journal of Geographical Information Science. 35(1):193211. (1):193211. https://doi.org/10.1080/13658816.2020.1726923.
  • Zhang, D., L. Sun, B. Li, C. Chen, G. Pan, S. Li, and Z. Wu. 2014. “Understanding Taxi Service Strategies from Taxi GPS Traces.” IEEE Transactions on Intelligent Transportation Systems 16 (1): 123–135. https://doi.org/10.1109/TITS.2014.2328231.
  • Zhang, S., J. Tang, H. Wang, Y. Wang, and S. An. 2017. “Revealing Intra-Urban Travel Patterns and Service Ranges from Taxi Trajectories.” Journal of Transport Geography 61:72–86. https://doi.org/10.1016/j.jtrangeo.2017.04.009.
  • Zhang, X., Y. Xu, W. Tu, and C. Ratti. 2018. “Do Different Datasets Tell the Same Story About Urban Mobility—A Comparative Study of Public Transit and Taxi Usage.” Journal of Transport Geography 70:78–90. https://doi.org/10.1016/j.jtrangeo.2018.05.002.
  • Zheng, Y., L. Capra, O. Wolfson, and H. Yang. 2014. “Urban Computing: Concepts, Methodologies, and Applications.” ACM Trans Intell Syst Technol 5 (3): 55. 38:1-38. https://doi.org/10.1145/2629592.
  • Zheng, Y., Y. Liu, J. Yuan, and X. Xie 2011, September. “Urban Computing with Taxicabs”. Proceedings of the 13th ACM International Conference on Ubiquitous Computing. https://www.microsoft.com/en-us/research/publication/urban-computing-with-taxicabs/.
  • Zhong, C., S. M. Arisona, X. Huang, M. Batty, and G. Schmitt. 2014. “Detecting the Dynamics of Urban Structure Through Spatial Network Analysis.” International Journal of Geographical Information Science 28 (11): 2178–2199. https://doi.org/10.1080/13658816.2014.914521.
  • Zhu, D., Z. Huang, L. Shi, L. Wu, and Y. Liu. 2018. “Inferring Spatial Interaction Patterns from Sequential Snapshots of Spatial Distributions.” International Journal of Geographical Information Science 32 (4): 783–805. https://doi.org/10.1080/13658816.2017.1413192.
  • Zhu, D., N. Wang, L. Wu, and Y. Liu. 2017. “Street as a Big Geo-Data Assembly and Analysis Unit in Urban Studies: A Case Study Using Beijing Taxi Data.” Applied Geography 86:152–164. https://doi.org/10.1016/j.apgeog.2017.07.001.