1,449
Views
0
CrossRef citations to date
0
Altmetric
Articles

A hybrid model for high spatial and temporal resolution population distribution prediction

, ORCID Icon, ORCID Icon & ORCID Icon
Pages 2268-2295 | Received 31 Aug 2022, Accepted 02 Dec 2022, Published online: 12 Dec 2022

Figures & data

Table 1. Comparison of conventional, model-based ML, and the proposed hybrid methods

Figure 1. Framework of this study.

Figure 1. Framework of this study.

Figure 2. Road network of Milan.

Figure 2. Road network of Milan.

Table 2. Details of collected data

Figure 3. Distribution of land use in Milan.

Figure 3. Distribution of land use in Milan.

Figure 4. Distribution of one week of mobile phone usage in Milan as a high-resolution grid cell.

Figure 4. Distribution of one week of mobile phone usage in Milan as a high-resolution grid cell.

Figure 5. Data pre-processing workflow.

Figure 5. Data pre-processing workflow.

Figure 6. Process of predicting building type.

Figure 6. Process of predicting building type.

Figure 7. Example of (a) Moore and (b) diamond-shaped neighbourhood.

Figure 7. Example of (a) Moore and (b) diamond-shaped neighbourhood.

Figure 8. Process of combing CA and LSTM.

Figure 8. Process of combing CA and LSTM.

Figure 9. Structure of CAE.

Figure 9. Structure of CAE.

Figure 10. Examples of (a) building outlines and (b) their reconstruction.

Figure 10. Examples of (a) building outlines and (b) their reconstruction.

Figure 11. All building distribution in Milan after prediction.

Figure 11. All building distribution in Milan after prediction.

Table 3. Setting and result of LightGBM.

Figure 12. Relationship between temperature, building density, and population distribution.

Figure 12. Relationship between temperature, building density, and population distribution.

Figure 13. Population distribution of various land use (a–o) and grid cells (p).

Figure 13. Population distribution of various land use (a–o) and grid cells (p).

Figure 14. Relationship between dew point (a), humidity (b), wind speed (c), pressure (d), temperature, wind direction (e), and population distribution; the black line is the median value, and the white line is the mean value.

Figure 14. Relationship between dew point (a), humidity (b), wind speed (c), pressure (d), temperature, wind direction (e), and population distribution; the black line is the median value, and the white line is the mean value.

Figure 15. Cosine similarity between grid cells on days of a week for land use types.

Figure 15. Cosine similarity between grid cells on days of a week for land use types.

Figure 16. Autocorrelation of population distribution for grid cells in different land use types.

Figure 16. Autocorrelation of population distribution for grid cells in different land use types.

Figure 17. Training process of proposed model.

Figure 17. Training process of proposed model.

Figure 18. Comparison of proposed model and other three models (baseline, CA + BPNN, and LSTM)

Figure 18. Comparison of proposed model and other three models (baseline, CA + BPNN, and LSTM)

Figure 19. Importance of each variable (n1 to n12 refers to the code of neighbour cells shown in ).

Figure 19. Importance of each variable (n1 to n12 refers to the code of neighbour cells shown in Figure 7).

Figure 20. Importance of each category.

Figure 20. Importance of each category.

Data availability statement

The authors confirm that the data supporting the findings of this study are available within the article.