International Journal of Geographical Information Science Volume 35, 2021 - Issue 2

Open access

Research Articles

Land cover harmonization using Latent Dirichlet Allocation

Zhan Li, (a) Canadian Forest Service (Pacific Forestry Centre), Natural Resources Canada, Victoria, Canada

https://orcid.org/0000-0001-6307-5200

Joanne C. White, (a) Canadian Forest Service (Pacific Forestry Centre), Natural Resources Canada, Victoria, Canada. Correspondence: [email protected]

https://orcid.org/0000-0003-4674-0373

Michael A. Wulder, (a) Canadian Forest Service (Pacific Forestry Centre), Natural Resources Canada, Victoria, Canada

https://orcid.org/0000-0002-6942-1896

Txomin Hermosilla, (a) Canadian Forest Service (Pacific Forestry Centre), Natural Resources Canada, Victoria, Canada

https://orcid.org/0000-0002-5445-0360

Andrew M. Davidson, (b) Science and Technology Branch, Agriculture and Agri-Food Canada, Ottawa, Canada; (c) Department of Geography and Environmental Studies, Carleton University, Ottawa, Canada

https://orcid.org/0000-0003-3784-682X

Alexis J. Comber, (d) School of Geography, University of Leeds, Leeds, UK

https://orcid.org/0000-0002-3652-7846

Pages 348-374 | Received 31 Dec 2019, Accepted 08 Jul 2020, Published online: 27 Jul 2020

https://doi.org/10.1080/13658816.2020.1796131
Figures & data

Figure 1. Study area as represented by the source maps: (A) Virtual Land Cover Engine (VLCE) and (B) Annual Crop Inventory (ACI). Note that the ACI map displays only the 35 ACI classes with validation samples in the accuracy assessment. ACI classes without validation samples were merged into higher-level parent classes.

Table 1. Scenarios and approaches used to produce maps in the generalized legend of harmonized land cover (HLC) classes. The coded name indicates the information used to produce the harmonized map (2H). The information used comprises E: error matrices, L: Latent Dirichlet Allocation, and S: semantic affinity scores, with ‘~’ in place of a letter when that information was not used. Source maps are VLCE: Virtual Land Cover Engine, and ACI: Annual Crop Inventory.
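As a reading aid for the coded names, the sketch below decodes a scenario code into the information sources it names. It assumes a fixed three-slot layout (E, L, S in that order, each possibly replaced by ‘~’) followed by the suffix ‘2H’, which is consistent with the codes ‘EL~2H’ and ‘ELS2H’ used in the captions on this page. The function name `decode_scenario` is hypothetical, and the benchmark names VLCE2H and ACI2H do not follow this layout and are rejected.

```python
from __future__ import annotations

# Assumed slot order, inferred from the codes 'EL~2H' and 'ELS2H'.
SLOTS = (
    ("E", "error matrices"),
    ("L", "Latent Dirichlet Allocation"),
    ("S", "semantic affinity scores"),
)


def decode_scenario(code: str) -> list[str]:
    """Return the information sources named by a code such as 'EL~2H'."""
    if len(code) != 5 or not code.endswith("2H"):
        raise ValueError(f"not a three-slot scenario code: {code!r}")
    used = []
    for char, (letter, meaning) in zip(code[:3], SLOTS):
        if char == letter:
            used.append(meaning)   # this information source was used
        elif char != "~":          # '~' marks an unused source
            raise ValueError(f"unexpected character {char!r} in {code!r}")
    return used


if __name__ == "__main__":
    for name in ("EL~2H", "ELS2H"):
        print(name, "->", ", ".join(decode_scenario(name)))
```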

Table 2. Definition of semantic affinity scores between example classes

Table 3. Crosswalk rules from classes in the VLCE legend to classes in the HLC legend

Table 4. Crosswalk rules from classes in the ACI legend to classes in the HLC legend

Figure 2. Reference sample allocation. In the matrix image on the upper left, blanks indicate no such combination and grays indicate no reference sample is allocated. Darker fonts of class names mean higher frequencies. The bar charts on the right of and directly below the image are marginal proportions. Class colors are presented on the right Y-axis and the bottom X-axis of the matrix image. The proportion of reference sample units over areas of agreement and disagreement is presented on the lower-right bar chart

Figure 3. Co-occurrence frequencies (pixel counts) of combinations of source map classes. In the matrix image on the upper left, blanks indicate no such combination. Darker fonts of class names mean higher frequencies. The bar charts on the right of and directly below the image are marginal proportions. Class colors are presented on the right Y-axis and the bottom X-axis of the matrix image. The proportion of combinations in agreement and disagreement is presented on the lower-right bar chart
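The quantities summarized in Figure 3 can be illustrated with a minimal sketch, assuming two co-registered rasters of integer class indices. The toy arrays, the function name `cooccurrence`, and the two crosswalk arrays are all hypothetical. In particular, treating a combination as "in agreement" when both source classes crosswalk to the same harmonized class is an assumption made here for illustration, not a definition taken from this page.

```python
import numpy as np


def cooccurrence(map_a, map_b, n_a, n_b):
    """Count pixels for every (class in map_a, class in map_b) pair."""
    a = np.asarray(map_a).ravel()
    b = np.asarray(map_b).ravel()
    counts = np.zeros((n_a, n_b), dtype=np.int64)
    np.add.at(counts, (a, b), 1)  # unbuffered add handles repeated pairs
    return counts


# Toy rasters standing in for the two source maps (class indices only).
vlce = np.array([[0, 0, 1], [1, 2, 2]])
aci = np.array([[0, 1, 1], [1, 0, 0]])

counts = cooccurrence(vlce, aci, n_a=3, n_b=2)
total = counts.sum()

# Marginal proportions, as in the bar charts beside the matrix image.
row_marginals = counts.sum(axis=1) / total
col_marginals = counts.sum(axis=0) / total

# Hypothetical crosswalks from each source legend to a common legend.
xw_a = np.array([0, 1, 1])
xw_b = np.array([0, 1])
agree_mask = xw_a[:, None] == xw_b[None, :]  # assumed notion of agreement
agreement = counts[agree_mask].sum() / total

print(counts)
print(row_marginals, col_marginals, agreement)
```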

Table 5. Estimates of overall, producer’s ($P_{j}$), and user’s ($U_{i}$) accuracy per HLC class over areas of agreement, with standard error. Codes for harmonization scenarios are fully described in Table 1
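For readers unfamiliar with the three accuracy measures, the sketch below computes them from a small error matrix using the conventional count-based definitions, with rows as map classes $i$ and columns as reference classes $j$. The toy matrix and the function name are hypothetical. The estimates reported in the tables come with standard errors and may rest on a design-based estimator that accounts for the sampling scheme, which this sketch does not attempt to reproduce.

```python
import numpy as np


def accuracy_from_error_matrix(n):
    """Overall, producer's (P_j), and user's (U_i) accuracy from counts.

    n[i, j] = number of sample units mapped as class i whose reference
    label is class j (rows: map classes, columns: reference classes).
    Assumes every row and column has at least one sample unit.
    """
    n = np.asarray(n, dtype=float)
    diag = np.diag(n)                 # correctly labeled units per class
    overall = diag.sum() / n.sum()
    users = diag / n.sum(axis=1)      # U_i = n_ii / row total
    producers = diag / n.sum(axis=0)  # P_j = n_jj / column total
    return overall, producers, users


# Toy two-class error matrix (hypothetical counts).
matrix = [[40, 10],
          [5, 45]]
overall, producers, users = accuracy_from_error_matrix(matrix)
print(overall, producers, users)
```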

Table 6. Estimates of overall, producer’s ($P_{j}$), and user’s ($U_{i}$) accuracy per HLC class over areas of disagreement, with standard error. Codes for harmonization scenarios are fully described in Table 1

Figure 4. Harmonized maps by the approaches (A) ‘EL~2H’, using error matrices and LDA outputs for harmonization; and (B) ‘ELS2H’, using error matrices, LDA outputs, and semantic affinity scores for harmonization

Table 7. For the area of disagreement, the change in accuracy in the harmonized output maps as a function of not using an error matrix, not using an LDA model, and not using semantic affinity scores. Codes for harmonization scenarios are fully described in Table 1

Figure 5. Harmonization by the approach ‘EL~2H’, using error matrices and LDA outputs (Table 1). HLC labels and their class probabilities of all the combinations of source map classes. Blanks indicate no such combination. Darker fonts of class names mean higher frequencies

Figure 6. Harmonization by the approach ‘ELS2H’, using error matrices, LDA outputs, and semantic affinity scores (Table 1). HLC labels and their class probabilities of all the combinations of source map classes. Blanks indicate no such combination. Darker fonts of class names mean higher frequencies

Figure 7. Details of benchmark maps (VLCE2H and ACI2H), harmonized maps (EL~2H and ELS2H), and maps of HLC class probabilities over an example region west of Calgary, Alberta, represented by the red dot in the lower-right map

Data availability statement

The land cover data used in this study are openly available from the National Forest Information System and the Federal Geospatial Platform:

2015 VLCE: https://opendata.nfis.org/downloads/forest_change/CA_forest_VLCE_2015.zip

2015 ACI: https://open.canada.ca/data/en/dataset/ba2645d5-4458-414d-b196-6303ac06c1c9
