Research Article

MS-CNN: multiscale recognition of building rooftops from high spatial resolution remote sensing imagery

Pages 270-298 | Received 19 Apr 2021, Accepted 03 Dec 2021, Published online: 10 Jan 2022
 

ABSTRACT

Effectively recognizing and precisely locating building rooftops at multiple scales remains a key unresolved problem in applying high-resolution remote sensing. In recent years, automatic target recognition in high-resolution imagery has commonly relied on convolutional neural networks for feature extraction. However, such conventional methods often ignore the multiscale features of geographical objects and lack effective strategies for extracting multiscale information. Drawing on the feature learning capability of deep neural networks, this study proposes a multiscale convolutional neural network, MS-CNN, to recognize building rooftops from high-resolution remote sensing imagery. In addition, this study constructs a pedigree deep learning sample library, based on remote sensing Tupu theory, that considers the spectral and geometric characteristics of building rooftops. Using a feature segmentation mechanism and a fusion enhancement strategy, MS-CNN enriches the receptive fields obtained by each convolution layer. The proposed network is also compared with the widely used Mask R-CNN method, demonstrating the relative advantage of MS-CNN's multiscale design. The experimental results show that the precision and recall of MS-CNN are 4.18% (.8655 vs. .8238) and 5.71% (.8380 vs. .7809) higher than those of Mask R-CNN, respectively. The proposed method has been deployed in practical engineering projects in countries including Vietnam and Myanmar.
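The reported gains can be read as absolute differences between the two methods' scores, expressed in percentage points. The short Python sketch below recomputes them from the four values quoted in the abstract. The names `REPORTED`, `point_gain`, and `f1_score` are illustrative only and do not come from the article. The F1 score is not reported in the abstract; it is derived here from the quoted precision and recall purely as an illustration. The recomputed precision gain comes out at about 4.17 points rather than 4.18, presumably because the published scores are rounded to four decimals.

```python
# Precision and recall exactly as quoted in the abstract.
REPORTED = {
    "MS-CNN": {"precision": 0.8655, "recall": 0.8380},
    "Mask R-CNN": {"precision": 0.8238, "recall": 0.7809},
}


def point_gain(metric: str) -> float:
    """Absolute gain of MS-CNN over Mask R-CNN, in percentage points."""
    return (REPORTED["MS-CNN"][metric] - REPORTED["Mask R-CNN"][metric]) * 100.0


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (derived, not reported in the abstract)."""
    return 2.0 * precision * recall / (precision + recall)


if __name__ == "__main__":
    for metric in ("precision", "recall"):
        print(f"{metric}: +{point_gain(metric):.2f} percentage points")
    for name, scores in REPORTED.items():
        f1 = f1_score(scores["precision"], scores["recall"])
        print(f"{name}: derived F1 = {f1:.4f}")
```

Because both gains are absolute differences, a relative improvement (gain divided by the Mask R-CNN score) would be somewhat larger than the percentages quoted.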

Acknowledgments

The authors would like to thank the anonymous reviewers, the editors, and the scholars for their constructive comments and suggestions, which greatly improved the quality of the manuscript.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This research was funded by the National Natural Science Foundation of China [No. 41301489], the Beijing Natural Science Foundation [No. 4192018, No. 4142013], the Outstanding Youth Teacher Program of Beijing Municipal Education Commission [No. YETP1647, No. 21147518608], the Outstanding Youth Researcher Program of Beijing University of Civil Engineering and Architecture [No. 21082716012], the Fundamental Research Funds for Beijing Universities [No. X18282, No. X20099, No. X20077], the High-resolution remote sensing building information extraction project in Vietnam [No. H20173], and the BUCEA Post Graduate Innovation Project [No. PG2020078].
