1,618
Views
0
CrossRef citations to date
0
Altmetric
Articles

Approaches and tools for user-driven provenance and data quality information in spatial data infrastructures

ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon show all
Pages 1510-1529 | Received 06 Dec 2022, Accepted 30 Mar 2023, Published online: 24 Apr 2023

References

  • AGROVOC. 2022. “Forests.” AGROVOC Multilingual Thesaurus. Accessed 2022/12/01. https://agrovoc.fao.org/browse/agrovoc/en/page/c_3062.
  • Anderson, R. P., M. B. Araújo, A. Guisan, J. M. Lobo, E. Martínez-Meyer, A. T. Peterson, and J. Soberón. 2020. “Optimizing Biodiversity Informatics to Improve Information Flow, Data Quality, and Utility for Science and Society.” Frontiers of Biogeography 12 (e47839): 1–14. doi:10.21425/F5FBG47839.
  • Balbi, S., K. J. Bagstad, A. Magrach, M. J. Sanz, N. Aguilar-Amuchastegui, C. Giupponi, and F. Villa. 2022. “The Global Environmental Agenda Urgently Needs a Semantic Web of Knowledge.” Environmental Evidence 11 (1): 5. doi:10.1186/s13750-022-00258-y.
  • Bastin, J.-F., N. Berrahmouni, A. Grainger, D. Maniatis, D. Mollicone, R. Moore, C. Patriarca, et al. 2017. “The Extent of Forest in Dryland Biomes.” Science 356 (6338): 635–638. doi:10.1126/science.aam6527.
  • Bernard, L., S. Mäs, M. Müller, C. Henzen, and J. Brauner. 2014. “Scientific Geodata Infrastructures: Challenges, Approaches and Directions.” International Journal of Digital Earth 7 (7): 613–633. doi:10.1080/17538947.2013.781244.
  • Bielecka, E. 2015. “Geographical Data Sets Fitness for Use Evaluation.” Geodetski vestnik 59 (2): 335–348. doi:10.15292/geodetski-vestnik.2015.02.335-348.
  • Bullock, E. L., S. P. Healey, Z. Yang, P. Oduor, N. Gorelick, S. Omondi, E. Ouko, and W. B. Cohen. 2021. “Three Decades of Land Cover Change in East Africa.” Land 10 (2): 150. doi:10.3390/land10020150
  • Chazdon, R. L., P. H. Brancalion, L. Laestadius, A. Bennett-Curry, K. Buckingham, C. Kumar, J. Moll-Rocek, I. C. G. Vieira, and S. J. Wilson. 2016. “When is a Forest a Forest? Forest Concepts and Definitions in the era of Forest and Landscape Restoration.” Ambio 45 (5): 538–550. doi:10.1007/s13280-016-0772-y.
  • Closa, G., J. Masó, B. Proß, and X. Pons. 2017. “W3C PROV to Describe Provenance at the Dataset, Feature and Attribute Levels in a Distributed Environment.” Computers, Environment and Urban Systems 64: 103–117. doi:10.1016/j.compenvurbsys.2017.01.008.
  • Comber, A., P. Fisher, F. Harvey, M. Gahegan, and R. Wadsworth. 2006. “Using Metadata to Link Uncertainty and Data Quality Assessments.” In Progress in Spatial Data Handling, edited by A. Riedl, W. Kainz, and G. A. Elmes, 279–292. Berlin, Heidelberg: Springer. doi:10.1007/3-540-35589-8_18.
  • Devillers, R., Y. Bédard, and R. Jeansoulin. 2005. “Multidimensional Management of Geospatial Data Quality Information for its Dynamic Use Within GIS.” Photogrammetric Engineering & Remote Sensing 71 (2): 205–215. doi:10.14358/PERS.71.2.205.
  • Dornelles, A. Z., W. J. Boonstra, I. Delabre, J. M. Denney, R. J. Nunes, A. Jentsch, K. A. Nicholas, M. Schröter, R. Seppelt, and J. Settele. 2022. “Transformation Archetypes in Global Food Systems.” Sustainability Science 1827–1840 . doi:10.1007/s11625-022-01102-5.
  • Dupuy, S. 2019. “Reunion Island – 2017, Reference Spatial Database.” CIRAD Dataverse. doi:10.18167/DVN1/TOARDN.
  • Ehrmann, S., A. Rümmler, and C. Meyer. 2022. “The LUCKINet Land Use Ontology”. doi:10.5281/zenodo.7228852.
  • Ehrmann, S., R. Seppelt, and C. Meyer. 2020. “Harmonise and Integrate Heterogeneous Areal Data with the R Package arealDB.” Environmental Modelling & Software 133: 104799. doi:10.1016/j.envsoft.2020.104799.
  • Estes, L., P. Chen, S. Debats, T. Evans, S. Ferreira, T. Kuemmerle, G. Ragazzo, et al. 2018. “A Large-Area, Spatially Continuous Assessment of Land Cover Map Error and Its Impact on Downstream Analyses.” Global Change Biology 24 (1): 322–337. doi:10.1111/gcb.13904.
  • Figgemeier, H., C. Henzen, and A. Rümmler. 2021. “A Geo-Dashboard Concept for the Interactively Linked Visualization of Provenance and Data Quality for Geospatial Datasets.” AGILE GIScience Ser. 2:25 1–8. doi:10.5194/agile-giss-2-25-2021.
  • Figgemeier, H., A. Rümmler, and C. Henzen. 2022. “A Geospatial Dashboard Prototype for Evaluating Spatial Datasets by Using Semantic Data Concepts and Open Source Libraries.” AGILE GIScience Ser. 3:34 1–34. doi:10.5194/agile-giss-3-34-2022.
  • Fischer, J., L. Egli, and C. Henzen. 2022. “Reporting Relevance, Availability, and Accessibility of Quality Information in Earth System Science Data (Version 1).” Zenodo. doi:10.5281/zenodo.7379019.
  • Foody, G. M. 2009. “Sample Size Determination for Image Classification Accuracy Assessment and Comparison.” International Journal of Remote Sensing 30 (20): 5273–5291. doi:10.1080/01431160903130937.
  • Freitas, A., and E. Curry. 2016. “Big Data Curation.” In New Horizons for a Data-Driven Economy, edited by J. Cavanillas, E. Curry, and W. Wahlster, 87–118. Cham: Springer. doi:10.1007/978-3-319-21569-3_6.
  • Fritz, S., L. See, C. Perger, I. McCallum, C. Schill, D. Schepaschenko, M. Duerauer, et al. 2016. “A Global Dataset of Crowdsourced Land Cover and Land Use Reference Data (2011-2012).” PANGAEA. doi:10.1594/PANGAEA.869682.
  • Gábor, L., W. Jetz, M. Lu, D. Rocchini, A. Cord, M. Malavasi, A. Zarzo-Arias, V. Barták, and V. Moudrý. 2022. “Positional Errors in Species Distribution Modelling are Not Overcome by the Coarser Grains of Analysis.” Methods in Ecology and Evolution n/a (n/a). doi:10.1111/2041-210X.13956.
  • Gadelha Jr, L. M. R., P. C. de Siracusa, E. C. Dalcin, L. A. E. da Silva, D. A. Augusto, E. Krempser, H. M. Affe, et al. 2021. “A Survey of Biodiversity Informatics: Concepts, Practices, and Challenges.” WIRES Data Mining and Knowledge Discovery 11 (1): e1394. doi:10.1002/widm.1394.
  • GEMET. 2021. “Forest.” General Multilingual Environmental Thesaurus. Accessed 2022/12/01. https://www.eionet.europa.eu/gemet/en/concept/3406.
  • Goodchild, M. F. 2007. “Beyond Metadata: Towards User-centric Description of Data Quality.” Proceedings of the 2007), International Symposium on Spatial Data Quality, ISPRS, June 13–15, Enschede.
  • He, L., P. Yue, L. Di, M. Zhang, and L. Hu. 2015. “Adding Geospatial Data Provenance Into SDI – A Service-Oriented Approach.” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 8 (2): 926–936. doi:10.1109/JSTARS.2014.2340737.
  • Henzen, C., A. Rümmler, and M. Wagner. 2021. “Recommendations for Developing a Metadata Profile for Earth System Science Data.” Zenodo 1–35. 10.5281/zenodo.4916698.
  • Herrick, J. E., A. Quandt, C. Kenny, M. Contreras, J. Neff, W. S. Jang, J. Maynard, et al. 2019. “LandPKS (Land Potential Knowledge System): Mobile App for Extension”, Land-Use and Project Planning, M&E and On-Farm Research.
  • Höck, H., F. Toussaint, and H. Thiemann. 2020. “Fitness for Use of Data Objects Described with Quality Maturity Matrix at Different Phases of Data Production.” Data Science Journal 19 (1): 1–14. doi:10.5334/dsj-2020-045.
  • ISO. 2013. ISO 19157:2013 Geographic Information – Data Quality. International Organization for Standardization, Geneva. doi:10.31030/2090178.
  • ISO. 2014. ISO 19115-1:2014 Geographic Information – Metadata – Part 1: Fundamentals. Geneva: International Organization for Standardization.
  • ISO. 2019. ISO 19115-2:2019 Geographic Information – Metadata – Part 2: Extensions for Acquisition and Processing. Geneva: International Organization for Standardization.
  • Ivánová, I., J. Morales, R. A. de By, T. S. Beshe, and M. A. Gebresilassie. 2013. “Searching for Spatial Data Resources by Fitness for Use.” Journal of Spatial Science 58 (1): 15–28. doi:10.1080/14498596.2012.759087.
  • Jiang, L., P. Yue, W. Kuhn, C. Zhang, C. Yu, and X. Guo. 2018. “Advancing Interoperability of Geospatial Data Provenance on the Web: Gap Analysis and Strategies.” Computers & Geosciences 117: 21–31. doi:10.1016/j.cageo.2018.05.001.
  • Jolivot, A., V. Lebourgeois, M. Ameline, V. Andriamanga, B. Bellon, M. Castets, A. Crespin-Boucaud, et al. 2021. “Harmonized In Situ JECAM Datasets for Agricultural Land Use Mapping and Monitoring in Tropical Countries.” CIRAD Dataverse. doi:10.18167/DVN1/P7OLAP.
  • Klein, A.-M., B. E. Vaissière, J. H. Cane, I. Steffan-Dewenter, S. A. Cunningham, C. Kremen, and T. Tscharntke. 2007. “Importance of Pollinators in Changing Landscapes for World Crops.” Proceedings of the Royal Society B: Biological Sciences 274 (1608): 303–313. doi:10.1098/rspb.2006.3721
  • Laurence, D., V. Andriamanga, E. Rasoamalala, S. Dupuy, and P. Burnod. 2019. “Antananarivo – Madagascar – 2017, Land Use Reference Spatial Database.” CIRAD Dataverse. doi:10.18167/DVN1/5TZOOW.
  • Lesiv, M. 2019. “Human Impact on Forests (Version 1)”. doi:10.5281/zenodo.3356758.
  • Leyk, S., A. E. Gaughan, S. B. Adamo, A. de Sherbinin, D. Balk, S. Freire, A. Rose, F. R. Stevens, B. Blankespoor, and C. Frye. 2019. “The Spatial Allocation of Population: A Review of Large-scale Gridded Population Data Products and Their Fitness for use.” Earth System Science Data 11 (3): 1385–1409. doi:10.5194/essd-11-1385-2019.
  • Liangyun, L., G. Yuan, Z. Xiao, C. Xidong, and X. Shuai. 2019. “A Dataset of Global Land Cover Validation Samples (Version v1)”. doi:10.5281/zenodo.3551995
  • Londoño Villegas, M. M., L. Le Mezo, and S. Dupuy. 2019. “Base de données spatiale pour l’apprentissage – 2016-2017- La Réunion”. CIRAD Dataverse. doi:10.18167/DVN1/HM9QNL.
  • Lush, V., J. Lumsden, and L. Bastin. 2018. “Visualisation of Trust and Quality Information for Geospatial Dataset Selection and Use: Drawing Trust Presentation Comparisons with B2C e-Commerce.” In Trust Management XII. IFIPTM 2018. IFIP Advances in Information and Communication Technology, edited by Nurit Gal-Oz and Peter R. Lewis, 75–90. Cham: Springer. doi:10.1007/978-3-319-95276-5_6.
  • Magagna, B., D. Goldfarb, P. Martin, M. Atkinson, S. Koulouzis, and Z. Zhao. 2020. “Data Provenance.” In Towards Interoperable Research Infrastructures for Environmental and Earth Sciences: A Reference Model Guided Approach for Common Challenges, edited by Zhiming Zhao and Margareta Hellström, 208–225. Cham: Springer. doi:10.1007/978-3-030-52829-4_12.
  • Meijer, M., L. A. E. Vullings, J. D. Bulens, F. I. Rip, M. Boss, G. Hazeu, and M. Storm. 2015. “Spatial Data Quality and a Workflow Tool.” The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences XL-3/W3: 55–59. doi:10.5194/isprsarchives-XL-3-W3-55-2015.
  • Meyer, C., P. Weigelt, and H. Kreft. 2016. “Multidimensional Biases, Gaps and Uncertainties in Global Plant Occurrence Information.” Ecology Letters 19 (8): 992–1006. doi:10.1111/ele.12624.
  • Monfreda, C., N. Ramankutty, and J. A. Foley. 2008. “Farming the Planet: 2. Geographic Distribution of Crop Areas, Yields, Physiological Types, and Net Primary Production in the Year 2000.” Global Biogeochemical Cycles 22: 1. doi:10.1029/2007GB002947.
  • Moudrý, V., and R. Devillers. 2020. “Quality and Usability Challenges of Global Marine Biodiversity Databases: An Example for Marine Mammal Data.” Ecological Informatics 56 (101051): 1–10. doi:10.1016/j.ecoinf.2020.101051.
  • Mushtaq, F., M. Henry, C. D. O’Brien, A. Di Gregorio, R. Jalal, J. Latham, D. Muchoney, et al. 2022. “An International Library for Land Cover Legends: The Land Cover Legend Registry.” Land 11 (7): 1083. doi:10.3390/land11071083.
  • NAL Agricultural Thesaurus. 2019. “Forests.” U.S. Department of Agriculture. Accessed 2022/12/01. https://agclass.nal.usda.gov/vocabularies/nalt/concept?uri=https://lod.nal.usda.gov/nalt/18644.
  • Nedd, R., K. Light, M. Owens, N. James, E. Johnson, and A. Anandhi. 2021. “A Synthesis of Land Use/Land Cover Studies: Definitions, Classification Systems, Meta-Studies, Challenges and Knowledge Gaps on a Global Landscape.” Land 10: 9. doi:10.3390/land10090994.
  • Nol, L., P. H. Verburg, G. B. M. Heuvelink, and K. Molenaar. 2008. “Effect of Land Cover Data on Nitrous Oxide Inventory in Fen Meadows.” Journal of Environmental Quality 37 (3): 1209–1219. doi:10.2134/jeq2007.0438.
  • Olofsson, P., S. V. Stehman, C. E. Woodcock, D. Sulla-Menashe, A. M. Sibley, J. D. Newell, M. A. Friedl, and M. Herold. 2012. “A Global Land-cover Validation Data set, Part I: Fundamental Design Principles.” International Journal of Remote Sensing 33 (18): 5768–5788. doi:10.1080/01431161.2012.674230.
  • Peng, G., R. R. Downs, C. Lacagnina, H. Ramapriyan, I. Ivánová, D. Moroni, Y. Wei, G. Larnicol, L. Wyborn, and M. Goldberg. 2021. “Call to Action for Global Access to and Harmonization of Quality Information of Individual Earth Science Datasets.” Data Science Journal 20 (1): 1–15. doi:10.5334/dsj-2021-019.
  • Peng, G., C. Lacagnina, R. R. Downs, A. Ganske, H. K. Ramapriyan, I. Ivánová, L. Wyborn, D. Jones, L. Bastin, and C. L. Shie. 2022. “Global Community Guidelines for Documenting, Sharing, and Reusing Quality Information of Individual Digital Datasets.” Data Science Journal 21 (1): 1–20. doi:10.5334/dsj-2022-008.
  • Percivall, G. 2010. “The Application of Open Standards to Enhance the Interoperability of Geoscience Information.” International Journal of Digital Earth 3 (sup1): 14–30. doi:10.1080/17538941003792751.
  • Portmann, F. T., S. Siebert, and P. Döll. 2010. “MIRCA2000 – Global Monthly Irrigated and Rainfed Crop Areas Around the Year 2000: A New High-Resolution Data set for Agricultural and Hydrological Modeling.” Global Biogeochemical Cycles 24 (1): 1–24. doi:10.1029/2008GB003435.
  • Prestele, R., P. Alexander, M. D. A. Rounsevell, A. Arneth, K. Calvin, J. Doelman, D. A. Eitelberg, et al. 2016. “Hotspots of Uncertainty in Land-use and Land-Cover Change Projections: A Global-scale Model Comparison.” Global Change Biology 22 (12): 3967–3983. doi:10.1111/gcb.13337.
  • Rounsevell, M. D. A., A. Arneth, P. Alexander, D. G. Brown, N. de Noblet-Ducoudré, E. Ellis, J. Finnigan, et al. 2014. “Towards Decision-based Global Land Use Models for Improved Understanding of the Earth System.” Earth Syst. Dynam 5 (1): 117–137. doi:10.5194/esd-5-117-2014.
  • Rümmler, A., H. Figgemeier, and C. Henzen. 2022. “Lösungsansätze zur automatisierten Erfassung und Weiterverarbeitung von strukturierten Provenance-Informationen in Forschungsdateninfrastrukturen am Beispiel von Analyse-Workflows in R.” Bausteine Forschungsdatenmanagement 1: 85–102. doi:10.17192/bfdm.2022.1.8367.
  • Schulp, C. J. E., S. Lautenbach, and P. H. Verburg. 2014. “Quantifying and Mapping Ecosystem Services: Demand and Supply of Pollination in the European Union.” Ecological Indicators 36: 131–141. doi:10.1016/j.ecolind.2013.07.014.
  • Sexton, J. O., P. Noojipady, X.-P. Song, M. Feng, D.-X. Song, D.-H. Kim, A. Anand, et al. 2016. “Conservation Policy and the Measurement of Forests.” Nature Climate Change 6 (2): 192–196. doi:10.1038/nclimate2816.
  • Stehman, S. V., and G. M. Foody. 2019. “Key Issues in Rigorous Accuracy Assessment of Land Cover Products.” Remote Sensing of Environment 231: 111199. doi:10.1016/j.rse.2019.05.018.
  • Szantoi, Z., A. Brink, A. Lupi, C. Mammone, and G. Jaffrain. 2020a. “Land Cover and Change Thematic and Validation Datasets for Sub-Saharan Africa.” PANGAEA. doi:10.1594/PANGAEA.914261.
  • Szantoi, Z., G. N. Geller, N.-E. Tsendbazar, L. See, P. Griffiths, S. Fritz, P. Gong, M. Herold, B. Mora, and A. Obregón. 2020b. “Addressing the Need for Improved Land Cover Map Products for Policy Support.” Environmental Science & Policy 112: 28–35. doi:10.1016/j.envsci.2020.04.005.
  • Tilmes, C., Y. Yesha, and M. Halem. 2010. “Tracking Provenance of Earth Science Data.” Earth Science Informatics 3 (1): 59–65. doi:10.1007/s12145-010-0046-3.
  • Vågen, T.-G., C. Okia, and L. Winowiecki. 2017. “Land and Soil Health Assessment in the Nile-Congo Sentinel Landscape.” World Agroforestry (ICRAF). doi:10.34725/DVN/BOFN8B.
  • Vahidnia, M. H., and H. Vahidi. 2021. “Open Community-based Crowdsourcing Geoportal for Earth Observation Products: A Model Design and Prototype Implementation.” ISPRS International Journal of Geo-Information 10 (1): 24. doi:10.3390/ijgi10010024.
  • VALERI. 2022. “VAlidation of Land European Remote Sensing Instruments.” Accessed 2022/12/05. http://w3.avignon.inra.fr/valeri/fic_htm/database/main.php.
  • Veiga, A. K., A. M. Saraiva, A. D. Chapman, P. J. Morris, C. Gendreau, D. Schigel, and T. J. Robertson. 2017. “A Conceptual Framework for Quality Assessment and Management of Biodiversity Data.” PLOS ONE 12 (6): e0178731. doi:10.1371/journal.pone.0178731.
  • Verburg, P. H., K. Neumann, and L. Nol. 2011. “Challenges in Using Land Use and Land Cover Data for Global Change Studies.” Global Change Biology 17 (2): 974–989. doi:10.1111/j.1365-2486.2010.02307.x.
  • Wagner, M., and C. Henzen. 2022. “Quality Assurance for Spatial Research Data.” ISPRS International Journal of Geo-Information 11: 6. doi:10.3390/ijgi11060334.
  • Wagner, M., C. Henzen, and R. Müller-Pfefferkorn. 2021. “A Research Data Infrastructure Component for the Automated Metadata and Data Quality Extraction to Foster the Provision of FAIR Data in Earth System Sciences.” AGILE GIScience Ser 2: 41. doi:10.5194/agile-giss-2-41-2021.
  • Wentz, E. A., and M. Shimizu. 2018. “Measuring Spatial Data Fitness-for-Use Through Multiple Criteria Decision Making.” Annals of the American Association of Geographers 108 (4): 1150–1167. doi:10.1080/24694452.2017.1411246.
  • Whitfield, P. H. 2012. “Why the Provenance of Data Matters: Assessing Fitness for Purpose for Environmental Data.” Canadian Water Resources Journal / Revue Canadienne des Ressources Hydriques 37 (1): 23–36. doi:10.4296/cwrj3701866.
  • WIKIDATA. 2022. “Forest.” Accessed 2022/12/01. https://www.wikidata.org/wiki/Q4421.
  • Wilkinson, M. D., M. Dumontier, I. J. Aalbersberg, G. Appleton, M. Axton, A. Baak, N. Blomberg, et al. 2016. “The FAIR Guiding Principles for Scientific Data Management and Stewardship.” Scientific Data 3 (1): 160018. doi:10.1038/sdata.2016.18.
  • Wüest, R. O., N. E. Zimmermann, D. Zurell, J. M. Alexander, S. A. Fritz, C. Hof, H. Kreft, et al. 2020. “Macroecology in the age of Big Data – Where to go from Here?” Journal of Biogeography 47 (1): 1–12. doi:10.1111/jbi.13633.
  • Yang, X., J. D. Blower, L. Bastin, V. Lush, A. Zabala, J. Masó, D. Cornford, P. Díaz, and J. Lumsden. 2013. “An Integrated View of Data Quality in Earth Observation.” Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 371 (1983): 20120072. doi:10.1098/rsta.2012.0072.
  • Yu, Q., L. You, U. Wood-Sichra, Y. Ru, A. K. B. Joglekar, S. Fritz, W. Xiong, M. Lu, W. Wu, and P. Yang. 2020. “A Cultivated Planet in 2010 – Part 2: The Global Gridded Agricultural-Production Maps.” Earth System Science Data 12 (4): 3545–3572. doi:10.5194/essd-12-3545-2020.
  • Zabala, A., J. Masó, L. Bastin, G. Giuliani, and X. Pons. 2021. “Geospatial User Feedback: How to Raise Users’ Voices and Collectively Build Knowledge at the Same Time.” ISPRS International Journal of Geo-Information 10 (3): 141. doi:10.3390/ijgi10030141.