505
Views
20
CrossRef citations to date
0
Altmetric
FEATURE

How to Manage Data to Enhance Their Potential for Synthesis, Preservation, Sharing, and Reuse—A Great Lakes Case Study

Cómo manejar datos para incrementar el potencial para su síntesis, preservación, intercambio y reutilización –los Grandes Lagos como caso de estudio

, , , , , , , & show all
Pages 52-64 | Published online: 07 Feb 2013
 

ABSTRACT

Proper data management (applying coordinated standards and structures to data collection, maintenance, retrieval, and documentation) is essential for complex projects to ensure data accuracy and accessibility. In this article, we used a recent project evaluating changes in Lake Whitefish (Coregonus clupeaformis) growth, condition, and recruitment in the Great Lakes as a case study to illustrate how thoughtful data management approaches can enhance and improve research. Data management best practices described include dedicating personnel to data curation, setting data standards, building a relational database, managing data updates, checking for and trapping errors, extracting data, documenting data sets, and coordinating with project collaborators. The data management actions taken ultimately resulted in a rich body of scientific publication and a robust database available for future studies. Investing in data management allowed this project to serve as a model for taking the first steps toward a common goal of sharing, documenting, and preserving data that are collected and reported during the scientific research process.

RESUMEN

en proyectos complejos, un manejo apropiado de datos (aplicación coordinada de estándares y estructuras a recolección, mantenimiento, recuperación y documentación) resulta esencial para asegurar la precisión y accesibilidad de los mismos. En la presente contribución se utiliza un proyecto de evaluación de los cambios en el crecimiento, condición y reclutamiento del coregono en los Grandes Lagos, como caso de estudio para ilustrar cómo un manejo adecuado de datos puede incrementar y mejorar la investigación. Las mejores prácticas en cuanto a manejo de datos incluyen: dedicar personal a la curación de datos, fijar estándares en los datos, construcción de una base de datos relacional, manejo de actualización de datos, revisión y filtro de errores en los datos, extracción de datos, documentación de bases de datos y coordinación con colaboradores del proyecto. Las acciones de manejo de datos que se tomaron resultaron en la producción de un cuerpo importante de publicaciones y en una base de datos robusta, disponible para investigaciones futuras. Los recursos invertidos en el manejo de datos permitieron que este proyecto sirviera de modelo para tomar los primeros pasos hacia el objetivo común de compartir, documentar y preservar datos que son recolectados y reportados durante el proceso de una investigación científica.

ACKNOWLEDGMENTS

We thank E. Volkman, C. Benoit, A. Charlton, R. Cripe, G. Fodor, J. Hoffmeister, V. Lee, A. McAlexander, R. Mollenhauer, S. Shaw, D. Rajchel, B. Williston, and W. Zak for their assistance in the field and the laboratory. Thanks also to M. Ebener and D. Tagerson for assistance with the Lake Superior samples, C. Krause on Lake Erie, and L. Barbeau, D. Frazier, D. Hickey, K. King, T. King, P. Jensen, P. Peeters, B. Peterson, and J. Peterson for Lake Whitefish collections in Lake Michigan. We thank D. Clapp, E. Grove, K. Porath, T. Pattison, D. Wiefreich, J. Witzig, and two anonymous reviewers for their thoughtful comments that helped improve this article. Funding for this project was provided by the Great Lakes Fishery Trust, project number 2004.570, Department of Forestry and Natural Resources at Purdue University, Department of Fisheries and Oceans (to M. Koops) and Environment Canada (to M. Arts).

Notes

1 Visceral fat index

2 in muscle tissue.

a This table is meant to indicate the general advantages/disadvantages of the different tiers of technologies to manage data, and the characterizations do not hold true in all cases.

a Our data were projected in the World Geodetic System (WGS84)—this information belongs in the metadata or data dictionary that describes each sampling parameter in detail.

b Double means that the number is noninteger.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.