Review

Proteomic repository data submission, dissemination, and reuse: key messages

Pages 297-310 | Received 28 Oct 2022, Accepted 07 Dec 2022, Published online: 26 Dec 2022
 

ABSTRACT

Introduction

The creation of the ProteomeXchange data workflows in 2012 transformed the field of proteomics by standardizing data submission and dissemination and enabling the worldwide reanalysis of public MS proteomics data. ProteomeXchange has triggered a growing trend toward public dissemination of proteomics data, facilitating the assessment, reuse, comparative analysis, and extraction of new findings from public datasets. As of 2022, the consortium comprises PRIDE, PeptideAtlas, MassIVE, jPOST, iProX, and Panorama Public.

Areas covered

Here, we review and discuss the current ecosystem of resources, guidelines, and file formats for proteomics data dissemination and reanalysis. Special attention is given to exciting new quantitative and post-translational modification-oriented resources. We also discuss the challenges and future directions for data deposition, including the lack of metadata and of cloud-based, high-performance software solutions for fast and reproducible reanalysis of the available data.

Expert opinion

The success of ProteomeXchange and the amount of proteomics data available in the public domain have triggered the creation and/or growth of other protein knowledgebase resources. Data reuse is a leading, active, and evolving field that supports the creation of new formats, tools, and workflows to rediscover and reshape public proteomics data.

Acknowledgments

I would like to thank Lennart Martens’ team, Eric Deutsch, and Ronald Beavis for providing the number of datasets reanalyzed by the resources and the list of phosphopeptides. Thanks to Juan A. Vizcaino for the feedback and discussions about the ProteomeXchange repositories.

Declaration of interest

The authors have no relevant affiliations or financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties.

Reviewer disclosures

Peer reviewers on this manuscript have no relevant financial or other relationships to disclose.

Geolocation information

Data sharing and reuse have become more common and standard for the proteomics community. This manuscript highlights major databases for storing proteomics data and the major challenges for data submission, dissemination, and reuse.

Data deposition

This manuscript does not contain any new data; no data were generated.

Additional information

Funding

This paper was not funded.
