5,310
Views
345
CrossRef citations to date
0
Altmetric
Research Articles

EVpedia: an integrated database of high-throughput data for systemic analyses of extracellular vesicles

, , , , , , , , , , , , , , , , & show all
Article: 20384 | Received 05 Jan 2013, Accepted 20 Feb 2013, Published online: 20 Mar 2013

Abstract

Secretion of extracellular vesicles is a general cellular activity that spans the range from simple unicellular organisms (e.g. archaea; Gram-positive and Gram-negative bacteria) to complex multicellular ones, suggesting that this extracellular vesicle-mediated communication is evolutionarily conserved. Extracellular vesicles are spherical bilayered proteolipids with a mean diameter of 20–1,000 nm, which are known to contain various bioactive molecules including proteins, lipids, and nucleic acids. Here, we present EVpedia, which is an integrated database of high-throughput datasets from prokaryotic and eukaryotic extracellular vesicles. EVpedia provides high-throughput datasets of vesicular components (proteins, mRNAs, miRNAs, and lipids) present on prokaryotic, non-mammalian eukaryotic, and mammalian extracellular vesicles. In addition, EVpedia also provides an array of tools, such as the search and browse of vesicular components, Gene Ontology enrichment analysis, network analysis of vesicular proteins and mRNAs, and a comparison of vesicular datasets by ortholog identification. Moreover, publications on extracellular vesicle studies are listed in the database. This free web-based database of EVpedia (http://evpedia.info) might serve as a fundamental repository to stimulate the advancement of extracellular vesicle studies and to elucidate the novel functions of these complex extracellular organelles.

Communication between cells and the environment is essential for single and multicellular organisms. Almost all kinds of simple unicellular organisms (e.g. archaea; Gram-positive and Gram-negative bacteria) to complex multicellular ones secrete nano-sized extracellular vesicles (EVs) for their intercellular communications (Citation1Citation4). These membrane vesicles are spherical bilayered proteolipids with a mean diameter of 20–1,000 nm (Citation2, Citation3, Citation5, Citation6), which are known to contain various bioactive molecules, including proteins, nucleic acids (mRNA, microRNA, rRNA, and tRNA), and lipids (Citation3, Citation6Citation11). EVs derived from archaea and Gram-positive bacteria are called membrane vesicles (Citation1, Citation2), whereas Gram-negative bacterial EVs are called outer membrane vesicles (Citation3). Mammalian cells secrete exosomes and ectosomes (also known as microvesicles) either constitutively or in a regulated manner (Citation4).

Although EVs play several physiological and pathological functions (Citation12, Citation13), and occupy an emerging position in the field of biomarker discovery (Citation14, Citation15), it is difficult to study EVs because they contain various bioactive molecules. Recently, however, to solve this problem, high-throughput analyses were performed on prokaryotic and eukaryotic EVs. For example, many mass spectrometry-based proteomic studies, microarray or next-generation sequencing-based transcriptomic studies, and chromatography-based lipidomic studies on EVs are reported (Citation16Citation18). Over 200,000 vesicular proteins, mRNAs, miRNAs, and lipids identified in non-mammalian eukaryotic and mammalian EVs, have been compiled in the Exocarta database (Citation16, Citation17), which was recently updated into Vesiclepedia (http://www.microvesicles.org; 18). However, there has been no resource on vesicular components (proteins, nucleic acids, and lipids) derived from diverse types of prokaryotic and eukaryotic cells. In addition, an analytical tool for their Gene Ontology enrichment analyses, network analyses of vesicular proteins and mRNAs, and a comparison of vesicular datasets by ortholog identification have not been developed. These types of systematic analyses on vesicular components provide global insights into the mechanisms involved in vesicular cargo-sorting and EV biogenesis as well as the pathophysiological roles of EVs. For example, we recently showed that mammalian vesicular proteins are physically and functionally interconnected to form functional modules involved in EV biogenesis and functions; those data suggest that an EV is a nano-sized extracellular organelle (i.e. nanocosmos) rather than a cellular dust (Citation19).

Here, we present EVpedia (http://evpedia.info), which is an integrated database of high-throughput datasets from EVs launched in January of 2012 with the latest update in November of 2012 (). EVpedia provides information on proteins, mRNAs, miRNAs, and lipids enclosed in prokaryotic, non-mammalian eukaryotic, and mammalian EVs. Moreover, EVpedia also provides an array of tools, such as the search and browse of vesicular components, Gene Ontology enrichment analysis, network analysis of vesicular components, and a comparison of vesicular datasets by ortholog identification. In addition, publications on EV studies are listed in the database. This free web-based database of EVpedia might serve as a fundamental repository to stimulate the advancement of EV studies and to elucidate the novel functions of these complex extracellular organelles.

Fig. 1 Snapshot of EVpedia homepage (http://evpedia.info). General introduction of EVs and statistics of EVpedia are provided in the “Home” menu.

Fig. 1 Snapshot of EVpedia homepage (http://evpedia.info). General introduction of EVs and statistics of EVpedia are provided in the “Home” menu.

Overall structure of EVpedia

For a systematic exploration of high-throughput datasets from prokaryotic and eukaryotic EVs, EVpedia has four functional modules: (a) database of high-throughput datasets, (b) search and browse of database, (c) identification of orthologous vesicular proteins and (d) bioinformatic analyses of vesicular components.

A total of 230,937 vesicular components from 190 high-throughput datasets from 130 high-throughput studies were collected in the current EVpedia database (). Among the 190 high-throughput datasets, 166 derived from eukaryotes and 24 from prokaryotes. Vesicular high-throughput datasets, detailed methods for EV isolation, and high-throughput analysis for each dataset were arranged as tables under the “Experiment” menu. User-requested lists of vesicular components are provided in the search and browse function. A new vesicular high-throughput dataset can be submitted to the database by the “Upload” menu.

Table I Statistics of EVpedia, Exocarta, and Vesiclepedia

To analyze protein or mRNA lists, EVpedia provides Gene Ontology enrichment analysis and network analysis. In addition, for comparison of vesicular proteome and transcriptome (mRNA) from different strains, we provide ortholog identification among those species in EVpedia. Based on the ortholog information, one can compare lists of vesicular proteins or mRNAs among different species with set analysis. For all the analyses, the molecule lists from the EVpedia and the new molecule lists from the users are both applicable.

An overall comparison of EVpedia with Exocarta and Vesiclepedia is shown in and . These three web-based repositories contain proteomic, transcriptomic and lipidomic studies of non-mammalian eukaryotic and mammalian EVs. However, only EVpedia provides additional proteomic and lipidomic studies on prokaryotic EVs (). Moreover, EVpedia also provides the information on EV-related publications () as well as an array of analytic tools (), including (a) Gene Ontology enrichment analysis of vesicular components, (b) network analyses of vesicular components and (c) set analysis – a comparison of vesicular proteome and transcriptome data sets with ortholog identification. Currently, EVpedia collects vesicular proteomes identified only with high-throughput studies, but not immunoblotting or immunoelectron microscopy. We will expand the EVpedia database by adding these low-throughput protein datasets to address the hypothesis-driven or biological questions on EVs.

Table II Analytic tools in EVpedia, Exocarta, and Vesiclepedia

Search and browse of EVpedia

In the “Search” menu, EVpedia provides two web interfaces: (a) search with protein names, UniProt accessions, or UniProt IDs (“Search - Protein search” menu) and (b) search with amino acid sequences in the plain format (“Search - Sequence search” menu). In addition, all vesicular components in EVpedia can be browsed in the “Browse” () and the “Experiment” menu, and these tables can be downloaded in the tab-separated values (TSV) format.

Fig. 2 Browse function of EVpedia. With “Browse” menu, the protein list from the individual dataset or all the datasets can be browsed. In this figure, we browsed the vesicular proteins of the urine from normal donors (Citation20).

Fig. 2 Browse function of EVpedia. With “Browse” menu, the protein list from the individual dataset or all the datasets can be browsed. In this figure, we browsed the vesicular proteins of the urine from normal donors (Citation20).

Gene ontology enrichment, network, and set analyses of EVpedia

The “Analysis” menu in EVpedia provides an array of bioinformatic analysis tools: (a) Gene Ontology enrichment and network analyses of the vesicular components and (b) set analysis among more than two different sets of the vesicular proteins or mRNAs through ortholog identification.

Through Gene Ontology enrichment analysis is in the “Analysis – Gene Ontology enrichment analysis” menu, the enriched terms (i.e. Gene Ontology biological process, molecular function, and cellular component) of vesicular components can be obtained (e.g. mRNAs in a). Via network analysis in the “Analysis – Network analysis” menu, functional relationships among vesicular components can be drawn into biological networks (e.g. mRNAs in b).

Fig. 3 Gene Ontology enrichment and network analyses. All analyses in this figure involved mRNA transcriptome of EVs derived from Homo sapiens mast cell HMC-1 (Citation21). In the “Analysis – Gene Ontology enrichment analysis” menu (a), by defining the species from which the analyzed list of proteins originates, the type of Gene Ontology terms (i.e. biological process, molecular function, and cellular component), and the cut-off of the enrichment p-value, the Gene Ontology enrichment analysis can be performed. In “Analysis – Network analysis” menu (b), by defining the species to which the functional interactome data belong, the number of additional nodes, and confidence of the interactome data, one can perform the network analysis of a protein list in EVpedia.

Fig. 3 Gene Ontology enrichment and network analyses. All analyses in this figure involved mRNA transcriptome of EVs derived from Homo sapiens mast cell HMC-1 (Citation21). In the “Analysis – Gene Ontology enrichment analysis” menu (a), by defining the species from which the analyzed list of proteins originates, the type of Gene Ontology terms (i.e. biological process, molecular function, and cellular component), and the cut-off of the enrichment p-value, the Gene Ontology enrichment analysis can be performed. In “Analysis – Network analysis” menu (b), by defining the species to which the functional interactome data belong, the number of additional nodes, and confidence of the interactome data, one can perform the network analysis of a protein list in EVpedia.

Moreover, EVpedia provides comparative analyses among more than two different sets of vesicular components by the “Analysis – Set analysis” menu. For example, we selected two sets of vesicular proteins (Citation22): Homo sapiens colorectal cancer cell SW480 and SW620 (). The Venn diagram in a shows the number of members in the set intersection and the set difference between SW480 and SW620. The lists of each subset in the Venn diagram can be obtained for further analyses, such as the network analysis (b). Note that all of these analyses can be applied to a new list of proteins or mRNAs, including a newly uploaded vesicular proteome or mRNA transcriptome.

Fig. 4 Set analysis of EVpedia. In “Analysis – Set analysis” menu, the dataset of the “proteome+transcriptome (mRNA)”, “transcriptome (miRNA)”, and “lipidome” can be chosen for set analysis. Especially for the datasets of “proteome+transcriptome (mRNA)”, information from ortholog identification is used to build the Venn diagram (a). For example, the Venn diagram is drawn by comparing the ortholog clusters of vesicular proteomes from Homo sapiens SW480 and SW620 colorectal cancer cells (Citation20). The functional network of SW480-specific vesicular proteins can be drawn (b).

Fig. 4 Set analysis of EVpedia. In “Analysis – Set analysis” menu, the dataset of the “proteome+transcriptome (mRNA)”, “transcriptome (miRNA)”, and “lipidome” can be chosen for set analysis. Especially for the datasets of “proteome+transcriptome (mRNA)”, information from ortholog identification is used to build the Venn diagram (a). For example, the Venn diagram is drawn by comparing the ortholog clusters of vesicular proteomes from Homo sapiens SW480 and SW620 colorectal cancer cells (Citation20). The functional network of SW480-specific vesicular proteins can be drawn (b).

EV-related publications in EVpedia

Publications on prokaryotic and eukaryotic EVs are manually curated and stored in the “Publications” menu. With NCBI PubMed search (http://www.ncbi.nlm.nih.gov/pubmed) for text-mining solution, we collected candidate papers related to prokaryotic and eukaryotic EVs: using argosome*, “blebbing vesicle”, “blebbing vesicles”, “budding vesicle”, “budding vesicles”, dexosome*, ectosome*, “extracellular vesicle”, “extracellular vesicles”, exosome*, exovesicle*, “matrix vesicle”, “matrix vesicles”, microparticle*, microvesicle*, “membrane particle”, “membrane particles”, “membrane vesicle”, “membrane vesicles”, nanovesicle*, oncosome*, “outer membrane bleb”, “outer membrane blebs”, prostasome*, “shedding vesicle”, “shedding vesicles”, and tolerosome* as search parameters. All searched publications were manually reviewed to verify whether they are genuinely related to EVs. For example, studies on exosomes of RNA degradation activity (Citation23) were excluded. More detailed information, such as bibliographies, authors, and abstracts was excerpted from NCBI PubMed with automatized Python code (version 2.7.3).

As shown in , the number of studies on prokaryotic and eukaryotic EVs is growing rapidly; this rapid growth indicates that the field of EVs is expanding intensively. In addition, the major principal investigators published a paper on EVs are listed with their EV-study publications as tables in the “Principal investigators” menu. The users can survey the major researchers and their research fields to provide more insights on their EV studies.

Fig. 5 Publications in EVpedia. In “Publications” menu, one can browse papers related to prokaryotic (a) and eukaryotic (b) EVs. The bar graph shows the number of EV publications for each year. One can search the papers with a keyword in the type of “category”, “bibliography”, “title”, “author”, and “title+abstract”.

Fig. 5 Publications in EVpedia. In “Publications” menu, one can browse papers related to prokaryotic (a) and eukaryotic (b) EVs. The bar graph shows the number of EV publications for each year. One can search the papers with a keyword in the type of “category”, “bibliography”, “title”, “author”, and “title+abstract”.

System requirements of EVpedia

When building the EVpedia, we followed the international web standards that are compatible with most existing systems for web browsing. However, we recommend the following system requirements for best performance; operating system – MS Windows 7; internet browser – Google Chrome; resolution – 1,280×1,024.

We have tested the performance of EVpedia on the following systems; operating systems—MS Windows XP/7 and Apple OS X for personal computers, Google Android and Apple iOS for cell phones and tabloids; internet browsers – Google Chrome, Microsoft Internet Explorer, Apple Safari, and Mozilla Firefox. For network analysis, EVpedia requires Java Web Start (http://www.oracle.com/technetwork/java/javase/downloads/index.html).

Conclusion and future directions

EVpedia is an integrated database of high-throughput datasets from EVs derived from prokaryotes and eukaryotes. This database is scheduled to be updated every six months. This free web-based database should be a useful resource to elucidate fundamental roles of EVs derived from prokaryotes and eukaryotes.

Furthermore, for high-quality EV datasets, the unified criteria for high-throughput datasets are needed. First, the coherent standards for EV preparation should be defined. Although the detailed procedures vary with different studies, most studies commonly used combinations of filtration, differential centrifugation, and density gradient centrifugation methods to purify EV. In addition, coordinated standards for high-throughput data production are required because there are many systems and programs to produce and analyze high-throughput data, such as mass spectrometry-based proteomics, microarray-based transcriptomics, next-generation sequencing-based transcriptomics, and chromatography-based lipidomics (Citation24). Therefore, in order to collect together the dispersed data of EVs, it is crucial that a clear and detailed guideline for the preparation of EVs and analysis of their high-throughput data is set up.

Conflict of interest

The authors have declared no conflict of interest.

Acknowledgements

This study was supported by Mid-career Researcher Program (2012-0005634), National Junior Research Fellowship (2012-0008957), and Proteogenomic Research Program through the National Research Foundation of Korea (NRF) funded by the Korean Ministry of Education, Science and Technology (MEST).

References

  • Grimm R, Singh H, Rachel R, Typke D, Zillig W, Baumeister W. Electron tomography of ice-embedded prokaryotic cells. Biophys J. 1998; 74: 1031–42.
  • Lee EY, Choi DY, Kim DK, Kim JW, Park JO, Kim S, etal. Gram-positive bacteria produce membrane vesicles: proteomics-based characterization of Staphylococcus aureus-derived membrane vesicles. Proteomics. 2009; 9: 5425–36.
  • Lee EY, Choi DS, Kim KP, Gho YS. Proteomics in gram-negative bacterial outer membrane vesicles. Mass Spectrometry Rev. 2008; 27: 535–55.
  • Thery C, Ostrowski M, Segura E. Membrane vesicles as conveyors of immune responses. Nat Rev Immunol. 2009; 9: 581–93.
  • Choi DS, Kim DK, Kim YK, Gho YS. Proteomics, transcriptomics, and lipidomics of exosomes and ectosomes. Proteomics. 2013 10.1002/pmic.201200329.
  • Ellen AF, Albers SV, Huibers W, Pitcher A, Hobel CF, Schwarz H, etal. Proteomic analysis of secreted membrane vesicles of archaeal Sulfolobus species reveals the presence of endosome sorting complex components. Extremophiles. 2009; 13: 67–79.
  • Lee EY, Bang JY, Park GW, Choi DS, Kang JS, Kim HJ, etal. Global proteomic profiling of native outer membrane vesicles derived from Escherichia coli. Proteomics. 2007; 7: 3143–53.
  • Simpson RJ, Jensen SS, Lim JW. Proteomic profiling of exosomes: current perspectives. Proteomics. 2008; 8: 4083–99.
  • Valadi H, Ekstrom K, Bossios A, Sjostrand M, Lee JJ, Lotvall JO. Exosome-mediated transfer of mRNAs and microRNAs is a novel mechanism of genetic exchange between cells. Nat Cell Biol. 2007; 9: 654–9.
  • Hong BS, Cho JH, Kim H, Choi EJ, Rho S, Kim J, etal. Colorectal cancer cell-derived microvesicles are enriched in cell cycle-related mRNAs that promote proliferation of endothelial cells. BMC Genomics. 2009; 10: 556.
  • Subra C, Laulagnier K, Perret B, Record M. Exosome lipidomics unravels lipid sorting at the level of multivesicular bodies. Biochimie. 2007; 89: 205–12.
  • Kim CW, Lee HM, Lee TH, Kang C, Kleinman HK, Gho YS. Extracellular membrane vesicles from tumor cells promote angiogenesis via sphingomyelin. Cancer Res. 2002; 62: 6312–7.
  • Park KS, Choi KH, Kim YS, Hong BS, Kim OY, Kim JH, etal. Outer membrane vesicles derived from Escherichia coli induce systemic inflammatory response syndrome. PloS One. 2010; 5: e11334.
  • Pisitkun T, Shen RF, Knepper MA. Identification and proteomic profiling of exosomes in human urine. Proc Nat Acad Sci USA. 2004; 101: 13368–73.
  • Skog J, Wurdinger T, van Rijn S, Meijer DH, Gainche L, Sena-Esteves M, etal. Glioblastoma microvesicles transport RNA and proteins that promote tumour growth and provide diagnostic biomarkers. Nat Cell Biol. 2008; 10: 1470–6.
  • Simpson RJ, Kalra H, Mathivanan S. ExoCarta as a resource for exosomal research. J Extracellular Vesicles. 2012; 1 18374, doi: http://dx.doi.org/10.3402/jev.v1i0.18374.
  • Mathivanan S, Simpson RJ. ExoCarta: a compendium of exosomal proteins and RNA. Proteomics. 2009; 9: 4997–5000.
  • Kalra H, Simpson RJ, Ji H, Aikawa E, Altevogt P, Askenase P, etal. Vesiclepedia: a compendium for extracellular vesicles with continuous community annotation. PLoS Biol. 2012; 10: e1001450.
  • Choi DS, Yang JS, Choi EJ, Jang SC, Park S, Kim OY, etal. The protein interaction network of extracellular vesicles derived from human colorectal cancer cells. J Proteome Res. 2012; 11: 1144–51.
  • Wang Z, Hill S, Luther JM, Hachey DL, Schey KL. Proteomic analysis of urine exosomes by multidimensional protein identification technology (MudPIT). Proteomics. 2012; 12: 329–38.
  • Ekström K, Valadi H, Sjöstrand M, Malmhäll C, Bossios A, Eldh M, etal. Characterization of mRNA and microRNA in human mast cell-derived exosomes and their transfer to other mast cells and blood CD34 progenitor cells. J Extracellular Vesicles. 2012; 1 18389, doi: http://dx.doi.org/10.3402/jev.v1i0.18389.
  • Choi DS, Choi DY, Hong BS, Jang SC, Kim DK, Lee J, etal. Quantitative proteomics of extracellular vesicles derived from human primary and metastatic colorectal cancer cells. J Extracellular Vesicles. 2012; 1 18704, doi: http://dx.doi.org/10.3402/jev.v1i0.18704.
  • van Hoof A, Parker R. The exosome: a proteasome for RNA?. Cell. 1999; 99: 347–50.
  • Pedrioli PG, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, etal. A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol. 2004; 22: 1459–66.