Original Articles

Multispectra CWT-Based Algorithm (MCWT) in Mass Spectra for Peak Extraction

Pages 869-882 | Received 22 Oct 2007, Accepted 10 Mar 2008, Published online: 10 Sep 2008
 

Abstract

An important objective in mass spectrometry (MS) is to identify a set of biomarkers that can potentially distinguish patients under distinct treatments (or conditions) from tens or hundreds of spectra. A common two-step approach involving peak extraction and quantification is employed to identify the features of scientific interest. The selected features are then used for further investigation, either to understand the underlying biological mechanisms of individual proteins or to develop genomic biomarkers for early diagnosis. However, inadequate or ineffective peak detection and peak alignment algorithms in the peak extraction step may lead to a high rate of false positives, and reducing the false positive rate is crucial when detecting biomarkers from tens or hundreds of spectra. Here a new procedure is introduced for feature extraction in mass spectrometry data that extends the continuous wavelet transform-based (CWT-based) algorithm to multiple spectra. The proposed multispectra CWT-based algorithm (MCWT) not only performs peak detection for multiple spectra but also carries out peak alignment at the same time. The authors' MCWT algorithm constructs a reference, which integrates information from multiple raw spectra, for feature extraction. The algorithm is applied to a SELDI-TOF mass spectra data set provided by CAMDA 2006 with known polypeptide m/z positions. This new approach is easy to implement, and on this data set it outperforms the existing peak extraction method from the Bioconductor PROcess package.
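The idea of detecting peaks on a single reference built from many spectra can be illustrated with a minimal sketch. It is not the authors' MCWT implementation: the abstract does not say how the reference is constructed, so the pointwise mean used here is an assumption, as are the Mexican-hat (Ricker) wavelet, the scale set, the threshold, and all function names. Real CWT-based peak pickers typically track ridge lines across scales; this sketch simply sums coefficients over a few scales and keeps local maxima. The spectra are synthetic and assumed to share one m/z grid.

```python
import math


def ricker(width, half_len):
    """Mexican-hat (Ricker) wavelet sampled at offsets -half_len..half_len."""
    norm = 2.0 / (math.sqrt(3.0 * width) * math.pi ** 0.25)
    kernel = []
    for t in range(-half_len, half_len + 1):
        x = t / width
        kernel.append(norm * (1.0 - x * x) * math.exp(-0.5 * x * x))
    return kernel


def cwt_at_scale(signal, width):
    """Convolve the signal with a Ricker wavelet of one width (zero-padded edges)."""
    half_len = int(5 * width)
    kernel = ricker(width, half_len)
    n = len(signal)
    coeffs = []
    for i in range(n):
        total = 0.0
        for k, w in enumerate(kernel):
            j = i + k - half_len
            if 0 <= j < n:
                total += w * signal[j]
        coeffs.append(total)
    return coeffs


def reference_spectrum(spectra):
    """ASSUMPTION: the reference is the pointwise mean of the raw spectra."""
    n = len(spectra[0])
    return [sum(s[i] for s in spectra) / len(spectra) for i in range(n)]


def detect_peaks(signal, widths=(2, 3, 4), rel_threshold=0.3):
    """Sum CWT coefficients over scales; keep local maxima above a relative cutoff."""
    rows = [cwt_at_scale(signal, w) for w in widths]
    score = [sum(row[i] for row in rows) for i in range(len(signal))]
    cutoff = rel_threshold * max(score)
    return [
        i
        for i in range(1, len(score) - 1)
        if score[i] > cutoff and score[i] > score[i - 1] and score[i] >= score[i + 1]
    ]


def synthetic_spectrum(shift, n=200, sigma=3.0):
    """Two Gaussian peaks near indices 50 and 120, displaced by `shift` points."""
    centers = ((50 + shift, 10.0), (120 + shift, 6.0))
    return [
        sum(h * math.exp(-((i - c) ** 2) / (2.0 * sigma ** 2)) for c, h in centers)
        for i in range(n)
    ]


# Three spectra whose peaks are slightly misaligned with one another.
spectra = [synthetic_spectrum(shift) for shift in (-1, 0, 1)]
peaks = detect_peaks(reference_spectrum(spectra))
print(peaks)
```

Because every spectrum contributes to the same reference, the peak positions found on it form one shared list, which is why detection and alignment can happen in a single pass in this simplified setting.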

ACKNOWLEDGMENTS

We thank Dr. James J. Chen for his careful reading of the manuscript. This work was supported by grants from the National Science Council (NSC 95-2118-M-039-003-MY2 and NSC 95-2118-M-004-004).

Notes

Δ is the relative distance between an identified peak and its nearest real peak. Numbers in parentheses denote the relative proportions.

Δ is the relative distance between a real peak and its nearest identified peak. Numbers in parentheses denote the relative proportions.
