Abstract
Proteomic technology has been found promising for classifying complex diseases that leads to early prediction. However, for effective classification, the extraction of good features that can represent the identities of different classes plays the frontal critical factor for any classification problems. In addition, another major problem associated with pattern recognition is how to effectively handle a large feature space. This paper addresses these two frontal issues for mass spectrometry (MS) classification. We apply the theory of linear predictive coding to extract features and fuzzy vector quantization to reduce the large feature space of MS data. The minimization of the fuzzy scattering matrix in the setting of the fuzzy c-means algorithm provides better grouping for feature classification. The proposed methodology was tested using two MS-based cancer datasets and the results are promising.