Classification of biological sequences by using a Data Mining approach

M Maddouri; M Elloumi

Conference proceeding

Classification of biological sequences by using a Data Mining approach

METMBS'01: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, pp.54-60

01/01/2001

Abstract

Biochemical Research Methods

Biochemistry & Molecular Biology

Computer Science

Computer Science, Interdisciplinary Applications

Engineering

Engineering, Biomedical

Life Sciences & Biomedicine

Science & Technology

Technology

In Molecular Biology, biological macromolecules, like DesaxyriboNucleic Acides (DNA) and proteins are coded by strings, called primary structures. For long time, Biologists gather these primary structures in large databases. Now, they focus on analyzing these primary, structures in order to extract useful knowledge. Data Mining approaches can be helpful to reach this goal, In this paper, we present a data mining approach based on Machine Learning techniques to do classification of biological sequences. B using our approach, we proceed within four steps : (i) During the first step, we construct the set of all the discriminant substrings, called Discriminant Descriptor (DD), associated with each family of primary structures, This construction is made thinks to an adaptation of the Karp, Miller and Rosenberg (KMR) algorithm. (ii) During the second step, we use the DDs constructed during the First step to code the families of primary structures by a table of examples versus attributes, called context. (iii) During the third step, we extract knowledge from the context constructed during the second step and represent it by production rules. This extraction is made by using an incremental. production rule approach. (iv) Finally, during the last steps we use the obtained production rules to do classification of primary structures.

Metrics

1 Record Views

Details

Title: Classification of biological sequences by using a Data Mining approach
Creators - without role: M Maddouri
M Elloumi
Contributors - without role: F Valafar
Publication Details: METMBS'01: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, pp.54-60
Publisher: C S R E A Press
Number of pages: 7
Identifiers: 9933205408331
Academic Unit: University of Jeddah; University of Bisha
Language: English
Resource Type: Conference proceeding