Generalized k-means algorithm on nominal dataset

S. H. Al-Harbi; A. M. Al-Shahri; Samaher H. Alharbi

doi:10.2495/DATA080051

Back

Generalized k-means algorithm on nominal dataset

Conference proceeding

Open access

Generalized k-means algorithm on nominal dataset

S. H. Al-Harbi, A. M. Al-Shahri and Samaher H. Alharbi

DATA MINING IX: DATA MINING, PROTECTION, DETECTION AND OTHER SECURITY TECHNOLOGIES, Vol.40, pp.43-51

WIT Transactions on Information and Communication Technologies

01/01/2008

DOI: https://doi.org/10.2495/DATA080051

Abstract

Computer Science

Computer Science, Artificial Intelligence

Computer Science, Interdisciplinary Applications

Science & Technology

Technology

Clustering has typically been a problem related to continuous fields. However, in data mining, often the data values are nominal and cannot be assigned meaningful continuous substitutes. The largest advantage of the k-means algorithm in data mining applications is its efficiency in clustering large data sets. The k-means algorithm usually uses the simple Euclidean metric which is only suitable for hyperspherical clusters, and its use is limited to numeric data. This paper extends our work on the D-CV metric which was introduced to deal with nominal data, and then demonstrates how the popular k-means clustering algorithm can be profitably modified to deal with the D-CV metric. Having adapted the k-means algorithm, the D-CV metric will be implemented and the results examined. With this development.

Files and links (1)

url

https://doi.org/10.2495/DATA080051View

Published (Version of record) Open

Metrics

1 Record Views

Details

Title: Generalized k-means algorithm on nominal dataset
Creators - without role: S. H. Al-Harbi - Ctr Informat Technol, Riyadh, Saudi Arabia
A. M. Al-Shahri - Ctr Informat Technol, Riyadh, Saudi Arabia
Samaher H. Alharbi - King Saud Bin Abdulaziz University for Health Sciences
Contributors - without role: A Zanasi
D A Gomar
NFF Ebecken
C A Brebbia
Publication Details: DATA MINING IX: DATA MINING, PROTECTION, DETECTION AND OTHER SECURITY TECHNOLOGIES, Vol.40, pp.43-51
Series: WIT Transactions on Information and Communication Technologies
Publisher: Wit Press
Number of pages: 9
Identifiers: 9921321308331
Academic Unit: King Saud Bin Abdulaziz University for Health Sciences
Language: English
Resource Type: Conference proceeding