Using an Out-of-Core Technique for Clustering Large Data Sets

Elio Masciari; Giuseppe Raimondo; Clara Pizzuti; Domenico Talia

doi:10.1109/DEXA.2001.953053

12th International Workshop on Database and Expert Systems Applications

Year: 2001, Pages: 0133

Authors

Elio Masciari, University of Calabria
Giuseppe Raimondo, University of Calabria
Clara Pizzuti, ISI-CNR
Domenico Talia, ISI-CNR

Abstract

Data mining algorithms generally deal with very large data sets that do not fit in main memory, so techniques for managing such data sets are needed. Any algorithm proposed for mining data should account for out-of-core data structures, yet most existing algorithms have not addressed this issue. In this paper we describe the implementation of an out-of-core technique for analyzing very large data sets with the sequential and parallel versions of the clustering algorithm AutoClass. We discuss the out-of-core technique and report performance results in terms of execution time and speedup.
