Modeling Execution Times of Data Mining Problems in Grid Environments
Authors
Abstract
This paper addresses the problem of distributing data mining tasks in Grid environments in order to shorten overall execution times. A text categorization case study shows that the gains depend heavily on the data transfer required to distribute jobs. Therefore, simple and intuitive models of data transfer in the Condor and Alchemi Grid environments are presented. In most cases, the models reliably estimate the execution times of parallelized tasks.