CISUC

Speeding-up Text Classification In a GRID Computing Environment

Authors

Abstract

The amount of texts available in digital form has dramatically increased, giving rise to the need of fast text classifiers. The tasks involved can be parallelized and distributed in a GRID environment. This paper reports a study conducted on Reuters-21578 corpus, using a SVM learning machine. The task of text categorization is distributed in several platforms. The results achieved are very promising for speeding-up text categorization tasks and are valid independently of the learning machine.

Keywords

Text Classification, SVM, GRID Computing

Subject

Text mining; SVM;GRID

Related Project

GRID II - Global GRID for Data Mining with Soft Computing on Large Data Bases

Conference

ICMLA 2005, December 2005


Cited by

No citations found