6533b82afe1ef96bd128b69b

RESEARCH PRODUCT

Aspects Concerning SVM Method’s Scalability

Lucian VinţanMaria VinţanDaniel Morariu

subject

Text document classificationStructured support vector machinebusiness.industryComputer scienceDocument classificationcomputer.software_genreSupport vector machineText miningScalabilityData miningbusinessCluster analysiscomputerClassifier (UML)

description

In the last years the quantity of text documents is increasing continually and automatic document classification is an important challenge. In the text document classification the training step is essential in obtaining a good classifier. The quality of learning depends on the dimension of the training data. When working with huge learning data sets, problems regarding the training time that increases exponentially are occurring. In this paper we are presenting a method that allows working with huge data sets into the training step without increasing exponentially the training time and without significantly decreasing the classification accuracy.

https://doi.org/10.1007/978-3-540-74930-1_13