0000000000156923

AUTHOR

Alexander Gelbukh

0000-0001-7845-9039

showing 2 related works from this author

Defining classifier regions for WSD ensembles using word space features

2006

Based on recent evaluation of word sense disambiguation (WSD) systems [10], disambiguation methods have reached a standstill. In [10] we showed that it is possible to predict the best system for target word using word features and that using this 'optimal ensembling method' more accurate WSD ensembles can be built (3-5% over Senseval state of the art systems with the same amount of possible potential remaining). In the interest of developing if more accurate ensembles, w e here define the strong regions for three popular and effective classifiers used for WSD task (Naive Bayes – NB, Support Vector Machine – SVM, Decision Rules – D) using word features (word grain, amount of positive and neg…

0303 health sciencesProbability learningWord-sense disambiguationComputer sciencebusiness.industryPattern recognition02 engineering and technologyDecision ruleSupport vector machine03 medical and health sciencesNaive Bayes classifier0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingStatistical analysisArtificial intelligencePolysemybusinessClassifier (UML)030304 developmental biology
researchProduct

Case-Sensitivity of Classifiers for WSD: Complex Systems Disambiguate Tough Words Better

2007

We present a novel method for improving disambiguation accuracy by building an optimal ensemble (OE) of systems where we predict the best available system for target word using a priori case factors (e.g. amount of training per sense). We report promising results of a series of best-system prediction tests (best prediction accuracy is 0.92) and show that complex/simple systems disambiguate tough/easy words better. The method provides the following benefits: (1) higher disambiguation accuracy for virtually any base systems (current best OE yields close to 2% accuracy gain over Senseval-3 state of the art) and (2) economical way of building more effective ensembles of all types (e.g. optimal,…

Case sensitivity0303 health sciencesbusiness.industryComputer scienceComplex systemPattern recognition02 engineering and technologyMachine learningcomputer.software_genre03 medical and health sciencesClassifier (linguistics)0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingArtificial intelligenceState (computer science)businesscomputerWord (computer architecture)030304 developmental biology
researchProduct