6533b7d0fe1ef96bd125a24f

RESEARCH PRODUCT

false

subject

Vocabularybusiness.industryApplied Mathematicsmedia_common.quotation_subjectInformationSystems_INFORMATIONSTORAGEANDRETRIEVALVisual descriptorsComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONCodebookPattern recognitionKnn classifierUniversality (dynamical systems)ComputingMethodologies_PATTERNRECOGNITIONImage representationArtificial intelligenceCluster analysisbusinessAnalysisMathematicsmedia_common

description

Codebook is an effective image representation method. By clustering in local image descriptors, a codebook is shown to be a distinctive image feature and widely applied in object classification. In almost all existing works on codebooks, the building of the visual vocabulary follows a basic routine, that is, extracting local image descriptors and clustering with a user-designated number of clusters. The problem with this routine lies in that building a codebook for each single dataset is not efficient. In order to deal with this problem, we investigate the influence of vocabulary sizes on classification performance and vocabulary universality with the kNN classifier. Experimental results indicate that, under the condition that the vocabulary size is large enough, the vocabularies built from different datasets are exchangeable and universal.