6533b839fe1ef96bd12a643a

RESEARCH PRODUCT

Linear transform for simultaneous diagonalization of covariance and perceptual metric matrix in image coding

Irene EpifanioJesús MaloJaime Gutierrez

subject

Covariance matrixbusiness.industryPattern recognitionCovarianceWeightingMatrix (mathematics)Redundancy (information theory)Artificial IntelligenceSignal ProcessingDiscrete cosine transformComputer Vision and Pattern RecognitionArtificial intelligencebusinessSoftwareTransform codingMathematicsImage compression

description

Two types ofredundancies are contained in images: statistical redundancy and psychovisual redundancy. Image representation techniques for image coding should remove both redundancies in order to obtain good results. In order to establish an appropriate representation, the standard approach to transform coding only considers the statistical redundancy, whereas the psychovisual factors are introduced after the selection ofthe representation as a simple scalar weighting in the transform domain. In this work, we take into account the psychovisual factors in the de8nition of the representation together with the statistical factors, by means of the perceptual metric and the covariance matrix, respectively. In general the ellipsoids described by these matrices are not aligned. Therefore, the optimal basis for image representation should simultaneously diagonalize both matrices. This approach to the basis selection problem has several advantages in the particular application ofimage coding. As the transform domain is Euclidean (by de8nition), the quantizer design is highly simpli8ed and at the same time, the use ofscalar quantizers is truly justi8ed. The proposed representation is compared to covariance-based representations such as the DCT and the KLT or PCA using standard JPEG-like and Max-Lloyd quantizers. ? 2003 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.

https://doi.org/10.1016/s0031-3203(02)00325-4