Search results for "Computer Vision and Pattern Recognition"

showing 10 items of 997 documents

Color image quality assessment measure using multivariate generalized Gaussian distribution

2014

This paper deals with color image quality assessment in the reduced-reference framework based on natural scenes statistics. In this context, we propose to model the statistics of the steer able pyramid coefficients by a Multivariate Generalized Gaussian distribution (MGGD). This model allows taking into account the high correlation between the components of the RGB color space. For each selected scale and orientation, we extract a parameter matrix from the three color components sub bands. In order to quantify the visual degradation, we use a closed-form of Kullback-Leibler Divergence (KLD) between two MGGDs. Using "TID 2008" benchmark, the proposed measure has been compared with the most i…

FOS: Computer and information sciencesColor histogramColor imagebusiness.industryComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionPattern recognitionColor spaceRGB color spacesymbols.namesakesymbolsPyramid (image processing)Artificial intelligencebusinessDivergence (statistics)Gaussian processGeneralized normal distributionMathematics

researchProduct

On color image quality assessment using natural image statistics

2014

Color distortion can introduce a significant damage in visual quality perception, however, most of existing reduced-reference quality measures are designed for gray scale images. In this paper, we consider a basic extension of well-known image-statistics based quality assessment measures to color images. In order to evaluate the impact of color information on the measures efficiency, two color spaces are investigated: RGB and CIELAB. Results of an extensive evaluation using TID 2013 benchmark demonstrates that significant improvement can be achieved for a great number of distortion type when the CIELAB color representation is used.

FOS: Computer and information sciencesColor histogrambusiness.industryColor imageComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONColor balanceFalse colorColor spaceICC profileColor depthRGB color modelComputer visionArtificial intelligencebusinessMathematics

researchProduct

Using Hankel matrices for dynamics-based facial emotion recognition and pain detection

2015

This paper proposes a new approach to model the temporal dynamics of a sequence of facial expressions. To this purpose, a sequence of Face Image Descriptors (FID) is regarded as the output of a Linear Time Invariant (LTI) system. The temporal dynamics of such sequence of descriptors are represented by means of a Hankel matrix. The paper presents different strategies to compute dynamics-based representation of a sequence of FID, and reports classification accuracy values of the proposed representations within different standard classification frameworks. The representations have been validated in two very challenging application domains: emotion recognition and pain detection. Experiments on…

FOS: Computer and information sciencesComputer Science - Artificial IntelligenceComputer Vision and Pattern Recognition (cs.CV)Speech recognitionFeature extractionComputer Science - Computer Vision and Pattern RecognitionPainLTI system theoryComputer Science - RoboticsLinear time invariant systemRepresentation (mathematics)Hidden Markov modelMathematicsEmotionSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniSequencebusiness.industryPattern recognitiondynamicsClassificationSupport vector machineArtificial Intelligence (cs.AI)Face (geometry)Artificial intelligencebusinessRobotics (cs.RO)Hankel matrix2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

researchProduct

Perceptually Optimized Image Rendering

2017

We develop a framework for rendering photographic images by directly optimizing their perceptual similarity to the original visual scene. Specifically, over the set of all images that can be rendered on a given display, we minimize the normalized Laplacian pyramid distance (NLPD), a measure of perceptual dissimilarity that is derived from a simple model of the early stages of the human visual system. When rendering images acquired with a higher dynamic range than that of the display, we find that the optimization boosts the contrast of low-contrast features without introducing significant artifacts, yielding results of comparable visual quality to current state-of-the-art methods, but witho…

FOS: Computer and information sciencesComputer Science - Artificial IntelligenceImage qualityComputer scienceComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONImage processing02 engineering and technologyLuminanceRendering (computer graphics)Computer Science - GraphicsOptics0202 electrical engineering electronic engineering information engineeringComputer visionPower functionComputingMethodologies_COMPUTERGRAPHICSbusiness.industryDynamic range020207 software engineeringAtomic and Molecular Physics and OpticsGraphics (cs.GR)Electronic Optical and Magnetic MaterialsArtificial Intelligence (cs.AI)Human visual system model020201 artificial intelligence & image processingComputer Vision and Pattern RecognitionArtificial intelligencebusinessImage compression

researchProduct

A blind Robust Image Watermarking Approach exploiting the DFT Magnitude

2019

Due to the current progress in Internet, digital contents (video, audio and images) are widely used. Distribution of multimedia contents is now faster and it allows for easy unauthorized reproduction of information. Digital watermarking came up while trying to solve this problem. Its main idea is to embed a watermark into a host digital content without affecting its quality. Moreover, watermarking can be used in several applications such as authentication, copy control, indexation, Copyright protection, etc. In this paper, we propose a blind robust image watermarking approach as a solution to the problem of copyright protection of digital images. The underlying concept of our method is to a…

FOS: Computer and information sciencesComputer Science - Cryptography and SecurityComputer sciencebusiness.industryComputer Vision and Pattern Recognition (cs.CV)Gaussian blurComputer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONWatermarkFilter (signal processing)Discrete Fourier transformsymbols.namesakeDigital imageGaussian noisesymbolsDiscrete cosine transformComputer visionArtificial intelligencebusinessDigital watermarkingCryptography and Security (cs.CR)Histogram equalization

researchProduct

End-to-end Optimized Image Compression

2016

We describe an image compression method, consisting of a nonlinear analysis transformation, a uniform quantizer, and a nonlinear synthesis transformation. The transforms are constructed in three successive stages of convolutional linear filters and nonlinear activation functions. Unlike most convolutional neural networks, the joint nonlinearity is chosen to implement a form of local gain control, inspired by those used to model biological neurons. Using a variant of stochastic gradient descent, we jointly optimize the entire model for rate-distortion performance over a database of training images, introducing a continuous proxy for the discontinuous loss function arising from the quantizer.…

FOS: Computer and information sciencesComputer Science - Information TheoryComputer Vision and Pattern Recognition (cs.CV)Information Theory (cs.IT)Computer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONData_CODINGANDINFORMATIONTHEORY

researchProduct

Symmetry meets AI

2021

We explore whether Neural Networks (NNs) can {\it discover} the presence of symmetries as they learn to perform a task. For this, we train hundreds of NNs on a {\it decoy task} based on well-controlled Physics templates, where no information on symmetry is provided. We use the output from the last hidden layer of all these NNs, projected to fewer dimensions, as the input for a symmetry classification task, and show that information on symmetry had indeed been identified by the original NN without guidance. As an interdisciplinary application of this procedure, we identify the presence and level of symmetry in artistic paintings from different styles such as those of Picasso, Pollock and Van…

FOS: Computer and information sciencesComputer Science - Machine Learning0303 health sciencesTheoretical computer scienceArtificial neural networkComputer Vision and Pattern Recognition (cs.CV)PhysicsQC1-999Computer Science - Computer Vision and Pattern RecognitionFOS: Physical sciencesGeneral Physics and Astronomy01 natural sciencesMachine Learning (cs.LG)Task (project management)High Energy Physics - Phenomenology03 medical and health sciencesHigh Energy Physics - Phenomenology (hep-ph)0103 physical sciencesHomogeneous spacePICASSOHidden layerSymmetry (geometry)010306 general physics030304 developmental biologySciPost Physics

researchProduct

Learning User's Confidence for Active Learning

2013

In this paper, we study the applicability of active learning in operative scenarios: more particularly, we consider the well-known contradiction between the active learning heuristics, which rank the pixels according to their uncertainty, and the user's confidence in labeling, which is related to both the homogeneity of the pixel context and user's knowledge of the scene. We propose a filtering scheme based on a classifier that learns the confidence of the user in labeling, thus minimizing the queries where the user would not be able to provide a class for the pixel. The capacity of a model to learn the user's confidence is studied in detail, also showing the effect of resolution is such a …

FOS: Computer and information sciencesComputer Science - Machine LearningActive learning (machine learning)Computer scienceComputer Vision and Pattern Recognition (cs.CV)SVM0211 other engineering and technologiesComputer Science - Computer Vision and Pattern RecognitionContext (language use)02 engineering and technologyMachine learningcomputer.software_genreTask (project management)Machine Learning (cs.LG)Classifier (linguistics)0202 electrical engineering electronic engineering information engineeringFOS: Electrical engineering electronic engineering information engineeringbad statesElectrical and Electronic Engineeringphotointerpretationuser's confidence021101 geological & geomatics engineeringActive learning (AL)Pixelbusiness.industryRank (computer programming)Image and Video Processing (eess.IV)very high resolution (VHR) imagery020206 networking & telecommunicationsElectrical Engineering and Systems Science - Image and Video ProcessingClass (biology)General Earth and Planetary SciencesArtificial intelligenceHeuristicsbusinesscomputerIEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING

researchProduct

Active emulation of computer codes with Gaussian processes – Application to remote sensing

2020

Many fields of science and engineering rely on running simulations with complex and computationally expensive models to understand the involved processes in the system of interest. Nevertheless, the high cost involved hamper reliable and exhaustive simulations. Very often such codes incorporate heuristics that ironically make them less tractable and transparent. This paper introduces an active learning methodology for adaptively constructing surrogate models, i.e. emulators, of such costly computer codes in a multi-output setting. The proposed technique is sequential and adaptive, and is based on the optimization of a suitable acquisition function. It aims to achieve accurate approximations…

FOS: Computer and information sciencesComputer Science - Machine LearningActive learningActive learning (machine learning)Computer sciencemedia_common.quotation_subjectMachine Learning (stat.ML)Radiative transfer model02 engineering and technology01 natural sciencesMachine Learning (cs.LG)symbols.namesakeArtificial IntelligenceStatistics - Machine Learning0103 physical sciences0202 electrical engineering electronic engineering information engineeringCode (cryptography)Emulation010306 general physicsFunction (engineering)Gaussian processGaussian process emulatorGaussian processRemote sensingmedia_commonEmulationbusiness.industrySampling (statistics)Remote sensingSignal ProcessingGlobal Positioning Systemsymbols020201 artificial intelligence & image processingComputer codeComputer Vision and Pattern RecognitionbusinessHeuristicsSoftwareDesign of experimentsPattern Recognition

researchProduct

The Tsetlin Machine -- A Game Theoretic Bandit Driven Approach to Optimal Pattern Recognition with Propositional Logic

2018

Although simple individually, artificial neurons provide state-of-the-art performance when interconnected in deep networks. Arguably, the Tsetlin Automaton is an even simpler and more versatile learning mechanism, capable of solving the multi-armed bandit problem. Merely by means of a single integer as memory, it learns the optimal action in stochastic environments through increment and decrement operations. In this paper, we introduce the Tsetlin Machine, which solves complex pattern recognition problems with propositional formulas, composed by a collective of Tsetlin Automata. To eliminate the longstanding problem of vanishing signal-to-noise ratio, the Tsetlin Machine orchestrates the au…

FOS: Computer and information sciencesComputer Science - Machine LearningArtificial Intelligence (cs.AI)Computer Science - Artificial IntelligenceComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionMachine Learning (cs.LG)

researchProduct