In praise of artifice reloaded: Caution with natural image databases in modeling vision

6533b7d1fe1ef96bd125c1c2

RESEARCH PRODUCT

In praise of artifice reloaded: Caution with natural image databases in modeling vision

Marina Martinez-garcia Marina Martinez-garcia Marcelo Bertalmío Jesús Malo

subject

Subjective image quality databases Image quality Computer science Normalization (image processing)02 engineering and technology computer.software_genre Contrast masking Image (mathematics)lcsh:RC321-571 03 medical and health sciences 0302 clinical medicine Wavelet 0202 electrical engineering electronic engineering information engineering Psychophysics Natural (music)Wavelet + divisive normalization subjective image quality databases lcsh:Neurosciences. Biological psychiatry. Neuropsychiatry Artificial stimuli Original Research Natural stimuli wavelet + divisive normalization Database General Neuroscience contrast masking Range (mathematics)Norm (artificial intelligence)natural stimuli 020201 artificial intelligence & image processing artificial stimuli computer 030217 neurology & neurosurgery Neuroscience

description

Subjective image quality databases are a major source of raw data on how the visual system works in naturalistic environments. These databases describe the sensitivity of many observers to a wide range of distortions of different nature and intensity seen on top of a variety of natural images. Data of this kind seems to open a number of possibilities for the vision scientist to check the models in realistic scenarios. However, while these natural databases are great benchmarks for models developed in some other way (e.g., by using the well-controlled artificial stimuli of traditional psychophysics), they should be carefully used when trying to fit vision models. Given the high dimensionality of the image space, it is very likely that some basic phenomena are under-represented in the database. Therefore, a model fitted on these large-scale natural databases will not reproduce these under-represented basic phenomena that could otherwise be easily illustrated with well selected artificial stimuli. In this work we study a specific example of the above statement. A standard cortical model using wavelets and divisive normalization tuned to reproduce subjective opinion on a large image quality dataset fails to reproduce basic cross-masking. Here we outline a solution for this problem by using artificial stimuli and by proposing a modification that makes the model easier to tune. Then, we show that the modified model is still competitive in the large-scale database. Our simulations with these artificial stimuli show that when using steerable wavelets, the conventional unit norm Gaussian kernels in divisive normalization should be multiplied by high-pass filters to reproduce basic trends in masking. Basic visual phenomena may be misrepresented in large natural image datasets but this can be solved with model-interpretable stimuli. This is an additional argument in praise of artifice in line with Rust and Movshon (2005).

year	journal	country	edition	language
2019-01-01

10.3389/fnins.2019.00008 http://hdl.handle.net/10261/217874