Search results for "ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION"
showing 10 items of 982 documents
Streams as Seams: Carving trajectories out of the time-frequency matrix
2020
A time-frequency representation of sound is commonly obtained through the Short-Time Fourier Transform. Identifying and extracting the prominent frequency components of the spectrogram is important for sinusoidal modeling and sound processing. Borrowing a known image processing technique, known as seam carving, we propose an algorithm to track and extract the sinusoidal components from the sound spectrogram. Experiments show how this technique is well suited for sound whose prominent frequency components vary both in amplitude and in frequency. Moreover, seam carving naturally produces some auditory continuity effects. We compare this algorithm with two other sine extraction techniques, bas…
An improved photographic method to estimate the shading effect of obstructions
2012
Abstract A new photographic method is presented to evaluate the shading effects of obstructions on surfaces exposed to the sun. The method overcomes the difficulties caused by the need to accurately describe the surrounding objects to estimate the shading effects by means of the usual tools that use the spatial reconstruction of obstructions or cylindrical or polar suncharts. The photographs of the surrounding objects are used as the background on which the solar disc is depicted at the various hours of the day. In this way it is easily detectable if the sun is visible from the place where the photographs were taken or if the surrounding obstructions obscure the sun. In spite of the complex…
Prnu Pattern Alignment for Images and Videos Based on Scene Content
2019
This paper proposes a novel approach for registering the PRNU pattern between different camera acquisition modes by relying on the imaged scene content. First, images are aligned by establishing correspondences between local descriptors: The result can then optionally be refined by maximizing the PRNU correlation. Comparative evaluations show that this approach outperforms those based on brute-force and particle swarm optimization in terms of reliability, accuracy and speed. The proposed scene-based approach for PRNU pattern alignment is suitable for video source identification in multimedia forensics applications.
Deep Motion Model for Pedestrian Tracking in 360 Degrees Videos
2019
This paper proposes a deep convolutional neural network (CNN) for pedestrian tracking in 360◦ videos based on the target’s motion. The tracking algorithm takes advantage of a virtual Pan-Tilt-Zoom (vPTZ) camera simulated by means of the 360◦ video. The CNN takes in input a motion image, i.e. the difference of two images taken by using the vPTZ camera at different times by the same pan, tilt and zoom parameters. The CNN predicts the vPTZ camera parameter adjustments required to keep the target at the center of the vPTZ camera view. Experiments on a publicly available dataset performed in cross-validation demonstrate that the learned motion model generalizes, and that the proposed tracking algo…
Video object recognition and modeling by SIFT matching optimization
2014
In this paper we present a novel technique for object modeling and object recognition in video. Given a set of videos containing 360 degrees views of objects we compute a model for each object, then we analyze short videos to determine if the object depicted in the video is one of the modeled objects. The object model is built from a video spanning a 360 degree view of the object taken against a uniform background. In order to create the object model, the proposed techniques selects a few representative frames from each video and local features of such frames. The object recognition is performed selecting a few frames from the query video, extracting local features from each frame and looki…
Browser independent content based image resizing for liquid web layouts
2010
A typical problem for webdesigners is to realize pages that can be potentially accessed from a number of display devices with different screen sizes and resolutions. Liquid layouts can help for this purpose. However, they can not typically be applied to images, which need to be rescaled or deformed. In both cases usability could be deteriorated. Content-aware image resizing techniques can help for this goal by rescaling the images to the desired width while preserving important image structures. This paper presents a content-aware resizing technique which can be seamlessly integrated into web pages without any effort from the user. The results from the system application prove its effective…
Composition of SIFT features for robust image representation
2010
In this paper we propose a novel feature based on SIFT (Scale Invariant Feature Transform) algorithm1 for the robust representation of local visual contents. SIFT features have raised much interest for their power of description of visual content characterizing punctual information against variation of luminance and change of viewpoint and they are very useful to capture local information. For a single image hundreds of keypoints are found and they are particularly suitable for tasks dealing with image registration or image matching. In this work we stretched the spatial coverage of descriptors creating a novel feature as composition of keypoints present in an image region while maintaining…
Using Temporal Texture for Content-Based Video Retrieval
2000
Textures evolving over time are called temporal textures and are very common in everyday life. Examples are the smoke flowing or the wavy water of a river. The idea explored in this paper is that image features based on temporal texture could allow a better performance of current content-based video retrieval systems that are mainly based on static characteristics of representative frames, like color and texture. To this aim we analyze the spatio-temporal nature of texture and its application in content-based access to video databases. In particular, we represent temporal texture using the spatio-temporal autoregressive (STAR) model and a variation of self-organizing maps (SOM) where each n…
Multidirectional Scratch Detection and Restoration in Digitized Old Images
2010
Line scratches are common defects in old archived videos, but similar imperfections may occur in printed images, in most cases by reason of improper handling or inaccurate preservation of the support. Once an image is digitized, its defects become part of that image. Many state-of-the-art papers deal with long, thin, vertical lines in old movie frames, by exploiting both spatial and temporal information. In this paper we aim to face with a more challenging and general problem: the analysis of line scratches in still images, regardless of their orientation, color, and shape. We present a detection/restoration method to process this defect.
Unsupervised Clustering in Personal Photo Collections
2008
In this paper we propose a probabilistic approach for the automatic organization of collected pictures aiming at more effective representation in personal photo albums. Images are analyzed and described in two representation spaces, namely, faces and background. Faces are automatically detected, rectified and represented projecting the face itself in a common low dimensional eigenspace. Backgrounds are represented with low-level visual features based on RGB histogram and Gabor filter energy. Face and background information of each image in the collection is automatically organized by mean-shift clustering technique. Given the particular domain of personal photo libraries, where most of the …