Search results for " processing"
showing 10 items of 7549 documents
Perceptual Image Representations for Support Vector Machine Image Coding
2007
Support-vector-machine image coding relies on the ability of SVMs for function approximation. The size and the profile of the e-insensitivity zone of the support vector regressor (SVR) at some specific image representation determines (a) the amount of selected support vectors (the compression ratio), and (b) the nature of the introduced error (the compression distortion). However, the selection of an appropriate image representation is a key issue for a meaningful design of the e-insensitivity profile. For example, in image-coding applications, taking human perception into account is of paramount relevance to obtain a good rate-distortion performance. However, depending on the accuracy of t…
A novel Bayesian framework for relevance feedback in image content-based retrieval systems
2006
This paper presents a new algorithm for image retrieval in content-based image retrieval systems. The objective of these systems is to get the images which are as similar as possible to a user query from those contained in the global image database without using textual annotations attached to the images. The main problem in obtaining a robust and effective retrieval is the gap between the low level descriptors that can be automatically extracted from the images and the user intention. The algorithm proposed here to address this problem is based on the modeling of user preferences as a probability distribution on the image space. Following a Bayesian methodology, this distribution is the pr…
Real-time people counting system using a single video camera
2008
This is the copy of journal's version originally published in Proc. SPIE 6811. Reprinted with permission of SPIE: http://spie.org/x10.xml?WT.svl=tn7 There is growing interest in video-based solutions for people monitoring and counting in business and security applications. Compared to classic sensor-based solutions the video-based ones allow for more versatile functionalities, improved performance with lower costs. In this paper, we propose a real-time system for people counting based on single low-end non-calibrated video camera. The two main challenges addressed in this paper are: robust estimation of the scene background and the number of real persons in merge-split scenarios. The latter…
Efficient Skin Detection under Severe Illumination Changes and Shadows
2011
International audience; This paper presents an efficient method for human skin color detection with a mobile platform. The proposed method is based on modeling the skin distribution in a log-chromaticity color space which shows good invariance properties to changing illumination. The method is easy to implement and can cope with the requirements of real-world tasks such as illumination variations, shadows and moving camera. Extensive experiments show the good performance of the proposed method and its robustness against abrupt changes of illumination and shadows.
Measuring Effort in Subprocesses of Subtitling
2021
There has been noticeable growth in the use and production of intralingual and interlingual subtitles due to technological advances and accessibility legislation. While the reception of subtitles has been increasingly studied over the years, there are only a few empirical studies that investigate the process of subtitling. This contribution gives initial results from a study that investigates the impact of reference material during post-editing of NMT of audiovisual content via language. The focus is on transcription and translation processes, the two main subprocesses of the complex task of interlingual subtitling. Applying well-established methods from TPR, key-logging and eye tracking, t…
Resolving ambiguities in a grounded human-robot interaction
2009
In this paper we propose a trainable system that learns grounded language models from examples with a minimum of user intervention and without feedback. We have focused on the acquisition of grounded meanings of spatial and adjective/noun terms. The system has been used to understand and subsequently to generate appropriate natural language descriptions of real objects and to engage in verbal interactions with a human partner. We have also addressed the problem of resolving eventual ambiguities arising during verbal interaction through an information theoretic approach.
Challenges and Confusions in Learning Version Control with Git
2014
Scholars agree on the importance of incorporating use of version control systems (VCSs) into computing curricula, so as to be able to prepare students for today’s distributed and collaborative work places. One of the present-day distributed version control systems (DVCSs) is Git, the system we have used on several courses. In this paper, we report on the challenges for learning and using the system based on a survey data collected from a project-based course and our own teaching experiences during several different kinds of computing courses. The results of this analysis are discussed and recommendations are made. peerReviewed
A convolutional neural network framework for blind mesh visual quality assessment
2017
In this paper, we propose a new method for blind mesh visual quality assessment using a deep learning approach. To do this, we first extract visual representative features by computing locally curvature and dihedral angles from each distorted mesh. Then, we determine from these features a set of 2D patches which are learned to a convolutional neural network (CNN). The network consists of two convolutional layers with two max-pooling layers. Then, a multilayer perceptron (MLP) with two fully connected layers is integrated to summarize the learned representation into an output node. With this network structure, feature learning and regression are used to predict the quality score of a given d…
A Deep Learning Approach for Automated Fault Detection on Solar Modules Using Image Composites
2021
Aerial inspection of solar modules is becoming increasingly popular in automatizing operations and maintenance in large-scale photovoltaic power plants. Current practices are typically time-consuming as they make use of manual acquisitions and analysis of thousands of images to scan for faults and anomalies in the modules. In this paper, we explore and evaluate the use of computer vision and deep learning methods for automating the analysis of fault detection and classification in large scale photovoltaic module installations. We use convolutional neural networks to analyze thermal and visible color images acquired by cameras mounted on unmanned aerial vehicles. We generate composite images…
2020
Abstract Background and objective Deep learning approaches are common in image processing, but often rely on supervised learning, which requires a large volume of training images, usually accompanied by hand-crafted labels. As labelled data are often not available, it would be desirable to develop methods that allow such data to be compiled automatically. In this study, we used a Generative Adversarial Network (GAN) to generate realistic B-mode musculoskeletal ultrasound images, and tested the suitability of two automated labelling approaches. Methods We used a model including two GANs each trained to transfer an image from one domain to another. The two inputs were a set of 100 longitudina…