Search results for "RECOGNITION"
showing 10 items of 3607 documents
Gesture Recognition for Improved User Experience in a Smart Environment
2013
Ambient Intelligence (AmI) is a new paradigm that specifically aims at exploiting sensory and context information in order to adapt the environment to the user's preferences; one of its key features is the attempt to consider common devices as an integral part of the system in order to support users in carrying out their everyday life activities without affecting their normal behavior. Our proposal consists in the definition of a gesture recognition module allowing users to interact as naturally as possible with the actuators available in a smart office, by controlling their operation mode and by querying them about their current state. To this end, readings obtained from a state-of-the-art…
An integrated dialect analysis tool using phonetics and acoustics
2019
This study aimed to verify a computational phonetic and acoustic analysis tool created in the MATLAB environment. A dataset was obtained containing 3 broad American dialects (Northern, Western and New England) from the TIMIT database using words that also appeared in the Swadesh list. Each dialect consisted of 20 speakers uttering 10 sentences. Verification using phonetic comparisons between dialects was made by calculating the Levenshtein distance in Gabmap and the proposed software tool. Agreement between the linguistic distances using each analysis method was found. Each tool showed increasing linguistic distance as a function of increasing geographic distance, in a similar shape to Segu…
Usage of HMM-Based Speech Recognition Methods for Automated Determination of a Similarity Level Between Languages
2019
The problem of automated determination of language similarity (or even defining of a distance on the space of languages) could be solved in different ways – working with phonetic transcriptions, with speech recordings or both of them. For the recordings, we propose and test a HMM-based one: in the first part of our article we successfully try language detection, afterwards we are trying to calculate distances between HMM-based models, using different metrics and divergences. The Kullback-Leibler divergence is the only one we got good results with – it means that the calculated distances between languages correspond to analytical understanding of similarity between them. Even if it does not …
Optical implementation of the weighted sliced orthogonal nonlinear generalized correlation for nonuniform illumination conditions.
2002
Optical pattern recognition under variations of illumination is an important issue. The sliced orthogonal nonlinear generalized (SONG) correlation has been proposed as an optical pattern recognition tool to discriminate with high efficiency between objects. But, at the same time, the SONG correlation is very sensitive to gray-scale image variations. In a previous work, we expanded the definition of the SONG correlation to the Weighted SONG (WSONG) correlation to modify the discrimination capability in a controlled way. Here, we propose to use the WSONG when pattern recognition is obtained by means of optical correlation under nonuniform illumination. The calculation of the WSONG correlation…
Variable fractional Fourier processor: a simple implementation
1997
A new set of optical implementations of the fractional Fourier transform (FRT) is developed by use of Wigner matrix algebra. The reinterpretation of some elementary operations that synthesize a rotation in the phase-space domain allows us to propose a lensless setup for obtaining the FRT. This compact configuration is also very flexible, because the fractional degree of the transformation can be varied continuously by shifting the input and the output planes along the optical axis by proper amounts. The above results permit one to build an optical FRT processor formed by two FRT systems in cascade, with a spatial filter between them. We present the design of such a variable FRT processor, w…
Single-channel polychromatic pattern recognition by the use of a joint-transform correlator.
2010
We present a single-channel system for color image recognition that is based on a joint-transform correlator setup. The color images are encoded as phase and amplitude functions, inspired from the Munsell color representation. A real-time implementation of the new codification method can be achieved by the use of a spatial light modulator operating in phase-only modulation mode. We determine the optimal codification for a linear color-phase code. Its performance is compared with a conventional multichannel correlator by means of computer simulations. Experimental results are also presented.
ICA of full complex-valued fMRI data using phase information of spatial maps.
2015
Background ICA of complex-valued fMRI data is challenging because of the ambiguous and noisy nature of the phase. A typical solution is to remove noisy regions from fMRI data prior to ICA. However, it may be more optimal to carry out ICA of full complex-valued fMRI data, since any filtering or voxel-based processing may disrupt information that can be useful to ICA. New method We enable ICA of the full complex-valued fMRI data by utilizing phase information of estimated spatial maps (SMs). The SM phases are first adjusted to properly represent spatial phase changes of all voxels based on estimated time courses (TCs), and then these are used to segment the voxels into BOLD-related and unwant…
Using privacy-transformed speech in the automatic speech recognition acoustic model training
2020
Automatic Speech Recognition (ASR) requires huge amounts of real user speech data to reach state-of-the-art performance. However, speech data conveys sensitive speaker attributes like identity that can be inferred and exploited for malicious purposes. Therefore, there is an interest in the collection of anonymized speech data that is processed by some voice conversion method. In this paper, we evaluate one of the voice conversion methods on Latvian speech data and also investigate if privacy-transformed data can be used to improve ASR acoustic models. Results show the effectiveness of voice conversion against state-of-the-art speaker verification models on Latvian speech and the effectivene…
Specific Protein Docking to Chelator Lipid Monolayers Monitored by FT-IR Spectroscopy at the Air–Water Interface
1996
Salient Pixels and Dimensionality Reduction for Display of Multi/Hyperspectral Images
2012
International audience; Dimensionality Reduction (DR) of spectral images is a common approach to different purposes such as visualization, noise removal or compression. Most methods such as PCA or band selection use either the entire population of pixels or a uniformly sampled subset in order to compute a projection matrix. By doing so, spatial information is not accurately handled and all the objects contained in the scene are given the same emphasis. Nonetheless, it is possible to focus the DR on the separation of specific Objects of Interest (OoI), simply by neglecting all the others. In PCA for instance, instead of using the variance of the scene in each spectral channel, we show that i…