Search results for "RECOGNITION"

showing 10 items of 3607 documents

Gesture Recognition for Improved User Experience in a Smart Environment

2013

Ambient Intelligence (AmI) is a new paradigm that specifically aims at exploiting sensory and context information in order to adapt the environment to the user's preferences; one of its key features is the attempt to consider common devices as an integral part of the system in order to support users in carrying out their everyday life activities without affecting their normal behavior. Our proposal consists in the definition of a gesture recognition module allowing users to interact as naturally as possible with the actuators available in a smart office, by controlling their operation mode and by querying them about their current state. To this end, readings obtained from a state-of-the-art…

Source dataAmbient intelligenceAmbient Intelligencebusiness.industryComputer scienceGesture RecognitionProbabilistic logicUsabilityMachine learningcomputer.software_genreSupport vector machineGesture recognitionArtificial intelligencebusinessClassifier (UML)computerGesture

researchProduct

An integrated dialect analysis tool using phonetics and acoustics

2019

This study aimed to verify a computational phonetic and acoustic analysis tool created in the MATLAB environment. A dataset was obtained containing 3 broad American dialects (Northern, Western and New England) from the TIMIT database using words that also appeared in the Swadesh list. Each dialect consisted of 20 speakers uttering 10 sentences. Verification using phonetic comparisons between dialects was made by calculating the Levenshtein distance in Gabmap and the proposed software tool. Agreement between the linguistic distances using each analysis method was found. Each tool showed increasing linguistic distance as a function of increasing geographic distance, in a similar shape to Segu…

Space (punctuation)Dialectometry050101 languages & linguisticsLinguistics and LanguageSpeech recognition05 social sciencesPhoneticsLinguistic distanceLevenshtein distance050105 experimental psychologyLanguage and LinguisticsVariation (linguistics)Swadesh listVowel0501 psychology and cognitive sciencesMathematicsLingua

researchProduct

Usage of HMM-Based Speech Recognition Methods for Automated Determination of a Similarity Level Between Languages

2019

The problem of automated determination of language similarity (or even defining of a distance on the space of languages) could be solved in different ways – working with phonetic transcriptions, with speech recordings or both of them. For the recordings, we propose and test a HMM-based one: in the first part of our article we successfully try language detection, afterwards we are trying to calculate distances between HMM-based models, using different metrics and divergences. The Kullback-Leibler divergence is the only one we got good results with – it means that the calculated distances between languages correspond to analytical understanding of similarity between them. Even if it does not …

Space (punctuation)Kullback–Leibler divergenceLanguage identificationSimilarity (network science)Computer scienceSpeech recognitionComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Hidden Markov modelUSableDivergence (statistics)

researchProduct

Optical implementation of the weighted sliced orthogonal nonlinear generalized correlation for nonuniform illumination conditions.

2002

Optical pattern recognition under variations of illumination is an important issue. The sliced orthogonal nonlinear generalized (SONG) correlation has been proposed as an optical pattern recognition tool to discriminate with high efficiency between objects. But, at the same time, the SONG correlation is very sensitive to gray-scale image variations. In a previous work, we expanded the definition of the SONG correlation to the Weighted SONG (WSONG) correlation to modify the discrimination capability in a controlled way. Here, we propose to use the WSONG when pattern recognition is obtained by means of optical correlation under nonuniform illumination. The calculation of the WSONG correlation…

Spatial filterbusiness.industryComputer scienceMaterials Science (miscellaneous)Binary imageNonlinear opticsImage processingIndustrial and Manufacturing EngineeringCorrelationNonlinear systemsymbols.namesakeOpticsFourier transformComputer Science::SoundComputer Science::Computer Vision and Pattern RecognitionPattern recognition (psychology)symbolsBusiness and International ManagementbusinessImage resolutionApplied optics

researchProduct

Variable fractional Fourier processor: a simple implementation

1997

A new set of optical implementations of the fractional Fourier transform (FRT) is developed by use of Wigner matrix algebra. The reinterpretation of some elementary operations that synthesize a rotation in the phase-space domain allows us to propose a lensless setup for obtaining the FRT. This compact configuration is also very flexible, because the fractional degree of the transformation can be varied continuously by shifting the input and the output planes along the optical axis by proper amounts. The above results permit one to build an optical FRT processor formed by two FRT systems in cascade, with a spatial filter between them. We present the design of such a variable FRT processor, w…

Spatial filterbusiness.industryComputer scienceTopologyAtomic and Molecular Physics and OpticsFractional Fourier transformElectronic Optical and Magnetic Materialslaw.inventionLens (optics)Optical axissymbols.namesakeFourier transformOpticsTransformation (function)CascadelawMathematics::Quantum AlgebrasymbolsComputer Vision and Pattern RecognitionbusinessRotation (mathematics)Journal of the Optical Society of America A

researchProduct

Single-channel polychromatic pattern recognition by the use of a joint-transform correlator.

2010

We present a single-channel system for color image recognition that is based on a joint-transform correlator setup. The color images are encoded as phase and amplitude functions, inspired from the Munsell color representation. A real-time implementation of the new codification method can be achieved by the use of a spatial light modulator operating in phase-only modulation mode. We determine the optimal codification for a linear color-phase code. Its performance is compared with a conventional multichannel correlator by means of computer simulations. Experimental results are also presented.

Spatial light modulatorChannel (digital image)Color imageComputer sciencebusiness.industryMaterials Science (miscellaneous)ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONPhase (waves)Industrial and Manufacturing EngineeringOpticsModulationPattern recognition (psychology)Code (cryptography)Business and International ManagementJoint (audio engineering)businessApplied optics

researchProduct

ICA of full complex-valued fMRI data using phase information of spatial maps.

2015

Background ICA of complex-valued fMRI data is challenging because of the ambiguous and noisy nature of the phase. A typical solution is to remove noisy regions from fMRI data prior to ICA. However, it may be more optimal to carry out ICA of full complex-valued fMRI data, since any filtering or voxel-based processing may disrupt information that can be useful to ICA. New method We enable ICA of the full complex-valued fMRI data by utilizing phase information of estimated spatial maps (SMs). The SM phases are first adjusted to properly represent spatial phase changes of all voxels based on estimated time courses (TCs), and then these are used to segment the voxels into BOLD-related and unwant…

Spatial map phaseAdultComputer scienceIndependent component analysis (ICA)Neuroscience(all)computer.software_genreta3112030218 nuclear medicine & medical imaging03 medical and health sciences0302 clinical medicineRobustness (computer science)VoxelImage Processing Computer-AssistedHumansComputer visionInfomaxPhase de-ambiguityta217ta113business.industryGeneral NeuroscienceComplex valuedBrainPattern recognitionMaximizationPhase positioningMagnetic Resonance ImagingComplex-valued fMRI dataPhase maskingSpatial mapsArtificial intelligencebusinesscomputer030217 neurology & neurosurgeryPsychomotor PerformanceJournal of neuroscience methods

researchProduct

Using privacy-transformed speech in the automatic speech recognition acoustic model training

2020

Automatic Speech Recognition (ASR) requires huge amounts of real user speech data to reach state-of-the-art performance. However, speech data conveys sensitive speaker attributes like identity that can be inferred and exploited for malicious purposes. Therefore, there is an interest in the collection of anonymized speech data that is processed by some voice conversion method. In this paper, we evaluate one of the voice conversion methods on Latvian speech data and also investigate if privacy-transformed data can be used to improve ASR acoustic models. Results show the effectiveness of voice conversion against state-of-the-art speaker verification models on Latvian speech and the effectivene…

Speaker verificationevaluationvoice conversionComputer scienceSpeech recognitionautomatic speech recognitionLatvianAcoustic model[INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG]privacylanguage.human_language[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]anonymization[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG][INFO.INFO-CL] Computer Science [cs]/Computation and Language [cs.CL]Identity (object-oriented programming)languageConversion methodautomatic speaker verification

researchProduct

Specific Protein Docking to Chelator Lipid Monolayers Monitored by FT-IR Spectroscopy at the Air–Water Interface

1996

Specific proteinAir water interfaceChemistryAnalytical chemistryInfrared spectroscopyGeneral MedicineGeneral ChemistryCatalysisMolecular recognitionDocking (molecular)MonolayerFt ir spectroscopyPhysical chemistryChelationAngewandte Chemie International Edition in English

researchProduct

Salient Pixels and Dimensionality Reduction for Display of Multi/Hyperspectral Images

2012

International audience; Dimensionality Reduction (DR) of spectral images is a common approach to different purposes such as visualization, noise removal or compression. Most methods such as PCA or band selection use either the entire population of pixels or a uniformly sampled subset in order to compute a projection matrix. By doing so, spatial information is not accurately handled and all the objects contained in the scene are given the same emphasis. Nonetheless, it is possible to focus the DR on the separation of specific Objects of Interest (OoI), simply by neglecting all the others. In PCA for instance, instead of using the variance of the scene in each spectral channel, we show that i…

Spectral Images[ INFO.INFO-TS ] Computer Science [cs]/Signal and Image ProcessingChannel (digital image)Computer scienceMultispectral image0211 other engineering and technologiesComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION02 engineering and technology[ SPI.SIGNAL ] Engineering Sciences [physics]/Signal and Image processingProjection (linear algebra)[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing0202 electrical engineering electronic engineering information engineeringIAPRComputer vision021101 geological & geomatics engineeringSaliencyPixelbusiness.industryDimensionality reductionHyperspectral imagingPattern recognitionDimensionality reductionVisualizationComputer Science::Computer Vision and Pattern Recognition020201 artificial intelligence & image processingArtificial intelligenceFocus (optics)business[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing

researchProduct