Search results for "speech recognition"

showing 10 items of 357 documents

How is Visual Recognition Entrained by Auditory Background Rhythms?

2014

AbstractRecent studies have reported that the oscillations of auditory attention entrained by a background rhythmic sequence can influence performance in visual recognition tasks. We have designed an experimental paradigm in which a visual item (either a bisyllabic word or a familiar face) is displayed on screen in two consecutive parts while a musical rhythm is played in the background. Depending on the timing conditions, the first or the second part of the item could be presented either in-synchrony or out-of-synchrony with the beats of the auditory rhythm. In a first series of experiments, participants performed a lexical decision task on bisyllabic 5-letter strings. Results show that wh…

Auditory rhythmSpeech recognitionmedia_common.quotation_subjectPoison controlEntrainmentRhythmPerceptionWord recognitionLexical decision taskGeneral Materials ScienceVisual WordSyllabic verseVisual recognitionLevels-of-processing effectPsychologymedia_commonProcedia - Social and Behavioral Sciences
researchProduct

Video preprocessing for audiovisual indexing

2003

We address the problem of detecting shots of subjects that are interviewed in news sequences. This is useful since usually these kinds of scenes contain important and reusable information that can be used for other news programs. In a previous paper, we presented a technique based on a priori knowledge of the editing techniques used in news sequences which allowed a fast search of news stories (see Albiol, A. et al., 3rd Int. Conf. on Audio and Video-based Biometric Person Authentication, p.366-71, 2001). We now present a new shot descriptor technique which improves the previous search results by using a simple, yet efficient, algorithm, based on the information contained in consecutive fra…

AuthenticationSequenceInformation retrievalContextual image classificationBiometricsComputer scienceSpeech recognitionSearch engine indexingcomputer.software_genreObject detectionReduction (complexity)Face (geometry)PreprocessorAudio signal processingcomputerImage retrievalIEEE International Conference on Acoustics Speech and Signal Processing
researchProduct

Detection of steering direction using EEG recordings based on sample entropy and time-frequency analysis.

2016

Monitoring driver's intentions beforehand is an ambitious aim, which will bring a huge impact on the society by preventing traffic accidents. Hence, in this preliminary study we recorded high resolution electroencephalography (EEG) from 5 subjects while driving a car under real conditions along with an accelerometer which detects the onset of steering. Two sensor-level analyses, sample entropy and time-frequency analysis, have been implemented to observe the dynamics before the onset of steering. Thus, in order to classify the steering direction we applied a machine learning algorithm consisting of: dimensionality reduction and classification using principal-component-analysis (PCA) and sup…

Automobile DrivingSupport Vector MachineComputer scienceSpeech recognitionEntropyElectroencephalography03 medical and health sciencesEntropy (classical thermodynamics)0302 clinical medicine0502 economics and businessAccelerometrymedicineEntropy (information theory)HumansEntropy (energy dispersal)Entropy (arrow of time)050210 logistics & transportationPrincipal Component Analysismedicine.diagnostic_testbusiness.industryEntropy (statistical thermodynamics)Dimensionality reduction05 social sciencesPattern recognitionElectroencephalographyTime–frequency analysisSupport vector machineSample entropyPrincipal component analysisArtificial intelligencebusiness030217 neurology & neurosurgeryAlgorithmsEntropy (order and disorder)Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
researchProduct

Background noise suppression for acoustic localization by means of an adaptive energy detection approach

2008

A microphone array can be employed to localize dominant acoustic sources in a given noisy environment. This capability is successfully used in good signal to noise ratio (SNR) conditions but its accuracy decreases considerably in the presence of other background noise sources. In order to counteract this effect, a novel approach that combines the information provided by a Gaussian energy detector (GED) with the approved localization method SRP-PHAT is presented in this paper. To evaluate the presented technique, several acoustic sources (speech and impulsive sounds) were considered in a variety of different scenarios to demonstrate the robustness and the accuracy of the system proposed.

Background noisesymbols.namesakeMicrophone arraySignal-to-noise ratioComputer Science::SoundComputer scienceRobustness (computer science)AcousticsGaussianSpeech recognitionDetectorsymbolsNoise control2008 IEEE International Conference on Acoustics, Speech and Signal Processing
researchProduct

2017

In continuous flash suppression (CFS), a dynamic noise masker, presented to one eye, suppresses conscious perception of a test stimulus, presented to the other eye, until the suppressed stimulus comes to awareness after few seconds. But what do we see breaking the dominance of the masker in the transition period? We addressed this question with a dual-task in which observers indicated (i) whether the test object was left or right of the fixation mark (localization) and (ii) whether it was a face or a house (categorization). As done recently (Stein et al., 2011), we used two experimental varieties to rule out confounds with decisional strategy. In the terminated mode, stimulus and masker wer…

Binocular rivalrygenetic structuresConscious perceptionSpeech recognitionStimulus (physiology)Test object050105 experimental psychology03 medical and health sciencesBehavioral Neuroscience0302 clinical medicineContinuous flash suppression0501 psychology and cognitive sciencesComputer visionDynamic noiseBiological Psychiatrybusiness.industry05 social sciencesCognitive neuroscience of visual object recognitionPsychiatry and Mental healthNeuropsychology and Physiological PsychologyNeurologyCategorizationArtificial intelligencePsychologybusiness030217 neurology & neurosurgeryFrontiers in Human Neuroscience
researchProduct

Quantitative comparison of motion history image variants for video-based depression assessment

2017

Abstract Depression is the most prevalent mood disorder and a leading cause of disability worldwide. Automated video-based analyses may afford objective measures to support clinical judgments. In the present paper, categorical depression assessment is addressed by proposing a novel variant of the Motion History Image (MHI) which considers Gabor-inhibited filtered data instead of the original image. Classification results obtained with this method on the AVEC’14 dataset are compared to those derived using (a) an earlier MHI variant, the Landmark Motion History Image (LMHI), and (b) the original MHI. The different motion representations were tested in several combinations of appearance-based …

BiometricsComputer scienceSpeech recognitionlcsh:TK7800-836002 engineering and technologyConvolutional neural networkMotion (physics)[SPI]Engineering Sciences [physics]Image processingMachine learning0502 economics and business[ SPI ] Engineering Sciences [physics]0202 electrical engineering electronic engineering information engineeringElectrical and Electronic EngineeringCategorical variableComputingMilieux_MISCELLANEOUSLandmarkbusiness.industrylcsh:Electronics05 social sciencesAffective computingFacial image analysisPattern recognitionMotion history imageMoodSignal ProcessingPattern recognition (psychology)Depression assessment020201 artificial intelligence & image processingArtificial intelligenceF1 scorebusiness050203 business & managementInformation SystemsEURASIP Journal on Image and Video Processing
researchProduct

Mutual information-based feature selection for low-cost BCIs based on motor imagery

2016

In the present study a feature selection algorithm based on mutual information (MI) was applied to electro-encephalographic (EEG) data acquired during three different motor imagery tasks from two dataset: Dataset I from BCI Competition IV including full scalp recordings from four subjects, and new data recorded from three subjects using the popular low-cost Emotiv EPOC EEG headset. The aim was to evaluate optimal channels and band-power (BP) features for motor imagery tasks discrimination, in order to assess the feasibility of a portable low-cost motor imagery based Brain-Computer Interface (BCI) system. The minimal sub set of features most relevant to task description and less redundant to…

Brain-Computer InterfaceSupport Vector MachineDatabases FactualComputer scienceHeadsetSpeech recognitionFeature extractionBiomedical EngineeringReproducibility of ResultHealth InformaticsFeature selection02 engineering and technologyElectroencephalography03 medical and health sciences0302 clinical medicineMotor imagery0202 electrical engineering electronic engineering information engineeringmedicineHumans1707medicine.diagnostic_testbusiness.industryReproducibility of ResultsElectroencephalographyPattern recognitionMutual informationModels TheoreticalAlgorithmSupport vector machineBrain-Computer InterfacesSignal ProcessingSettore ING-INF/06 - Bioingegneria Elettronica E InformaticaEidetic Imagery020201 artificial intelligence & image processingArtificial intelligencebusinessAlgorithms030217 neurology & neurosurgeryHuman2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)
researchProduct

Analysis of neuronal networks in the visual system of the cat using statistical signals--simple and complex cells. Part II.

1978

Superimposing additively a two-dimensional noise process to deterministic input signals (bars) the neurons of area 17 show a class-specific reaction for the task of signal extraction. Moving both parts of the signals simultaneously and varying the signal to noise ratio (S/N) the simple cells achieve the same performance as resulted from the psychophysical experiment. Type I complex cells extract moving deterministic signals (i.e. bars) from the stationary noise, whereas in the answers of Type II complex cells the statistical parts of the signals predominate. Considering the different cell types each as a series of a linear and a nonlinear system one obtains the cell specific space-time freq…

Cerebral CortexNeuronsGeneral Computer ScienceSeries (mathematics)Noise (signal processing)Computer scienceSpeech recognitionModels NeurologicalStatistics as TopicProcess (computing)Complex systemElectrophysiologyForm PerceptionNonlinear systemAmplitudeSignal-to-noise ratioPattern Recognition VisualSimple (abstract algebra)CatsAnimalsVisual PathwaysBiological systemMathematicsBiotechnologyBiological cybernetics
researchProduct

Beyond decomposition: Processing zero-derivations in English visual word recognition

2019

Four experiments investigate the effects of covert morphological complexity during visual word recognition. Zero-derivations occur in English in which a change of word class occurs without any change in surface form (e.g., a boat-to boat; to soak-a soak). Boat is object-derived and is a basic noun (N), whereas soak is action-derived and is a basic verb (V). As the suffix {-ing} is only attached to verbs, deriving boating from its base, requires two steps, boat(N) > boat(V) > boating(V), while soaking can be derived in one step from soak(V). Experiments 1 to 3 used masked priming at different prime durations to test matched sets of one- and two-step verbs for morphological (soaking-SOA…

Cognitive NeuroscienceSpeech recognitionExperimental and Cognitive PsychologyVerbNeuropsychological TestsVocabulary050105 experimental psychology03 medical and health sciencesPrime (symbol)0302 clinical medicineNounReaction TimeHumans0501 psychology and cognitive sciencesLanguageBrain Mapping05 social sciencesPart of speechZero (linguistics)SemanticsNeuropsychology and Physiological PsychologyPattern Recognition VisualCovertSuffixPsychologyPriming (psychology)030217 neurology & neurosurgeryPhotic StimulationCortex
researchProduct

Rapid categorization of sound objects in anesthetized rats as indexed by the electrophysiological mismatch response

2014

It is not known whether animals can, similarly to humans, categorize auditory objects based on an abstract rule in combining their physical features. We recorded local-field potentials from the dura above the primary auditory cortex in urethane-anesthetized rats presented with sound series occasionally violating a rule (e.g., "the higher the frequency, the weaker the intensity"). In a separate control condition, the same frequency and intensity levels were applied in the sound objects, but they obeyed no rule. Responses found selectively to the violations of the rule suggest that an abstract rule was represented in the rat brain, enabling auditory categorization.

Cognitive NeuroscienceSpeech recognitionMismatch negativityExperimental and Cognitive PsychologyLocal field potentialAuditory cortex050105 experimental psychology03 medical and health sciences0302 clinical medicineDevelopmental NeuroscienceEvent-related potential0501 psychology and cognitive sciences10. No inequalityBiological PsychiatryCommunicationCategorical perceptionEndocrine and Autonomic Systemsbusiness.industryGeneral Neuroscience05 social sciencesIntensity (physics)ElectrophysiologyNeuropsychology and Physiological PsychologyNeurologyCategorizationbusinessPsychology030217 neurology & neurosurgeryPsychophysiology
researchProduct