Search results for "processing"

showing 10 items of 8572 documents

Fully automatic face recognition system using a combined audio-visual approach

2005

This paper presents a novel audio and video information fusion approach that greatly improves automatic recognition of people in video sequences. To that end, audio and video information is first used independently to obtain confidence values that indicate the likelihood that a specific person appears in a video shot. Finally, a post-classifier is applied to fuse audio and visual confidence values. The system has been tested on several news sequences and the results indicate that a significant improvement in the recognition rate can be achieved when both modalities are used together.

Audio miningDynamic time warpingModalitiesComputer sciencebusiness.industryShot (filmmaking)Speech recognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONVideo sequenceFacial recognition systemVideo trackingSignal ProcessingFuse (electrical)Computer visionArtificial intelligenceElectrical and Electronic EngineeringbusinessIEE Proceedings - Vision, Image, and Signal Processing
researchProduct

2015

Visuo-auditory sensory substitution systems are augmented reality devices that translate a video stream into an audio stream in order to help the blind in daily tasks requiring visuo-spatial information. In this work, we present both a new mobile device and a transcoding method specifically designed to sonify moving objects. Frame differencing is used to extract spatial features from the video stream and two-dimensional spatial information is converted into audio cues using pitch, interaural time difference and interaural level difference. Using numerical methods, we attempt to reconstruct visuo-spatial information based on audio signals generated from various video stimuli. We show that de…

Audio signalComputer Networks and Communicationsbusiness.industryComputer scienceSpeech recognitionMotion detectionTranscodingAudio signal flowVideo processingcomputer.software_genreSensory substitutionArtificial IntelligenceHardware and ArchitectureSonificationComputer visionArtificial intelligencebusinessAudio signal processingcomputerSoftwareInformation SystemsFrontiers in ICT
researchProduct

Steered Response Power Localization of Acoustic Passband Signals

2017

The vast majority of localization approaches using phase transform (PHAT) consider that the sources of interest are wideband low-pass sources. While this may be the usual case for common audio signals such as speech, PHAT methods are affected negatively by modulation artifacts when the sources to be localized are passband signals. In these cases, steered response power PHAT localization becomes less robust. This letter analyzes the form of generalized cross-correlation functions with PHAT when passband acoustic signals are considered, proposing approaches for increasing the localization performance through the mitigation of these negative effects.

Audio signalComputer scienceApplied MathematicsSpeech recognitionAcousticsBandwidth (signal processing)020206 networking & telecommunications02 engineering and technology030507 speech-language pathology & audiology03 medical and health sciencesModulationSignal Processing0202 electrical engineering electronic engineering information engineeringElectrical and Electronic EngineeringWideband0305 other medical sciencePassbandIEEE Signal Processing Letters
researchProduct

On the Design of Probe Signals in Wireless Acoustic Sensor Networks Self-Positioning Algorithms

2018

A wireless acoustic sensor network comprises a distributed group of devices equipped with audio transducers. Typically, these devices can interoperate with each other using wireless links and perform collaborative audio signal processing. Ranging and self-positioning of the network nodes are examples of tasks that can be carried out collaboratively using acoustic signals. However, the environmental conditions can distort the emitted signals and complicate the ranging process. In this context, the selection of proper acoustic signals can facilitate the attainment of this goal and improve the localization accuracy. This letter deals with the design and evaluation of acoustic probe signals all…

Audio signalComputer sciencebusiness.industryApplied Mathematics020208 electrical & electronic engineeringReal-time computingBandwidth (signal processing)020206 networking & telecommunicationsRanging02 engineering and technologycomputer.software_genreTransducerSignal Processing0202 electrical engineering electronic engineering information engineeringChirpWirelessElectrical and Electronic EngineeringAudio signal processingbusinessFrequency modulationcomputerIEEE Signal Processing Letters
researchProduct

On the feasibility of personal audio systems over a network of distributed loudspeakers

2018

Los sistemas de reproducción de audio personal se ocupan de la creación de zonas sonoras personales dentro de una habitación sin necesidad de utilizar auriculares. Estos sistemas utilizan un conjunto de altavoces y diseñan los filtros necesarios en cada altavoz con el fin de que la señal de audio deseada llegue a cada persona en la sala lo más libre de interferencias posible. Existen propuestas muy interesantes en la literatura que hacen uso de arrays circulares o lineales, pero en este trabajo estudiamos el problema considerando una red de altavoces distribuidos controlados por un conjunto de nodos acústicos, que pueden intercambiar información a través de una red. Enunciamos el modelo de …

Audio signalbusiness.product_category:CIENCIAS TECNOLÓGICAS [UNESCO]MicrophoneComputer scienceAcoustics020206 networking & telecommunications02 engineering and technologypersonal audio systemsUNESCO::CIENCIAS TECNOLÓGICASGeneralLiterature_MISCELLANEOUSSignal-to-noise ratioSound reinforcement system0202 electrical engineering electronic engineering information engineeringElectronic engineering020201 artificial intelligence & image processingLoudspeakerDirectional soundwireless acoustic sensor networksbusinessHeadphones
researchProduct

Both attention and prediction are necessary for adaptive neuronal tuning in sensory processing

2014

International audience; The brain as a proactive system processes sensory information under the top-down influence of attention and prediction. However, the relation between attention and prediction remains undetermined given the conflation of these two mechanisms in the literature. To evaluate whether attention and prediction are dependent of each other, and if so, how these two top-down mechanisms may interact in sensory processing, we orthogonally manipulated attention and prediction in a target detection task. Participants were instructed to pay attention to one of two interleaved stimulus streams of predictable/unpredictable tone frequency. We found that attention and prediction intera…

Auditory areaSensory systemElectroencephalographyStimulus (physiology)event-related potentials050105 experimental psychologylcsh:RC321-57103 medical and health sciencesBehavioral Neuroscience[SCCO]Cognitive science0302 clinical medicineEvent-related potentialNeuronal tuningmedicine0501 psychology and cognitive sciencesOriginal Research ArticleElectroencephalography (EEG)tarkkaavaisuussensory processinglcsh:Neurosciences. Biological psychiatry. NeuropsychiatryDipole sourceBiological Psychiatryta515medicine.diagnostic_test[SCCO.NEUR]Cognitive science/Neuroscience05 social sciencesCorrectionpredictionConflationattentionPsychiatry and Mental healthNeuropsychology and Physiological PsychologyNeurologyevent-related potentials (ERPs)PsychologyNeuroscience030217 neurology & neurosurgeryelectroencephalographyNeuroscienceFrontiers in human neuroscience
researchProduct

Comprehensive auditory discrimination profiles recorded with a fast parametric musical multi-feature mismatch negativity paradigm

2016

Abstract Objective Mismatch negativity (MMN), a component of the auditory event-related potential (ERP) in response to auditory-expectancy violation, is sensitive to central auditory processing deficits associated with several clinical conditions and to auditory skills deriving from musical expertise. This sensitivity is more evident for stimuli integrated in complex sound contexts. This study tested whether increasing magnitudes of deviation (levels) entail increasing MMN amplitude (or decreasing latency), aiming to create a balanced version of the musical multi-feature paradigm towards measurement of extensive auditory discrimination profiles in auditory expertise or deficits. Methods Usi…

Auditory perceptionAdultMalemedicine.medical_specialtyCentral auditory processingcentral auditory processingMismatch negativityContext (language use)AudiologyEvent-related potential (ERP)behavioral disciplines and activitiesta3112050105 experimental psychologyDiscrimination Learning03 medical and health sciences0302 clinical medicineRhythmEvent-related potentialPhysiology (medical)medicineHumans0501 psychology and cognitive sciencesDiscrimination learning10. No inequalitysound discriminationCommunicationbusiness.industrySensory memory05 social sciencesElectroencephalographyevent-related potential (ERP)mismatch negativity (MMN)Sensory SystemsNeurologyAcoustic StimulationSound discriminationAuditory PerceptionEvoked Potentials AuditoryFemaleNeurology (clinical)businessPsychologyMismatch negativity (MMN)Timbre030217 neurology & neurosurgeryMusicClinical Neurophysiology
researchProduct

Neural Processing of Congruent and Incongruent Audiovisual Speech in School-Age Children and Adults

2017

Auditory perceptionLinguistics and Languagemedicine.medical_specialtyVisual perceptionmedicine.diagnostic_testmedia_common.quotation_subject05 social sciencesAudiologyElectroencephalographyAudiovisual Aids050105 experimental psychologyLanguage and LinguisticsEducation03 medical and health sciences0302 clinical medicinePerceptionNeural processingTask analysismedicine0501 psychology and cognitive sciencesMcGurk effectPsychology030217 neurology & neurosurgerymedia_commonLanguage Learning
researchProduct

Event-related brain potentials to change in the frequency and temporal structure of sounds in typically developing 5-6-year-old children.

2015

The brain's ability to recognize different acoustic cues (e.g., frequency changes in rapid temporal succession) is important for speech perception and thus for successful language development. Here we report on distinct event-related potentials (ERPs) in 5-6-year-old children recorded in a passive oddball paradigm to repeated tone pair stimuli with a frequency change in the second tone in the pair, replicating earlier findings. An occasional insertion of a third tone within the tone pair generated a more merged pattern, which has not been reported previously in 5-6-year-old children. Both types of deviations elicited pre-attentive discriminative mismatch negativity (MMN) and late discrimina…

Auditory perceptionMalemedicine.medical_specialtySpeech perceptionlate discriminative negativity (LDN)Mismatch negativityContingent Negative VariationElectroencephalographyAudiologyta3112behavioral disciplines and activitiesBrain mappingTone (musical instrument)Physiology (medical)medicineReaction TimeHumansEEGChildOddball paradigmta515auditory processingCommunicationAnalysis of VarianceBrain Mappingmedicine.diagnostic_testbusiness.industryGeneral NeuroscienceBrainElectroencephalographyT-complexmismatch negativity (MMN)Contingent negative variationNeuropsychology and Physiological PsychologySoundAcoustic StimulationChild PreschoolAuditory PerceptionEvoked Potentials AuditoryFemalePsychologybusinessN250psychological phenomena and processesInternational journal of psychophysiology : official journal of the International Organization of Psychophysiology
researchProduct

How is Visual Recognition Entrained by Auditory Background Rhythms?

2014

AbstractRecent studies have reported that the oscillations of auditory attention entrained by a background rhythmic sequence can influence performance in visual recognition tasks. We have designed an experimental paradigm in which a visual item (either a bisyllabic word or a familiar face) is displayed on screen in two consecutive parts while a musical rhythm is played in the background. Depending on the timing conditions, the first or the second part of the item could be presented either in-synchrony or out-of-synchrony with the beats of the auditory rhythm. In a first series of experiments, participants performed a lexical decision task on bisyllabic 5-letter strings. Results show that wh…

Auditory rhythmSpeech recognitionmedia_common.quotation_subjectPoison controlEntrainmentRhythmPerceptionWord recognitionLexical decision taskGeneral Materials ScienceVisual WordSyllabic verseVisual recognitionLevels-of-processing effectPsychologymedia_commonProcedia - Social and Behavioral Sciences
researchProduct