Search results for "processing"
showing 10 items of 8572 documents
Fully automatic face recognition system using a combined audio-visual approach
2005
This paper presents a novel audio and video information fusion approach that greatly improves automatic recognition of people in video sequences. To that end, audio and video information is first used independently to obtain confidence values that indicate the likelihood that a specific person appears in a video shot. Finally, a post-classifier is applied to fuse audio and visual confidence values. The system has been tested on several news sequences and the results indicate that a significant improvement in the recognition rate can be achieved when both modalities are used together.
2015
Visuo-auditory sensory substitution systems are augmented reality devices that translate a video stream into an audio stream in order to help the blind in daily tasks requiring visuo-spatial information. In this work, we present both a new mobile device and a transcoding method specifically designed to sonify moving objects. Frame differencing is used to extract spatial features from the video stream and two-dimensional spatial information is converted into audio cues using pitch, interaural time difference and interaural level difference. Using numerical methods, we attempt to reconstruct visuo-spatial information based on audio signals generated from various video stimuli. We show that de…
Steered Response Power Localization of Acoustic Passband Signals
2017
The vast majority of localization approaches using phase transform (PHAT) consider that the sources of interest are wideband low-pass sources. While this may be the usual case for common audio signals such as speech, PHAT methods are affected negatively by modulation artifacts when the sources to be localized are passband signals. In these cases, steered response power PHAT localization becomes less robust. This letter analyzes the form of generalized cross-correlation functions with PHAT when passband acoustic signals are considered, proposing approaches for increasing the localization performance through the mitigation of these negative effects.
On the Design of Probe Signals in Wireless Acoustic Sensor Networks Self-Positioning Algorithms
2018
A wireless acoustic sensor network comprises a distributed group of devices equipped with audio transducers. Typically, these devices can interoperate with each other using wireless links and perform collaborative audio signal processing. Ranging and self-positioning of the network nodes are examples of tasks that can be carried out collaboratively using acoustic signals. However, the environmental conditions can distort the emitted signals and complicate the ranging process. In this context, the selection of proper acoustic signals can facilitate the attainment of this goal and improve the localization accuracy. This letter deals with the design and evaluation of acoustic probe signals all…
On the feasibility of personal audio systems over a network of distributed loudspeakers
2018
Los sistemas de reproducción de audio personal se ocupan de la creación de zonas sonoras personales dentro de una habitación sin necesidad de utilizar auriculares. Estos sistemas utilizan un conjunto de altavoces y diseñan los filtros necesarios en cada altavoz con el fin de que la señal de audio deseada llegue a cada persona en la sala lo más libre de interferencias posible. Existen propuestas muy interesantes en la literatura que hacen uso de arrays circulares o lineales, pero en este trabajo estudiamos el problema considerando una red de altavoces distribuidos controlados por un conjunto de nodos acústicos, que pueden intercambiar información a través de una red. Enunciamos el modelo de …
Both attention and prediction are necessary for adaptive neuronal tuning in sensory processing
2014
International audience; The brain as a proactive system processes sensory information under the top-down influence of attention and prediction. However, the relation between attention and prediction remains undetermined given the conflation of these two mechanisms in the literature. To evaluate whether attention and prediction are dependent of each other, and if so, how these two top-down mechanisms may interact in sensory processing, we orthogonally manipulated attention and prediction in a target detection task. Participants were instructed to pay attention to one of two interleaved stimulus streams of predictable/unpredictable tone frequency. We found that attention and prediction intera…
Comprehensive auditory discrimination profiles recorded with a fast parametric musical multi-feature mismatch negativity paradigm
2016
Abstract Objective Mismatch negativity (MMN), a component of the auditory event-related potential (ERP) in response to auditory-expectancy violation, is sensitive to central auditory processing deficits associated with several clinical conditions and to auditory skills deriving from musical expertise. This sensitivity is more evident for stimuli integrated in complex sound contexts. This study tested whether increasing magnitudes of deviation (levels) entail increasing MMN amplitude (or decreasing latency), aiming to create a balanced version of the musical multi-feature paradigm towards measurement of extensive auditory discrimination profiles in auditory expertise or deficits. Methods Usi…
Neural Processing of Congruent and Incongruent Audiovisual Speech in School-Age Children and Adults
2017
Event-related brain potentials to change in the frequency and temporal structure of sounds in typically developing 5-6-year-old children.
2015
The brain's ability to recognize different acoustic cues (e.g., frequency changes in rapid temporal succession) is important for speech perception and thus for successful language development. Here we report on distinct event-related potentials (ERPs) in 5-6-year-old children recorded in a passive oddball paradigm to repeated tone pair stimuli with a frequency change in the second tone in the pair, replicating earlier findings. An occasional insertion of a third tone within the tone pair generated a more merged pattern, which has not been reported previously in 5-6-year-old children. Both types of deviations elicited pre-attentive discriminative mismatch negativity (MMN) and late discrimina…
How is Visual Recognition Entrained by Auditory Background Rhythms?
2014
AbstractRecent studies have reported that the oscillations of auditory attention entrained by a background rhythmic sequence can influence performance in visual recognition tasks. We have designed an experimental paradigm in which a visual item (either a bisyllabic word or a familiar face) is displayed on screen in two consecutive parts while a musical rhythm is played in the background. Depending on the timing conditions, the first or the second part of the item could be presented either in-synchrony or out-of-synchrony with the beats of the auditory rhythm. In a first series of experiments, participants performed a lexical decision task on bisyllabic 5-letter strings. Results show that wh…