Search results for "Audi"

showing 10 items of 3302 documents

Fully automatic face recognition system using a combined audio-visual approach

2005

This paper presents a novel audio and video information fusion approach that greatly improves automatic recognition of people in video sequences. To that end, audio and video information is first used independently to obtain confidence values that indicate the likelihood that a specific person appears in a video shot. Finally, a post-classifier is applied to fuse audio and visual confidence values. The system has been tested on several news sequences and the results indicate that a significant improvement in the recognition rate can be achieved when both modalities are used together.

Audio miningDynamic time warpingModalitiesComputer sciencebusiness.industryShot (filmmaking)Speech recognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONVideo sequenceFacial recognition systemVideo trackingSignal ProcessingFuse (electrical)Computer visionArtificial intelligenceElectrical and Electronic EngineeringbusinessIEE Proceedings - Vision, Image, and Signal Processing
researchProduct

Adaptive Mid-Term Representations for Robust Audio Event Classification

2018

Low-level audio features are commonly used in many audio analysis tasks, such as audio scene classification or acoustic event detection. Due to the variable length of audio signals, it is a common approach to create fixed-length feature vectors consisting of a set of statistics that summarize the temporal variability of such short-term features. To avoid the loss of temporal information, the audio event can be divided into a set of mid-term segments or texture windows. However, such an approach requires to estimate accurately the onset and offset times of the audio events in order to obtain a robust mid-term statistical description of their temporal evolution. This paper proposes the use of…

Audio signalAcoustics and UltrasonicsComputer sciencebusiness.industryFeature vectorPattern recognition01 natural sciences030507 speech-language pathology & audiology03 medical and health sciencesComputational MathematicsNonlinear systemFraming (construction)Acoustic event detection0103 physical sciencesAudio analyzerComputer Science (miscellaneous)SegmentationArtificial intelligenceElectrical and Electronic Engineering0305 other medical sciencebusiness010301 acousticsTemporal informationIEEE/ACM Transactions on Audio, Speech, and Language Processing
researchProduct

2015

Visuo-auditory sensory substitution systems are augmented reality devices that translate a video stream into an audio stream in order to help the blind in daily tasks requiring visuo-spatial information. In this work, we present both a new mobile device and a transcoding method specifically designed to sonify moving objects. Frame differencing is used to extract spatial features from the video stream and two-dimensional spatial information is converted into audio cues using pitch, interaural time difference and interaural level difference. Using numerical methods, we attempt to reconstruct visuo-spatial information based on audio signals generated from various video stimuli. We show that de…

Audio signalComputer Networks and Communicationsbusiness.industryComputer scienceSpeech recognitionMotion detectionTranscodingAudio signal flowVideo processingcomputer.software_genreSensory substitutionArtificial IntelligenceHardware and ArchitectureSonificationComputer visionArtificial intelligencebusinessAudio signal processingcomputerSoftwareInformation SystemsFrontiers in ICT
researchProduct

Steered Response Power Localization of Acoustic Passband Signals

2017

The vast majority of localization approaches using phase transform (PHAT) consider that the sources of interest are wideband low-pass sources. While this may be the usual case for common audio signals such as speech, PHAT methods are affected negatively by modulation artifacts when the sources to be localized are passband signals. In these cases, steered response power PHAT localization becomes less robust. This letter analyzes the form of generalized cross-correlation functions with PHAT when passband acoustic signals are considered, proposing approaches for increasing the localization performance through the mitigation of these negative effects.

Audio signalComputer scienceApplied MathematicsSpeech recognitionAcousticsBandwidth (signal processing)020206 networking & telecommunications02 engineering and technology030507 speech-language pathology & audiology03 medical and health sciencesModulationSignal Processing0202 electrical engineering electronic engineering information engineeringElectrical and Electronic EngineeringWideband0305 other medical sciencePassbandIEEE Signal Processing Letters
researchProduct

Enabling Real-Time Computation of Psycho-Acoustic Parameters in Acoustic Sensors Using Convolutional Neural Networks

2020

Sensor networks have become an extremely useful tool for monitoring and analysing many aspects of our daily lives. Noise pollution levels are very important today, especially in cities where the number of inhabitants and disturbing sounds are constantly increasing. Psycho-acoustic parameters are a fundamental tool for assessing the degree of discomfort produced by different sounds and, combined with wireless acoustic sensor networks (WASNs), could enable, for example, the efficient implementation of acoustic discomfort maps within smart cities. However, the continuous monitoring of psycho-acoustic parameters to create time-dependent discomfort maps requires a high computational demand that …

Audio signalComputer scienceNoise pollutionbusiness.industryComputation010401 analytical chemistryReal-time computing01 natural sciencesConvolutional neural network0104 chemical sciencesWirelessElectrical and Electronic EngineeringbusinessInstrumentationWireless sensor networkIEEE Sensors Journal
researchProduct

On the Design of Probe Signals in Wireless Acoustic Sensor Networks Self-Positioning Algorithms

2018

A wireless acoustic sensor network comprises a distributed group of devices equipped with audio transducers. Typically, these devices can interoperate with each other using wireless links and perform collaborative audio signal processing. Ranging and self-positioning of the network nodes are examples of tasks that can be carried out collaboratively using acoustic signals. However, the environmental conditions can distort the emitted signals and complicate the ranging process. In this context, the selection of proper acoustic signals can facilitate the attainment of this goal and improve the localization accuracy. This letter deals with the design and evaluation of acoustic probe signals all…

Audio signalComputer sciencebusiness.industryApplied Mathematics020208 electrical & electronic engineeringReal-time computingBandwidth (signal processing)020206 networking & telecommunicationsRanging02 engineering and technologycomputer.software_genreTransducerSignal Processing0202 electrical engineering electronic engineering information engineeringChirpWirelessElectrical and Electronic EngineeringAudio signal processingbusinessFrequency modulationcomputerIEEE Signal Processing Letters
researchProduct

A matlab toolbox for music information retrieval

2008

We present MIRToolbox, an integrated set of functions written in Matlab, dedicated to the extraction from audio files of musical features related, among others, to timbre, tonality, rhythm or form. The objective is to offer a state of the art of computational approaches in the area of Music Information Retrieval (MIR). The design is based on a modular framework: the different algorithms are decomposed into stages, formalized using a minimal set of elementary mechanisms, and integrating different variants proposed by alternative approaches — including new strategies we have developed —, that users can select and parametrize. These functions can adapt to a large area of objects as input.

Audio signalInformation retrievalComputer sciencebusiness.industryModular designSet (abstract data type)Music information retrievalState (computer science)TonalitybusinessMATLABcomputerTimbrecomputer.programming_language
researchProduct

Doppler Estimation and Correction for JANUS Underwater Communications

2020

In recent years, underwater communications have seen a growing interest pushed by marine research, oceanography, marine commercial operations, offshore oil industry and defense applications. Generally, underwater communications employ audio signals which can propagate relatively far but are also significantly affected by Doppler distortions. In fact, physical properties of the water and spatial changes due to tides, currents and waves can cause channel variations or unwanted movements of the transmitter or receiver. This study shows how to compensate for the Doppler effect in transmission employing the JANUS standard, a popular modulation scheme for underwater communication. Differently for…

Audio signalSettore ING-INF/03 - TelecomunicazioniComputer scienceAcousticsTransmitterambiguity function Doppler JANUS Underwater Watermarksymbols.namesakeTransmission (telecommunications)ModulationDistortionModulation (music)symbolsDoppler effectUnderwater acoustic communicationCommunication channelGLOBECOM 2020 - 2020 IEEE Global Communications Conference
researchProduct

Dance to your own drum: Identification of musical genre and individual dancer from motion capture using machine learning

2020

Machine learning has been used to accurately classify musical genre using features derived from audio signals. Musical genre, as well as lower-level audio features of music, have also been shown to...

Audio signalVisual Arts and Performing ArtsDanceInformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.HCI)Computer sciencebusiness.industry05 social sciencesComputingMilieux_PERSONALCOMPUTING06 humanities and the artsDrumMusicalMachine learningcomputer.software_genreMotion capture050105 experimental psychology060404 musicIdentification (information)Embodied cognition0501 psychology and cognitive sciencesArtificial intelligencebusinesscomputer0604 artsMusicJournal of New Music Research
researchProduct

On the feasibility of personal audio systems over a network of distributed loudspeakers

2018

Los sistemas de reproducción de audio personal se ocupan de la creación de zonas sonoras personales dentro de una habitación sin necesidad de utilizar auriculares. Estos sistemas utilizan un conjunto de altavoces y diseñan los filtros necesarios en cada altavoz con el fin de que la señal de audio deseada llegue a cada persona en la sala lo más libre de interferencias posible. Existen propuestas muy interesantes en la literatura que hacen uso de arrays circulares o lineales, pero en este trabajo estudiamos el problema considerando una red de altavoces distribuidos controlados por un conjunto de nodos acústicos, que pueden intercambiar información a través de una red. Enunciamos el modelo de …

Audio signalbusiness.product_category:CIENCIAS TECNOLÓGICAS [UNESCO]MicrophoneComputer scienceAcoustics020206 networking & telecommunications02 engineering and technologypersonal audio systemsUNESCO::CIENCIAS TECNOLÓGICASGeneralLiterature_MISCELLANEOUSSignal-to-noise ratioSound reinforcement system0202 electrical engineering electronic engineering information engineeringElectronic engineering020201 artificial intelligence & image processingLoudspeakerDirectional soundwireless acoustic sensor networksbusinessHeadphones
researchProduct