0000000000422358

AUTHOR

Jose J. Lopez

showing 5 related works from this author

Stereo to Wave-Field Synthesis music up-mixing: An objective and subjective evaluation

2008

Sound source separation techniques are known to be very useful in many applications. High fidelity and audio oriented applications are a challenging issue in this topic, however, existing algorithms are far from performing with such a high quality. In this paper, a subjective and objective evaluation are carried out for several algorithms designed for dealing with stereo music mixtures. The performance of these algorithms applied to acoustic scene resynthesis in a Wave Field Synthesis system is discussed.

EngineeringWave field synthesisbusiness.industrySpeech recognitionSound source separationmedia_common.quotation_subjectField (computer science)High fidelitySource separationQuality (business)Computer visionArtificial intelligenceObjective evaluationbusinessMixing (physics)media_common2008 3rd International Symposium on Communications, Control and Signal Processing
researchProduct

An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments

2020

The problem of training with a small set of positive samples is known as few-shot learning (FSL). It is widely known that traditional deep learning (DL) algorithms usually show very good performance when trained with large datasets. However, in many applications, it is not possible to obtain such a high number of samples. In the image domain, typical FSL applications include those related to face recognition. In the audio domain, music fraud or speaker recognition can be clearly benefited from FSL methods. This paper deals with the application of FSL to the detection of specific and intentional acoustic events given by different types of sound alarms, such as door bells or fire alarms, usin…

FOS: Computer and information sciencesComputer Science - Machine LearningSound (cs.SD)sound processingaudio datasetmachine listeningUNESCO::CIENCIAS TECNOLÓGICASComputer Science - SoundMachine Learning (cs.LG)classificationArtificial IntelligenceAudio and Speech Processing (eess.AS)Signal ProcessingFOS: Electrical engineering electronic engineering information engineeringfew-shot learningopen-set recognitionComputer Vision and Pattern RecognitionSoftwareElectrical Engineering and Systems Science - Audio and Speech Processing
researchProduct

A Bayesian direction-of-arrival model for an undetermined number of sources using a two-microphone array.

2014

Sound source localization using a two-microphone array is an active area of research, with considerable potential for use with video conferencing, mobile devices, and robotics. Based on the observed time-differences of arrival between sound signals, a probability distribution of the location of the sources is considered to estimate the actual source positions. However, these algorithms assume a given number of sound sources. This paper describes an updated research account on the solution presented in Escolano et al. [J. Acoust. Am. Soc. 132(3), 1257-1260 (2012)], where nested sampling is used to explore a probability distribution of the source position using a Laplacian mixture model, whic…

Microphone arrayAcoustics and UltrasonicsComputer scienceAcousticsBayesian probabilityDirection of arrivalSampling (statistics)DOAAcoustic source localizationMicrophone arraySpeech processingMixture modelBayesianSound source localizationArts and Humanities (miscellaneous)TEORIA DE LA SEÑAL Y COMUNICACIONESProbability distributionAlgorithmNested sampling algorithmThe Journal of the Acoustical Society of America
researchProduct

A Pedagogical Software for the Analysis of Loudspeaker Systems

2009

In this paper, a pedagogical software for the design and analysis of loudspeaker systems is presented, with emphasis on training students in the interaction between system parameters. Loudspeakers are complex electromechanical system, whose behavior is neither intuitive nor easy to understand by inexperienced students. Although commercial simulation software provides computation of the equations governing loudspeaker systems, they are mainly focused on the final product and are intended for experienced audio engineers. The proposed software was intentionally designed for education and training purposes. This software allows students to investigate loudspeaker systems that involve complex in…

Interactive computingCommercial softwareEngineeringDiaphragm (acoustics)business.industryComputationcomputer.software_genreEducationSimulation softwareSoftwareHuman–computer interactionLoudspeakerElectrical and Electronic EngineeringbusinesscomputerSimulationGraphical user interfaceIEEE Transactions on Education
researchProduct

Improving Isolation of Blindly Separated Sources Using Time-Frequency Masking

2008

A refinement technique based on time-frequency masking is proposed to improve source isolation in blind audio source separation algorithms. The refinement technique uses an energy-normalized source-to-interference ratio in order to identify and eliminate interfering energy from the extracted sources. Some examples using this refinement method with different separation algorithms are discussed. The results show that source isolation can be significantly enhanced with negligible degradation of the separated sources.

Masking (art)business.industryComputer scienceApplied MathematicsSpeech recognitionPattern recognitioncomputer.software_genreBlind signal separationIndependent component analysisTime–frequency analysisSignal ProcessingSource separationArtificial intelligenceIsolation (database systems)Electrical and Electronic EngineeringAudio signal processingbusinesscomputerEnergy (signal processing)IEEE Signal Processing Letters
researchProduct