0000000000422358
AUTHOR
Jose J. Lopez
Stereo to Wave-Field Synthesis music up-mixing: An objective and subjective evaluation
Sound source separation techniques are known to be very useful in many applications. High fidelity and audio oriented applications are a challenging issue in this topic, however, existing algorithms are far from performing with such a high quality. In this paper, a subjective and objective evaluation are carried out for several algorithms designed for dealing with stereo music mixtures. The performance of these algorithms applied to acoustic scene resynthesis in a Wave Field Synthesis system is discussed.
An Open-set Recognition and Few-Shot Learning Dataset for Audio Event Classification in Domestic Environments
The problem of training with a small set of positive samples is known as few-shot learning (FSL). It is widely known that traditional deep learning (DL) algorithms usually show very good performance when trained with large datasets. However, in many applications, it is not possible to obtain such a high number of samples. In the image domain, typical FSL applications include those related to face recognition. In the audio domain, music fraud or speaker recognition can be clearly benefited from FSL methods. This paper deals with the application of FSL to the detection of specific and intentional acoustic events given by different types of sound alarms, such as door bells or fire alarms, usin…
A Bayesian direction-of-arrival model for an undetermined number of sources using a two-microphone array.
Sound source localization using a two-microphone array is an active area of research, with considerable potential for use with video conferencing, mobile devices, and robotics. Based on the observed time-differences of arrival between sound signals, a probability distribution of the location of the sources is considered to estimate the actual source positions. However, these algorithms assume a given number of sound sources. This paper describes an updated research account on the solution presented in Escolano et al. [J. Acoust. Am. Soc. 132(3), 1257-1260 (2012)], where nested sampling is used to explore a probability distribution of the source position using a Laplacian mixture model, whic…
A Pedagogical Software for the Analysis of Loudspeaker Systems
In this paper, a pedagogical software for the design and analysis of loudspeaker systems is presented, with emphasis on training students in the interaction between system parameters. Loudspeakers are complex electromechanical system, whose behavior is neither intuitive nor easy to understand by inexperienced students. Although commercial simulation software provides computation of the equations governing loudspeaker systems, they are mainly focused on the final product and are intended for experienced audio engineers. The proposed software was intentionally designed for education and training purposes. This software allows students to investigate loudspeaker systems that involve complex in…
Improving Isolation of Blindly Separated Sources Using Time-Frequency Masking
A refinement technique based on time-frequency masking is proposed to improve source isolation in blind audio source separation algorithms. The refinement technique uses an energy-normalized source-to-interference ratio in order to identify and eliminate interfering energy from the extracted sources. Some examples using this refinement method with different separation algorithms are discussed. The results show that source isolation can be significantly enhanced with negligible degradation of the separated sources.