6533b859fe1ef96bd12b7897

RESEARCH PRODUCT

Nonnegative signal factorization with learnt instrument models for sound source separation in close-microphone recordings

Pedro Vera-candeasMaximo CobosFrancisco J. Rodriguez-serranoJ.j. Carabias-orti

subject

ReverberationInstruments musicalsComputer sciencebusiness.industryMicrophoneMúsica -- InformàticaSignalNon-negative matrix factorizationSet (abstract data type)FactorizationInterference (communication)Source separationComputer visionArtificial intelligenceMicròfonsbusiness

description

Close-microphone techniques are extensively employed in many live music recordings, allowing for interference rejection and reducing the amount of reverberation in the resulting instrument tracks. However, despite the use of directional microphones, the recorded tracks are not completely free from source interference, a problem which is commonly known as microphone leakage. While source separation methods are potentially a solution to this problem, few approaches take into account the huge amount of prior information available in this scenario. In fact, besides the special properties of close-microphone tracks, the knowledge on the number and type of instruments making up the mixture can also be successfully exploited for improved separation performance. In this paper, a nonnegative matrix factorization (NMF) method making use of all the above information is proposed. To this end, a set of instrument models are learnt from a training database and incorporated into a multichannel extension of the NMF algorithm. Several options to initialize the algorithm are suggested, exploring their performance in multiple music tracks and comparing the results to other state-of-the-art approaches. This work was supported by the Andalusian Business, Science and Innovation Council under project P2010- TIC-6762, (FEDER) the Spanish Ministry of Economy and Competitiveness under the projects TEC2012-38142-C04-03 and TEC2012-37945-C02-02

https://doi.org/10.1186/1687-6180-2013-184