Computation of Psycho-Acoustic Annoyance Using Deep Neural Networks

6533b82cfe1ef96bd128f66c

RESEARCH PRODUCT

Computation of Psycho-Acoustic Annoyance Using Deep Neural Networks

Maximo Cobos Jesus Lopez-ballester Jaume Segura-garcia Adolfo Pastor-aparicio Santiago Felici-castell

subject

Computer science Computation subjective annoyance Context (language use)Annoyance 02 engineering and technology computer.software_genre 01 natural sciences Convolutional neural network lcsh:Technology Reduction (complexity)lcsh:Chemistry convolutional neural networks 0202 electrical engineering electronic engineering information engineering Wireless General Materials Science wireless acoustic sensor networks Instrumentation lcsh:QH301-705.5 Fluid Flow and Transfer Processes business.industry lcsh:T Process Chemistry and Technology 010401 analytical chemistry General Engineering Regression analysis lcsh:QC1-999 0104 chemical sciences Computer Science Applications psycho-acoustic parameters Transmission (telecommunications)lcsh:Biology (General)lcsh:QD1-999 lcsh:TA1-2040 020201 artificial intelligence & image processing Data mining business lcsh:Engineering (General). Civil engineering (General)Zwicker model computer lcsh:Physics

description

Psycho-acoustic parameters have been extensively used to evaluate the discomfort or pleasure produced by the sounds in our environment. In this context, wireless acoustic sensor networks (WASNs) can be an interesting solution for monitoring subjective annoyance in certain soundscapes, since they can be used to register the evolution of such parameters in time and space. Unfortunately, the calculation of the psycho-acoustic parameters involved in common annoyance models implies a significant computational cost, and makes difficult the acquisition and transmission of these parameters at the nodes. As a result, monitoring psycho-acoustic annoyance becomes an expensive and inefficient task. This paper proposes the use of a deep convolutional neural network (CNN) trained on a large urban sound dataset capable of efficiently predicting psycho-acoustic annoyance from raw audio signals continuously. We evaluate the proposed regression model and compare the resulting computation times with the ones obtained by the conventional direct calculation approach. The results confirm that the proposed model based on CNN achieves high precision in predicting psycho-acoustic annoyance, predicting annoyance values with an average quadratic error of around 3%. It also achieves a very significant reduction in processing time, which is up to 300 times faster than direct calculation, making CNN designed a clear exponent to work in IoT devices.

year	journal	country	edition	language
2019-08-02	Applied Sciences

10.3390/app9153136 http://dx.doi.org/10.3390/app9153136