Search results for "Computer Science::Sound"
showing 10 items of 39 documents
Comparison of perceptually uniform quantisation with average error minimisation in image transform coding
1999
An alternative transform coder design criterion based on restricting the maximum perceptual error of each coefficient is proposed. This perceptually uniform quantisation of the transform domain ensures that the perceptual error will be below a certain limit regardless of the particular input image. The results show that the proposed criterion improves the subjective quality of the conventional average error criterion even if it is weighted with the same perceptual metric.
An integrated architecture for speech-input multi-target machine translation
2007
The aim of this work is to show the ability of finite-state transducers to simultaneously translate speech into multiple languages. Our proposal deals with an extension of stochastic finite-state transducers that can produce more than one output at the same time. These kind of devices offer great versatility for the integration with other finite-state devices such as acoustic models in order to produce a speech translation system. This proposal has been evaluated in a practical situation, and its results have been compared with those obtained using a standard mono-target speech transducer.
Speech-input multi-target machine translation
2007
In order to simultaneously translate speech into multiple languages an extension of stochastic finite-state transducers is proposed. In this approach the speech translation model consists of a single network where acoustic models (in the input) and the multilingual model (in the output) are embedded. The multi-target model has been evaluated in a practical situation, and the results have been compared with those obtained using several mono-target models. Experimental results show that the multi-target one requires less amount of memory. In addition, a single decoding is enough to get the speech translated into multiple languages.
Knots, Music and DNA
2020
Musical gestures connect the symbolic layer of the score to the physical layer of sound. I focus here on the mathematical theory of musical gestures, and I propose its generalization to include braids and knots. In this way, it is possible to extend the formalism to cover more case studies, especially regarding conducting gestures. Moreover, recent developments involving comparisons and similarities between gestures of orchestral musicians can be contextualized in the frame of braided monoidal categories. Because knots and braids can be applied to both music and biology (they apply to knotted proteins, for example), I end the article with a new musical rendition of DNA.
Implicit Wiener Filtering for Speech Enhancement In Non-Stationary Noise
2021
Speech quality is degraded in the presence of background noise, which reduces the quality of experience (QoE) of the end-user and therefore motivates the usage of speech enhancement algorithms. A large number of approaches have been proposed in this context. However most of them have focused on the case where the noise is stationary, an assumption that seldom holds in practice. For instance, in mobile telephony, noise sources with a marked non-stationary spectral signature include vehicles, machines, and other speakers to name a few. On the other hand, the usage of frequency-domain information in existing algorithms for speech enhancement in non-stationary noise environments can be made mor…
Plasma Effecton Pulse Parameters of Sound Wave
2019
This article presents the experimental results of the effect of gas discharge plasma on an acoustic wave. It is shown that after a certain intensity of the acoustic wave, the presence of plasma leads to a decrease in the Q -factor of the acoustic resonator, a shift in the acoustic resonance in the gas discharge tube at a lower frequency, and after switching off the acoustics, low-frequency gas oscillations with a frequency of 2 Hz are generated in the tube.
Generation of stimulus features for analysis of FMRI during natural auditory experiences
2014
In contrast to block and event-related designs for fMRI experiments, it becomes much more difficult to extract events of interest in the complex continuous stimulus for finding corresponding blood-oxygen-level dependent (BOLD) responses. Recently, in a free music listening fMRI experiment, acoustic features of the naturalistic music stimulus were first extracted, and then principal component analysis (PCA) was applied to select the features of interest acting as the stimulus sequences. For feature generation, kernel PCA has shown its superiority over PCA in various applications, since it can implicitly exploit nonlinear relationship among features and such relationship seems to exist genera…
Optimizing auditory images and distance metrics for self‐organizing timbre maps*
1996
Abstract The effect of using different auditory images and distance metrics on the final configuration of a self‐organized timbre map is examined by comparing distance matrices, obtained from simulations, with a similarity rating matrix, obtained using the same set of stimuli as in the simulations. Gradient images, which are intended to represent idealizations of physiological gradient maps in the auditory pathway, are constructed. The optimal auditory image and distance metric, with respect to the similarity rating data, are searched using the gradient method.
KernelICA : Kernel Independent Component Analysis
2021
The kernel independent component analysis (kernel ICA) method introduced by Bach and Jordan (2003) . The incomplete Cholesky decomposition used in kernel ICA is provided as separate function. nonPeerReviewed
Non-linear RLS-based algorithm for pattern classification
2006
A new non-linear recursive least squares (RLS) algorithm is presented in the context of pattern classification problems. The algorithm incorporates the non-linearity of the filter's output in the updating rules of the classical RLS algorithm. The proposed method yields lower stationary error levels when compared to the standard LMS and RLS algorithms in a classical application of pattern classification, such as the channel equalization problem.