Search results for "Computer Science::Sound"

showing 10 items of 39 documents

Comparison of perceptually uniform quantisation with average error minimisation in image transform coding

1999

An alternative transform coder design criterion based on restricting the maximum perceptual error of each coefficient is proposed. This perceptually uniform quantisation of the transform domain ensures that the perceptual error will be below a certain limit regardless of the particular input image. The results show that the proposed criterion improves the subjective quality of the conventional average error criterion even if it is weighted with the same perceptual metric.

Minimisation (psychology)Image codingbusiness.industryComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONPattern recognitionDomain (mathematical analysis)Image (mathematics)Computer Science::GraphicsComputer Science::SoundComputer Science::MultimediaMetric (mathematics)Limit (mathematics)Artificial intelligenceElectrical and Electronic EngineeringSubjective qualitybusinessTransform codingMathematicsElectronics Letters

researchProduct

An integrated architecture for speech-input multi-target machine translation

2007

The aim of this work is to show the ability of finite-state transducers to simultaneously translate speech into multiple languages. Our proposal deals with an extension of stochastic finite-state transducers that can produce more than one output at the same time. These kind of devices offer great versatility for the integration with other finite-state devices such as acoustic models in order to produce a speech translation system. This proposal has been evaluated in a practical situation, and its results have been compared with those obtained using a standard mono-target speech transducer.

Multi targetIntegrated architectureMachine translationComputer Science::SoundComputer scienceSpeech translationSpeech recognitionSpeech inputComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Extension (predicate logic)computer.software_genrecomputerHuman Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers on XX - NAACL '07

researchProduct

Speech-input multi-target machine translation

2007

In order to simultaneously translate speech into multiple languages an extension of stochastic finite-state transducers is proposed. In this approach the speech translation model consists of a single network where acoustic models (in the input) and the multilingual model (in the output) are embedded. The multi-target model has been evaluated in a practical situation, and the results have been compared with those obtained using several mono-target models. Experimental results show that the multi-target one requires less amount of memory. In addition, a single decoding is enough to get the speech translated into multiple languages.

Multi targetMachine translationComputer Science::SoundComputer scienceSpeech translationSpeech recognitionSpeech inputComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Extension (predicate logic)computer.software_genrecomputerDecoding methodsProceedings of the Second Workshop on Statistical Machine Translation - StatMT '07

researchProduct

Knots, Music and DNA

2020

Musical gestures connect the symbolic layer of the score to the physical layer of sound. I focus here on the mathematical theory of musical gestures, and I propose its generalization to include braids and knots. In this way, it is possible to extend the formalism to cover more case studies, especially regarding conducting gestures. Moreover, recent developments involving comparisons and similarities between gestures of orchestral musicians can be contextualized in the frame of braided monoidal categories. Because knots and braids can be applied to both music and biology (they apply to knotted proteins, for example), I end the article with a new musical rendition of DNA.

MusicalVisual artGeneral Mathematics (math.GM)Mathematics::Category TheoryBraidFOS: MathematicsCategory theory; Composition; Musical gesture; Synesthesia; Visual artAlgorithmic compositionMathematics - General MathematicsSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniSettore INF/01 - InformaticaMusic psychologyMusical gestureSettore MAT/04 - Matematiche ComplementariMathematics::Geometric TopologyLinguisticsMathematical theorySettore MAT/02 - AlgebraCategory theory Composition Musical gesture Synesthesia Visual art SettoreComputer Science::SoundThe SymbolicComputer musicMusicSynesthesiaGestureCategory theoryComposition

researchProduct

Implicit Wiener Filtering for Speech Enhancement In Non-Stationary Noise

2021

Speech quality is degraded in the presence of background noise, which reduces the quality of experience (QoE) of the end-user and therefore motivates the usage of speech enhancement algorithms. A large number of approaches have been proposed in this context. However most of them have focused on the case where the noise is stationary, an assumption that seldom holds in practice. For instance, in mobile telephony, noise sources with a marked non-stationary spectral signature include vehicles, machines, and other speakers to name a few. On the other hand, the usage of frequency-domain information in existing algorithms for speech enhancement in non-stationary noise environments can be made mor…

Noise powerComputer scienceSpeech recognitionWiener filterSpectral densityComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Context (language use)Background noiseSpeech enhancementNoisesymbols.namesakeComputer Science::SoundFrequency domainsymbols2021 11th International Conference on Information Science and Technology (ICIST)

researchProduct

Plasma Effecton Pulse Parameters of Sound Wave

2019

This article presents the experimental results of the effect of gas discharge plasma on an acoustic wave. It is shown that after a certain intensity of the acoustic wave, the presence of plasma leads to a decrease in the Q -factor of the acoustic resonator, a shift in the acoustic resonance in the gas discharge tube at a lower frequency, and after switching off the acoustics, low-frequency gas oscillations with a frequency of 2 Hz are generated in the tube.

Nuclear and High Energy PhysicsMaterials scienceAcousticsPlasmaAcoustic waveCondensed Matter Physicsbehavioral disciplines and activitiesElectric discharge in gasesIntensity (physics)law.inventionPulse (physics)ResonatorComputer Science::SoundPhysics::Plasma Physicslawotorhinolaryngologic diseasessense organsGas-filled tubepsychological phenomena and processesAcoustic resonanceIEEE Transactions on Plasma Science

researchProduct

Generation of stimulus features for analysis of FMRI during natural auditory experiences

2014

In contrast to block and event-related designs for fMRI experiments, it becomes much more difficult to extract events of interest in the complex continuous stimulus for finding corresponding blood-oxygen-level dependent (BOLD) responses. Recently, in a free music listening fMRI experiment, acoustic features of the naturalistic music stimulus were first extracted, and then principal component analysis (PCA) was applied to select the features of interest acting as the stimulus sequences. For feature generation, kernel PCA has shown its superiority over PCA in various applications, since it can implicitly exploit nonlinear relationship among features and such relationship seems to exist genera…

Quantitative Biology::Neurons and CognitionComputer Science::Soundsignaalinkäsittelyfeature extractionfMRIkernel PCAkokeet (tutkimustoiminta)riippumattomien komponenttien analyysiICAPolynomial kernelnaturalistic music

researchProduct

Optimizing auditory images and distance metrics for self‐organizing timbre maps*

1996

Abstract The effect of using different auditory images and distance metrics on the final configuration of a self‐organized timbre map is examined by comparing distance matrices, obtained from simulations, with a similarity rating matrix, obtained using the same set of stimuli as in the simulations. Gradient images, which are intended to represent idealizations of physiological gradient maps in the auditory pathway, are constructed. The optimal auditory image and distance metric, with respect to the similarity rating data, are searched using the gradient method.

Quantitative Biology::Neurons and CognitionVisual Arts and Performing Artsbusiness.industryRating matrixPattern recognitionImage (mathematics)Set (abstract data type)Similarity (network science)Computer Science::SoundComputer visionArtificial intelligencebusinessGradient methodTimbreMusicDistance matrices in phylogenyMathematicsJournal of New Music Research

researchProduct

KernelICA : Kernel Independent Component Analysis

2021

The kernel independent component analysis (kernel ICA) method introduced by Bach and Jordan (2003) . The incomplete Cholesky decomposition used in kernel ICA is provided as separate function. nonPeerReviewed

R-kieliComputer Science::Soundriippumattomien komponenttien analyysi

researchProduct

Non-linear RLS-based algorithm for pattern classification

2006

A new non-linear recursive least squares (RLS) algorithm is presented in the context of pattern classification problems. The algorithm incorporates the non-linearity of the filter's output in the updating rules of the classical RLS algorithm. The proposed method yields lower stationary error levels when compared to the standard LMS and RLS algorithms in a classical application of pattern classification, such as the channel equalization problem.

Recursive least squares filterSignal processingEqualizationContext (language use)Filter (signal processing)Computer Science::OtherNonlinear systemComputer Science::SoundControl and Systems EngineeringSignal ProcessingComputer Vision and Pattern RecognitionElectrical and Electronic EngineeringAlgorithmSoftwareMathematicsSignal Processing

researchProduct