Search results for "Speech processing"

showing 10 items of 210 documents

Embedded Knowledge-based Speech Detectors for Real-Time Recognition Tasks

2006

Speech recognition has become common in many application domains, from dictation systems for professional practices to vocal user interfaces for people with disabilities or hands-free system control. However, so far the performance of automatic speech recognition (ASR) systems are comparable to human speech recognition (HSR) only under very strict working conditions, and in general much lower. Incorporating acoustic-phonetic knowledge into ASR design has been proven a viable approach to raise ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as de…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniVoice activity detectionArtificial neural networkDictationbusiness.industryComputer scienceSpeech recognitionSpeech technologycomputer.software_genreSpeech processingManner of articulationSilenceVowelComputer ScienceTelecommunicationsMel-frequency cepstrumArtificial intelligencespeech detectorUser interfacebusinesscomputerNatural language processing

researchProduct

A nonstationary model for the analysis of transient speech signals

1987

In this correspondence, a model is presented for the analysis of transient speech signals, which is based on a sum of the impulsive responses corresponding to a number of poles with time-dependent parameters. The aim of this analysis is to obtain discriminative features of the different transient elements of speech.

Signal processingComputer scienceSignal reconstructionSpeech recognitionSpeech processingsymbols.namesakeFourier transformDiscriminative modelComputer Science::SoundFrequency domainSignal ProcessingsymbolsTransient (oscillation)Decoding methodsIEEE Transactions on Acoustics, Speech, and Signal Processing

researchProduct

Usage of HMM-Based Speech Recognition Methods for Automated Determination of a Similarity Level Between Languages

2019

The problem of automated determination of language similarity (or even defining of a distance on the space of languages) could be solved in different ways – working with phonetic transcriptions, with speech recordings or both of them. For the recordings, we propose and test a HMM-based one: in the first part of our article we successfully try language detection, afterwards we are trying to calculate distances between HMM-based models, using different metrics and divergences. The Kullback-Leibler divergence is the only one we got good results with – it means that the calculated distances between languages correspond to analytical understanding of similarity between them. Even if it does not …

Space (punctuation)Kullback–Leibler divergenceLanguage identificationSimilarity (network science)Computer scienceSpeech recognitionComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Hidden Markov modelUSableDivergence (statistics)

researchProduct

Special factors and the combinatorics of suffix and factor automata

2011

AbstractThe suffix automaton (resp. factor automaton) of a finite word w is the minimal deterministic automaton recognizing the set of suffixes (resp. factors) of w. We study the relationships between the structure of the suffix and factor automata and classical combinatorial parameters related to the special factors of w. We derive formulae for the number of states of these automata. We also characterize the languages LSA and LFA of words having respectively suffix automaton and factor automaton with the minimal possible number of states.

Special factorGeneral Computer ScienceSpecial factorsFactor automatonBüchi automatonω-automatonTheoretical Computer ScienceCombinatoricsDeterministic automatonTwo-way deterministic finite automatonNondeterministic finite automatonComputer Science::Data Structures and AlgorithmsCombinatorics on wordStandard Sturmian wordsMathematicsDiscrete mathematicsCombinatorics on wordsDAWGPushdown automatonComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Nonlinear Sciences::Cellular Automata and Lattice GasesSuffix automatonProbabilistic automatonSuffix automatonComputer Science::Formal Languages and Automata TheoryComputer Science(all)Theoretical Computer Science

researchProduct

Relationship between speech perception and production skills and morphosyntactic development in Spanish-speaking children with Speech Sound Disorders

2021

La investigación sobre el desarrollo gramatical y su posible relación con los déficits de procesamiento de habla en niños con Trastorno Fonológico (TF) es escasa, especialmente para la lengua española. El objetivo es analizar la influencia de las habilidades de percepción y producción de habla en el desarrollo morfosintáctico de los niños con TF sin Trastorno del Lenguaje (TL). Participaron 52 niños de habla española de 4 a 6 años: 26 con TF y 26 con desarrollo típico (DT) emparejados en edad cronológica, cociente de inteligencia no verbal y nivel de vocabulario receptivo. El desarrollo morfosintáctico se evaluó con el test de lenguaje CELF-Preschool-2-Spanish. Los niños realizaron una tare…

Speech productionPhonological disordersintaxisLanguage and Linguisticstrastorno fonológicoMorfologíamedia_commonSpeech productionpercepción de hablaIntelligence quotientLanguage and LiteratureSpeech perception05 social sciencesPSpeech sound disorderTrastorno de hablaSpeech sound disordermedicine.symptom0305 other medical sciencePsychologyPercepción de habla050104 developmental & child psychologyCognitive psychologySintaxisMorphologySpeech perceptionmedia_common.quotation_subjectP1-1091030507 speech-language pathology & audiology03 medical and health sciencesSpeech and HearingNonverbal communicationtrastorno de hablaPerceptionmorfologíamedicine0501 psychology and cognitive sciencesSyntaxProducción de hablaPhilology. Linguisticsproducción de hablaSpeech processingmedicine.diseaseOtorhinolaryngologyRF1-547PsicologiaTrastorno fonológicoTrastorns de la parlaSpeech disorderRevista de Investigación en Logopedia

researchProduct

Mismatch brain response to speech sound changes in rats

2011

Understanding speech is based on neural representations of individual speech sounds. In humans, such representations are capable of supporting an automatic and memory-based mechanism for auditory change detection, as reflected by the mismatch negativity of event-related potentials. There are also findings of neural representations of speech sounds in animals, but it is not known whether these representations can support the change detection mechanism analogous to that underlying the mismatch negativity in humans. To this end, we presented synthesized spoken syllables to urethane-anesthetized rats while local field potentials were epidurally recorded above their primary auditory cortex. In a…

Speech recognitionSpeech soundslcsh:BF1-990Mismatch negativityLocal field potentiallocal field potentialsAuditory cortex050105 experimental psychology03 medical and health sciences0302 clinical medicineequiprobable conditionPsychologyoddball condition0501 psychology and cognitive sciencesratauditoryequiprobableconditionGeneral Psychologyta515Original ResearchSpeech sound05 social scienceslocalfieldpotentialsSpeech processingoddballconditionspeechsoundlcsh:PsychologyStandard stimulusPsychologyspeech sound030217 neurology & neurosurgeryChange detectionFrontiers in Psychology

researchProduct

The Shuffle Product: New Research Directions

2015

In this paper we survey some recent researches concerning the shuffle operation that arise both in Formal Languages and in Combinatorics on Words.

Star-free languageComputer scienceProgramming languageComputer Science (all)Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)computer.software_genreIntermixed languageTheoretical Computer ScienceCombinatorics on wordsTheoryofComputation_ANALYSISOFALGORITHMSANDPROBLEMCOMPLEXITYProduct (mathematics)Formal languageShuffle squarecomputerShuffle

researchProduct

On the empirical spectral distribution for certain models related to sample covariance matrices with different correlations

2021

Given [Formula: see text], we study two classes of large random matrices of the form [Formula: see text] where for every [Formula: see text], [Formula: see text] are iid copies of a random variable [Formula: see text], [Formula: see text], [Formula: see text] are two (not necessarily independent) sets of independent random vectors having different covariance matrices and generating well concentrated bilinear forms. We consider two main asymptotic regimes as [Formula: see text]: a standard one, where [Formula: see text], and a slightly modified one, where [Formula: see text] and [Formula: see text] while [Formula: see text] for some [Formula: see text]. Assuming that vectors [Formula: see t…

Statistics and ProbabilityPhysicsAlgebra and Number TheorySpectral power distributionComputer Science::Information RetrievalProbability (math.PR)Astrophysics::Instrumentation and Methods for AstrophysicsBlock (permutation group theory)Marchenko–Pastur lawComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Bilinear form60F05 60B20 47N30Sample mean and sample covarianceCombinatoricsConvergence of random variablesFOS: Mathematicssample covariance matricesComputer Science::General LiteratureDiscrete Mathematics and CombinatoricsRandom matriceshigh dimensional statisticsStatistics Probability and UncertaintyRandom matrixRandom variableMathematics - ProbabilityRandom Matrices: Theory and Applications

researchProduct

Lévy–Khintchine decompositions for generating functionals on algebras associated to universal compact quantum groups

2018

We study the first and second cohomology groups of the $^*$-algebras of the universal unitary and orthogonal quantum groups $U_F^+$ and $O_F^+$. This provides valuable information for constructing and classifying L\'evy processes on these quantum groups, as pointed out by Sch\"urmann. In the case when all eigenvalues of $F^*F$ are distinct, we show that these $^*$-algebras have the properties (GC), (NC), and (LK) introduced by Sch\"urmann and studied recently by Franz, Gerhold and Thom. In the degenerate case $F=I_d$, we show that they do not have any of these properties. We also compute the second cohomology group of $U_d^+$ with trivial coefficients -- $H^2(U_d^+,{}_\epsilon\Bbb{C}_\epsil…

Statistics and ProbabilityPure mathematicsQuantum groupComputer Science::Information RetrievalApplied Mathematics010102 general mathematicsAstrophysics::Instrumentation and Methods for AstrophysicsComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Statistical and Nonlinear PhysicsHopf algebra[MATH.MATH-FA]Mathematics [math]/Functional Analysis [math.FA]01 natural sciencesUnitary stateCohomologyMathematics::K-Theory and HomologyMathematics - Quantum Algebra0103 physical sciencesComputer Science::General Literature16T20 (Primary) 16T05 (Secondary)010307 mathematical physics0101 mathematicsQuantumMathematical PhysicsComputingMilieux_MISCELLANEOUSMathematics

researchProduct

2018

Despite increasing interest in the development of audiovisual speech perception in infancy, the underlying mechanisms and neural processes are still only poorly understood. In addition to regions in temporal cortex associated with speech processing and multimodal integration, such as superior temporal sulcus, left inferior frontal cortex (IFC) has been suggested to be critically involved in mapping information from different modalities during speech perception. To further illuminate the role of IFC during infant language learning and speech perception, the current study examined the processing of auditory, visual and audiovisual speech in 6-month-old infants using functional near-infrared s…

Temporal cortexmedicine.medical_specialtyModalitiesModality (human–computer interaction)Speech perceptiongenetic structuresCognitive Neurosciencemedia_common.quotation_subject05 social sciencesSuperior temporal sulcusAudiologyLanguage acquisitionSpeech processing050105 experimental psychology03 medical and health sciences0302 clinical medicinePerceptionotorhinolaryngologic diseasesmedicine0501 psychology and cognitive sciencesPsychology030217 neurology & neurosurgerymedia_commonDevelopmental Cognitive Neuroscience

researchProduct