Search results for "speech"

showing 10 items of 1281 documents

On the use of a metric-space search algorithm (AESA) for fast DTW-based recognition of isolated words

1988

The approximating and eliminating search algorithm (AESA) presented was recently introduced for finding nearest neighbors in metric spaces. Although the AESA was originally developed for reducing the time complexity of dynamic time-warping isolated word recognition (DTW-IWR), only rather limited experiments had been previously carried out to check its performance in this task. A set of experiments aimed at filling this gap is reported. The main results show that the important features reflected in previous simulation experiments are also true for real speech samples. With single-speaker dictionaries of up to 200 words, and for most of the different speech parameterizations, local metrics, a…

Dynamic time warpingbusiness.industryComputer scienceSpeech recognitionComputationPattern recognitionTask (project management)Set (abstract data type)Metric spaceSearch algorithmSignal ProcessingWord recognitionArtificial intelligencebusinessTime complexityIEEE Transactions on Acoustics, Speech, and Signal Processing
researchProduct

Medical Academic Speech. A Corpus-based Investigation of Same-Speaker Most Frequent Content Key Word Repetition in Non-Native English Discourse

2016

Studies on repetition in ELF interactions have been carried out in several domains, but medical academic discourse still remains under-researched. This paper explores same-speaker repetition in a 31,153-word corpus of lectures included in the 100,135-word medical section of the 1 million-word ELFA (English as a Lingua Franca in Academic Settings) corpus. More specifi cally, the corpus was searched for the most frequent same-speaker content key word repetition and corresponding functions, with both immediate and delayed repetition being scrutinized. The results confi rmed the initial hypothesis according to which same-speaker repetition was expected to be pervasive in the data, not only as a…

ELFA corpus.same-speaker repetitionEnglish as a lingua francaMedical academic speechMedical academic speech; same-speaker repetition; English as a lingua franca; ELFA corpus.Settore L-LIN/12 - Lingua E Traduzione - Lingua Inglese
researchProduct

Electrophysiological evidence for change detection in speech sound patterns by anesthetized rats

2014

Human infants are able to detect changes in grammatical rules in a speech sound stream. Here, we tested whether rats have a comparable ability by using an electrophysiological measure that has been shown to reflect higher order auditory cognition even before it becomes manifested in behavioral level. Urethane-anesthetized rats were presented with a stream of sequences consisting of three pseudowords carried out at a fast pace. Frequently presented “standard” sequences had 16 variants which all had the same structure. They were occasionally replaced by acoustically novel “deviant” sequences of two different types: structurally consistent and inconsistent sequences. Two stimulus conditions we…

EXTRACTIONCORTEX515 PsychologySpeech recognitionspeecheducationMismatch negativityINTELLIGENCELocal field potentialStimulus (physiology)Auditory cortexbehavioral disciplines and activitieslcsh:RC321-571MECHANISMSlocal-field potentialsmedicinePsychologyauditory cortexratOriginal Research ArticleCOTTON-TOP TAMARINSlcsh:Neurosciences. Biological psychiatry. Neuropsychiatryta515pattern perceptionGeneral NeuroscienceNoveltyCognitionHuman brainElectrophysiologymedicine.anatomical_structureDISCRIMINATIONSTREAMmismatch negativityMONKEYSpoikkeavuusnegatiivisuusPsychologyNeuroscienceRULE
researchProduct

Design and Implementation of Deep Learning Based Contactless Authentication System Using Hand Gestures

2021

Hand gestures based sign language digits have several contactless applications. Applications include communication for impaired people, such as elderly and disabled people, health-care applications, automotive user interfaces, and security and surveillance. This work presents the design and implementation of a complete end-to-end deep learning based edge computing system that can verify a user contactlessly using &lsquo

Edge deviceComputer Networks and CommunicationsComputer scienceSpeech recognitionlcsh:TK7800-8360securitySign languageVDP::Teknologi: 500::Elektrotekniske fag: 540edge computingCode (cryptography)ComputerSystemsOrganization_SPECIAL-PURPOSEANDAPPLICATION-BASEDSYSTEMSElectrical and Electronic EngineeringEdge computingAuthenticationhand gestures recognitionArtificial neural networkbusiness.industryDeep learninglcsh:Electronicsdeep learningneural networkscontactless authenticationHardware and ArchitectureControl and Systems Engineeringcamera based authenticationSignal ProcessingArtificial intelligencebusinessGesture
researchProduct

Integrating Computational Linguistic Analysis of Multilingual Learning Data and Educational Measurement Approaches to Explore Learning in Higher Educ…

2017

This chapter develops a computational linguistic model for analyzing and comparing multilingual data as well as its application to a large body of standardized assessment data from higher education. The approach employs both an automatic and a manual annotation of the data on several linguistic layers (including parts of speech, text structure and content). Quantitative features of the textual data are explored that are related to both the students’ (domain-specific knowledge) test results and their level of academic experience. The respective analysis involves statistics of distance correlation, text categorization with respect to text types (questions and response options) as well as lang…

Educational measurementHigher educationbusiness.industryComputer scienceStandardized testPart of speechcomputer.software_genrelanguage.human_languageTest (assessment)GermanDistance correlationlanguageText typesArtificial intelligencebusinesscomputerNatural language processing
researchProduct

Measurement of ultra-low heating rates of a single antiproton in a cryogenic Penning trap

2019

Physical review letters 122(4), 043201 (2019). doi:10.1103/PhysRevLett.122.043201

Electric fieldsField noiseCryogenicsAtomic Physics (physics.atom-ph)Penning trapOther Fields of PhysicsGeneral Physics and AstronomyFOS: Physical sciences01 natural sciences530physics.atom-phPhysics - Atomic PhysicsSpectral densityNoise spectral densityTheoryofComputation_ANALYSISOFALGORITHMSANDPROBLEMCOMPLEXITY0103 physical sciencesddc:530010306 general physicsPhysicsComputer Science::Information RetrievalSpectral densityComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Penning trapOrders of magnitudeAntiprotonQuantum transition rateDewey Decimal Classification::500 | Naturwissenschaften::530 | PhysikAtomic physicsPräzisionsexperimente - Abteilung BlaumIon traps
researchProduct

Exploring relationships between audio features and emotion in music

2009

In this paper, we present an analysis of the associations between emotion categories and audio features automatically extracted from raw audio data. This work is based on 110 excerpts from film soundtracks evaluated by 116 listeners. This data is annotated with 5 basic emotions (fear, anger, happiness, sadness, tenderness) on a 7 points scale. Exploiting state-of-the-art Music Information Retrieval (MIR) techniques, we extract audio features of different kind: timbral, rhythmic and tonal. Among others we also compute estimations of dissonance, mode, onset rate and loudness. We study statistical relations between audio descriptors and emotion categories confirming results from psychological …

Emotion classificationmedia_common.quotation_subjectSpeech recognitionAngerLoudnessSadnessBehavioral NeurosciencePsychiatry and Mental healthRaw audio formatMode (music)Neuropsychology and Physiological PsychologyNeurologyHappinessMusic information retrievalPsychologyBiological Psychiatrymedia_commonFrontiers in Human Neuroscience
researchProduct

Boosting Hankel matrices for face emotion recognition and pain detection

2017

HighligthsDynamics of face expression descriptors are modeled for emotion recognition.A set of Hankel matrices is built upon several multi-scale face representations.Boosting and random subspace projection are used for dynamics selection.Dynamics of Haar-like features and Gabor Energies are compared.Fine-grained dynamics of subtle expressions can be modeled at small spatial scales. Studies in psychology have shown that the dynamics of emotional expressions play an important role in face emotion recognition in humans. Motivated by these studies, in this paper the dynamics of face expressions are modeled and used for automatic emotion recognition and pain detection.Given a temporal sequence o…

EmotionLTI systemSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniFacial expressionSignal processingBoosting (machine learning)business.industrySpeech recognition020207 software engineeringHankel matrix02 engineering and technologyBoostingSoftwareSignal Processing0202 electrical engineering electronic engineering information engineeringFace processing020201 artificial intelligence & image processingEmotional expressionComputer Vision and Pattern RecognitionbusinessClassifier (UML)Hankel matrixSubspace topologySoftwareMathematics
researchProduct

Ensemble of Hankel Matrices for Face Emotion Recognition

2015

In this paper, a face emotion is considered as the result of the composition of multiple concurrent signals, each corresponding to the movements of a specific facial muscle. These concurrent signals are represented by means of a set of multi-scale appearance features that might be correlated with one or more concurrent signals. The extraction of these appearance features from a sequence of face images yields to a set of time series. This paper proposes to use the dynamics regulating each appearance feature time series to recognize among different face emotions. To this purpose, an ensemble of Hankel matrices corresponding to the extracted time series is used for emotion classification withi…

EmotionLTI systemSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniMajority ruleComputer sciencebusiness.industrySpeech recognitionEmotion classificationComputer Science (all)Hankel matrixPattern recognitionClassificationTheoretical Computer Sciencek-nearest neighbors algorithmSchema (psychology)Face processingArtificial intelligenceEmotion recognitionbusinessHankel matrix
researchProduct

Mismatches between objective parameters and measured perception assessment in room acoustics: a holistic approach

2014

Psychoacoustic research in the field of concert halls has revealed that many aspects concerning listening perception have yet to be totally understood. On the one hand, the objective room acoustics of performance spaces are reflected in parameters, some standardized and some not, but these are related to a limited number of perceptual attributes of human response. In general, these objective parameters cannot accurately describe the acoustic details due to their inherent simplification. Under these premises, impulse responses (576 receivers) are measured in 16 concert halls, according to standard procedures, and the perception and satisfaction of the occupants of the rooms are evaluated by …

EngineeringEnvironmental EngineeringSpeech recognitionmedia_common.quotation_subjectGeography Planning and DevelopmentPerceptive acoustic evaluationAcoustic qualityMachine learningcomputer.software_genreConcert-goers responsesField (computer science)CorrelationPerceptionActive listeningPsychoacousticsMultidimensional scalingConcert hallCivil and Structural Engineeringmedia_commonbusiness.industryBuilding and ConstructionRoom acousticsHierarchical clusteringFISICA APLICADAArtificial intelligencebusinessMATEMATICA APLICADAcomputerRoom acousticsMultidimensional scaling
researchProduct