Search results for "speech"
showing 10 items of 1281 documents
On the use of a metric-space search algorithm (AESA) for fast DTW-based recognition of isolated words
1988
The approximating and eliminating search algorithm (AESA) presented was recently introduced for finding nearest neighbors in metric spaces. Although the AESA was originally developed for reducing the time complexity of dynamic time-warping isolated word recognition (DTW-IWR), only rather limited experiments had been previously carried out to check its performance in this task. A set of experiments aimed at filling this gap is reported. The main results show that the important features reflected in previous simulation experiments are also true for real speech samples. With single-speaker dictionaries of up to 200 words, and for most of the different speech parameterizations, local metrics, a…
Medical Academic Speech. A Corpus-based Investigation of Same-Speaker Most Frequent Content Key Word Repetition in Non-Native English Discourse
2016
Studies on repetition in ELF interactions have been carried out in several domains, but medical academic discourse still remains under-researched. This paper explores same-speaker repetition in a 31,153-word corpus of lectures included in the 100,135-word medical section of the 1 million-word ELFA (English as a Lingua Franca in Academic Settings) corpus. More specifi cally, the corpus was searched for the most frequent same-speaker content key word repetition and corresponding functions, with both immediate and delayed repetition being scrutinized. The results confi rmed the initial hypothesis according to which same-speaker repetition was expected to be pervasive in the data, not only as a…
Electrophysiological evidence for change detection in speech sound patterns by anesthetized rats
2014
Human infants are able to detect changes in grammatical rules in a speech sound stream. Here, we tested whether rats have a comparable ability by using an electrophysiological measure that has been shown to reflect higher order auditory cognition even before it becomes manifested in behavioral level. Urethane-anesthetized rats were presented with a stream of sequences consisting of three pseudowords carried out at a fast pace. Frequently presented “standard” sequences had 16 variants which all had the same structure. They were occasionally replaced by acoustically novel “deviant” sequences of two different types: structurally consistent and inconsistent sequences. Two stimulus conditions we…
Design and Implementation of Deep Learning Based Contactless Authentication System Using Hand Gestures
2021
Hand gestures based sign language digits have several contactless applications. Applications include communication for impaired people, such as elderly and disabled people, health-care applications, automotive user interfaces, and security and surveillance. This work presents the design and implementation of a complete end-to-end deep learning based edge computing system that can verify a user contactlessly using &lsquo
Integrating Computational Linguistic Analysis of Multilingual Learning Data and Educational Measurement Approaches to Explore Learning in Higher Educ…
2017
This chapter develops a computational linguistic model for analyzing and comparing multilingual data as well as its application to a large body of standardized assessment data from higher education. The approach employs both an automatic and a manual annotation of the data on several linguistic layers (including parts of speech, text structure and content). Quantitative features of the textual data are explored that are related to both the students’ (domain-specific knowledge) test results and their level of academic experience. The respective analysis involves statistics of distance correlation, text categorization with respect to text types (questions and response options) as well as lang…
Measurement of ultra-low heating rates of a single antiproton in a cryogenic Penning trap
2019
Physical review letters 122(4), 043201 (2019). doi:10.1103/PhysRevLett.122.043201
Exploring relationships between audio features and emotion in music
2009
In this paper, we present an analysis of the associations between emotion categories and audio features automatically extracted from raw audio data. This work is based on 110 excerpts from film soundtracks evaluated by 116 listeners. This data is annotated with 5 basic emotions (fear, anger, happiness, sadness, tenderness) on a 7 points scale. Exploiting state-of-the-art Music Information Retrieval (MIR) techniques, we extract audio features of different kind: timbral, rhythmic and tonal. Among others we also compute estimations of dissonance, mode, onset rate and loudness. We study statistical relations between audio descriptors and emotion categories confirming results from psychological …
Boosting Hankel matrices for face emotion recognition and pain detection
2017
HighligthsDynamics of face expression descriptors are modeled for emotion recognition.A set of Hankel matrices is built upon several multi-scale face representations.Boosting and random subspace projection are used for dynamics selection.Dynamics of Haar-like features and Gabor Energies are compared.Fine-grained dynamics of subtle expressions can be modeled at small spatial scales. Studies in psychology have shown that the dynamics of emotional expressions play an important role in face emotion recognition in humans. Motivated by these studies, in this paper the dynamics of face expressions are modeled and used for automatic emotion recognition and pain detection.Given a temporal sequence o…
Ensemble of Hankel Matrices for Face Emotion Recognition
2015
In this paper, a face emotion is considered as the result of the composition of multiple concurrent signals, each corresponding to the movements of a specific facial muscle. These concurrent signals are represented by means of a set of multi-scale appearance features that might be correlated with one or more concurrent signals. The extraction of these appearance features from a sequence of face images yields to a set of time series. This paper proposes to use the dynamics regulating each appearance feature time series to recognize among different face emotions. To this purpose, an ensemble of Hankel matrices corresponding to the extracted time series is used for emotion classification withi…
Mismatches between objective parameters and measured perception assessment in room acoustics: a holistic approach
2014
Psychoacoustic research in the field of concert halls has revealed that many aspects concerning listening perception have yet to be totally understood. On the one hand, the objective room acoustics of performance spaces are reflected in parameters, some standardized and some not, but these are related to a limited number of perceptual attributes of human response. In general, these objective parameters cannot accurately describe the acoustic details due to their inherent simplification. Under these premises, impulse responses (576 receivers) are measured in 16 concert halls, according to standard procedures, and the perception and satisfaction of the occupants of the rooms are evaluated by …