Search results for "speech"
showing 10 items of 1281 documents
Part of Speech Tagging Using Hidden Markov Models
2020
Abstract In this paper, we present a wide range of models based on less adaptive and adaptive approaches for a PoS tagging system. These parameters for the adaptive approach are based on the n-gram of the Hidden Markov Model, evaluated for bigram and trigram, and based on three different types of decoding method, in this case forward, backward, and bidirectional. We used the Brown Corpus for the training and the testing phase. The bidirectional trigram model almost reaches state of the art accuracy but is disadvantaged by the decoding speed time while the backward trigram reaches almost the same results with a way better decoding speed time. By these results, we can conclude that the decodi…
Lists of Spanish sentences with equivalent predictability, phonetic content, length, and frequency of the last word.
2010
This paper presents a pool of Spanish sentences designed for use in cognitive research and speech processing in circumstances in which the effects of context are relevant. These lists of sentences are divided into six lists of 25 equivalent high-predictability sentences and six lists of 25 low-predictability sentences according to the extent to which the last word can be predicted by the preceding context. These lists were also equivalent in phonetic content, length and frequency of the last word. These lists are intended for use in psycholinguistic research with Spanish-speaking listeners.
Towards Diagrammatic Patterns
2008
This article presents the idea that the graphical representation (concrete syntax) of a visual language can be specified based on some pre-defined diagrammatic patterns. A diagram from the Specification and Description Language (SDL) is used as illustration.
Overview of the Second BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora
2017
This paper presents the BUCC 2017 shared task on parallel sentence extraction from comparable corpora. It recalls the design of the datasets, presents their final construction and statistics and the methods used to evaluate system results. 13 runs were submitted to the shared task by 4 teams, covering three of the four proposed language pairs: French-English (7 runs), German-English (3 runs), and Chinese-English (3 runs). The best F-scores as measured against the gold standard were 0.84 (German-English), 0.80 (French-English), and 0.43 (Chinese-English). Because of the design of the dataset, in which not all gold parallel sentence pairs are known, these are only minimum values. We examined …
Morphological Analysis Combined with a Machine Learning Approach to Detect Utrasound Median Sagittal Sections for the Nuchal Translucency Measurement
2017
The screening of chromosomal defects, as trisomy 13, 18 and 21, can be obtained by the measurement of the nuchal translucency thickness scanning during the end of the first trimester of pregnancy. This contribution proposes an automatic methodology to detect mid-sagittal sections to identify the correct measurement of nuchal translucency. Wavelet analysis and neural network classifiers are the main strategies of the proposed methodology to detect the frontal components of the skull and the choroid plexus with the support of radial symmetry analysis. Real clinical ultrasound images were adopted to measure the performance and the robustness of the methodology, thus it can be highlighted an er…
Fusion Architectures for Word-Based Audiovisual Speech Recognition
2020
Spectrogram analysis of multipath fading channels
2015
The analysis of the Doppler power spectral density (PSD) of measured and simulated data is an important topic in the area of mobile radio channel modelling. In this paper, we estimate the Doppler PSD of multipath fading channels by using the concept of the spectrogram. The spectrogram is a spectral representation that gives insight into how the distribution of the spectral density of a signal changes over time. The multipath fading channel is modelled by a sum-of-cisoids (SOC) process. A closed-form solution is presented for the spectrogram and the corresponding time-dependent autocorrelation function (ACF). The closed-form solutions disclose several unwanted effects that come with the limi…
Atrial activity extraction for atrial fibrillation analysis using blind source separation.
2004
This contribution addresses the extraction of atrial activity (AA) from real electrocardiogram (ECG) recordings of atrial fibrillation (AF). We show the appropriateness of independent component analysis (ICA) to tackle this biomedical challenge when regarded as a blind source separation (BSS) problem. ICA is a statistical tool able to reconstruct the unobservable independent sources of bioelectric activity which generate, through instantaneous linear mixing, a measurable set of signals. The three key hypothesis that make ICA applicable in the present scenario are discussed and validated: 1) AA and ventricular activity (VA) are generated by sources of independent bioelectric activity; 2) AA …
A Musical Pattern Discovery System Founded on a Modeling of Listening Strategies
2004
Music is a domain of expression that conveys a paramount degree of complexity. The musical surface, composed of a multitude of notes, results from the elaboration of numerous structures of different types and sizes. The composer constructs this structural complexity in a more or less explicit way. The listener, faced by such a complex phenomenon, is able to reconstruct only a limited part of it, mostly in a non-explicit way. One particular aim of music analysis is to objectify such complexity, thus offering to the listener a tool for enriching the appreciation of music (Lartillot and SaintJames, 2004). The trouble is, traditional musical analysis, although offering a valuable understanding …
Depression Assessment by Fusing High and Low Level Features from Audio, Video, and Text
2016
International audience; Depression is a major cause of disability world-wide. The present paper reports on the results of our participation to the depression sub-challenge of the sixth Audio/Visual Emotion Challenge (AVEC 2016), which was designed to compare feature modalities ( audio, visual, interview transcript-based) in gender-based and gender-independent modes using a variety of classification algorithms. In our approach, both high and low level features were assessed in each modality. Audio features were extracted from the low-level descriptors provided by the challenge organizers. Several visual features were extracted and assessed including dynamic characteristics of facial elements…