Search results for "speech"
showing 10 items of 1281 documents
Video preprocessing for audiovisual indexing
2003
We address the problem of detecting shots of subjects that are interviewed in news sequences. This is useful since usually these kinds of scenes contain important and reusable information that can be used for other news programs. In a previous paper, we presented a technique based on a priori knowledge of the editing techniques used in news sequences which allowed a fast search of news stories (see Albiol, A. et al., 3rd Int. Conf. on Audio and Video-based Biometric Person Authentication, p.366-71, 2001). We now present a new shot descriptor technique which improves the previous search results by using a simple, yet efficient, algorithm, based on the information contained in consecutive fra…
Detection of steering direction using EEG recordings based on sample entropy and time-frequency analysis.
2016
Monitoring driver's intentions beforehand is an ambitious aim, which will bring a huge impact on the society by preventing traffic accidents. Hence, in this preliminary study we recorded high resolution electroencephalography (EEG) from 5 subjects while driving a car under real conditions along with an accelerometer which detects the onset of steering. Two sensor-level analyses, sample entropy and time-frequency analysis, have been implemented to observe the dynamics before the onset of steering. Thus, in order to classify the steering direction we applied a machine learning algorithm consisting of: dimensionality reduction and classification using principal-component-analysis (PCA) and sup…
A NEW COMPLEXITY FUNCTION FOR WORDS BASED ON PERIODICITY
2013
Motivated by the extension of the critical factorization theorem to infinite words, we study the (local) periodicity function, i.e. the function that, for any position in a word, gives the size of the shortest square centered in that position. We prove that this function characterizes any binary word up to exchange of letters. We then introduce a new complexity function for words (the periodicity complexity) that, for any position in the word, gives the average value of the periodicity function up to that position. The new complexity function is independent from the other commonly used complexity measures as, for instance, the factor complexity. Indeed, whereas any infinite word with bound…
Background noise suppression for acoustic localization by means of an adaptive energy detection approach
2008
A microphone array can be employed to localize dominant acoustic sources in a given noisy environment. This capability is successfully used in good signal to noise ratio (SNR) conditions but its accuracy decreases considerably in the presence of other background noise sources. In order to counteract this effect, a novel approach that combines the information provided by a Gaussian energy detector (GED) with the approved localization method SRP-PHAT is presented in this paper. To evaluate the presented technique, several acoustic sources (speech and impulsive sounds) were considered in a variety of different scenarios to demonstrate the robustness and the accuracy of the system proposed.
Effects of tinnitus on postural control and stabilization: A pilot study
2015
Introduction: The aim of this study was to evaluate the tinnitus's impacts on postural control. Material and methods: Sixty-six subjects (age: 46,71 ± 15,12 years, height 166,32 ± 8,88 cm, weight 64,85 ± 12,57 kg) with idiopathic tinnitus were recruited for the study and were tested. Each subject underwent to ‘Romberg test’, ‘Static balance’ and ‘posture analysis’. Static balance and posture analysis were performed two times, with open and close eyes, and were measured through the FreeMed posturography system. Results: showed that subjects had worse Baropodometric performances respect to benchmarks; moreover according to literature the results show that these patients had significant differ…
Decoding Children's Social Behavior
2013
We introduce a new problem domain for activity recognition: the analysis of children's social and communicative behaviors based on video and audio data. We specifically target interactions between children aged 1-2 years and an adult. Such interactions arise naturally in the diagnosis and treatment of developmental disorders such as autism. We introduce a new publicly-available dataset containing over 160 sessions of a 3-5 minute child-adult interaction. In each session, the adult examiner followed a semi-structured play interaction protocol which was designed to elicit a broad range of social behaviors. We identify the key technical challenges in analyzing these behaviors, and describe met…
The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring.
2020
Laaksonen, S-M.; Haapoja, J.; Kinnunen, T., Nelimarkka, M. & Pöyhtäri, R. (2020, accepted). . Frontiers in Big Data: Data Mining and Management / Critical Data and Algorithm Studies. doi:10.3389/fdata.2020.00003 Hate speech has been identified as a pressing problem in society and several automated approaches have been designed to detect and prevent it. This paper reports and reflects upon an action research setting consisting of multi-organizational collaboration conducted during Finnish municipal elections in 2017, wherein a technical infrastructure was designed to automatically monitor candidates' social media updates for hate speech. The setting allowed us to engage in a 2-fold investiga…
2017
In continuous flash suppression (CFS), a dynamic noise masker, presented to one eye, suppresses conscious perception of a test stimulus, presented to the other eye, until the suppressed stimulus comes to awareness after few seconds. But what do we see breaking the dominance of the masker in the transition period? We addressed this question with a dual-task in which observers indicated (i) whether the test object was left or right of the fixation mark (localization) and (ii) whether it was a face or a house (categorization). As done recently (Stein et al., 2011), we used two experimental varieties to rule out confounds with decisional strategy. In the terminated mode, stimulus and masker wer…
Quantitative comparison of motion history image variants for video-based depression assessment
2017
Abstract Depression is the most prevalent mood disorder and a leading cause of disability worldwide. Automated video-based analyses may afford objective measures to support clinical judgments. In the present paper, categorical depression assessment is addressed by proposing a novel variant of the Motion History Image (MHI) which considers Gabor-inhibited filtered data instead of the original image. Classification results obtained with this method on the AVEC’14 dataset are compared to those derived using (a) an earlier MHI variant, the Landmark Motion History Image (LMHI), and (b) the original MHI. The different motion representations were tested in several combinations of appearance-based …
Polarization Modulation Instability in All-Normal Dispersion Microstructured Optical Fibers with Quasi-Continuous 1064 nm Pump
2019
Polarization modulation instability (PMI) is a form of modulation instability that can exist in weakly birefringent optical fibers [1]. Sidebands can be generated by this effect when a polarization mode of the birefringent fiber is excited with an intense optical pump. The polarization state of the sidebands is orthogonal to the polarization of the pump signal. PMI has been observed in microstructured optical fibers (MOFs). PMI was reported in a large-air-filling fraction MOF that was pumped in the normal dispersion regime with visible light [2]. The coherent degradation of femtosecond supercontinuum light generated in all-normal dispersion (ANDi) MOFs due to PMI was recently investigated […