Search results for "speech"

showing 10 items of 1281 documents

Video preprocessing for audiovisual indexing

2003

We address the problem of detecting shots of subjects that are interviewed in news sequences. This is useful since usually these kinds of scenes contain important and reusable information that can be used for other news programs. In a previous paper, we presented a technique based on a priori knowledge of the editing techniques used in news sequences which allowed a fast search of news stories (see Albiol, A. et al., 3rd Int. Conf. on Audio and Video-based Biometric Person Authentication, p.366-71, 2001). We now present a new shot descriptor technique which improves the previous search results by using a simple, yet efficient, algorithm, based on the information contained in consecutive fra…

AuthenticationSequenceInformation retrievalContextual image classificationBiometricsComputer scienceSpeech recognitionSearch engine indexingcomputer.software_genreObject detectionReduction (complexity)Face (geometry)PreprocessorAudio signal processingcomputerImage retrievalIEEE International Conference on Acoustics Speech and Signal Processing
researchProduct

Detection of steering direction using EEG recordings based on sample entropy and time-frequency analysis.

2016

Monitoring driver's intentions beforehand is an ambitious aim, which will bring a huge impact on the society by preventing traffic accidents. Hence, in this preliminary study we recorded high resolution electroencephalography (EEG) from 5 subjects while driving a car under real conditions along with an accelerometer which detects the onset of steering. Two sensor-level analyses, sample entropy and time-frequency analysis, have been implemented to observe the dynamics before the onset of steering. Thus, in order to classify the steering direction we applied a machine learning algorithm consisting of: dimensionality reduction and classification using principal-component-analysis (PCA) and sup…

Automobile DrivingSupport Vector MachineComputer scienceSpeech recognitionEntropyElectroencephalography03 medical and health sciencesEntropy (classical thermodynamics)0302 clinical medicine0502 economics and businessAccelerometrymedicineEntropy (information theory)HumansEntropy (energy dispersal)Entropy (arrow of time)050210 logistics & transportationPrincipal Component Analysismedicine.diagnostic_testbusiness.industryEntropy (statistical thermodynamics)Dimensionality reduction05 social sciencesPattern recognitionElectroencephalographyTime–frequency analysisSupport vector machineSample entropyPrincipal component analysisArtificial intelligencebusiness030217 neurology & neurosurgeryAlgorithmsEntropy (order and disorder)Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference
researchProduct

A NEW COMPLEXITY FUNCTION FOR WORDS BASED ON PERIODICITY

2013

Motivated by the extension of the critical factorization theorem to infinite words, we study the (local) periodicity function, i.e. the function that, for any position in a word, gives the size of the shortest square centered in that position. We prove that this function characterizes any binary word up to exchange of letters. We then introduce a new complexity function for words (the periodicity complexity) that, for any position in the word, gives the average value of the periodicity function up to that position. The new complexity function is independent from the other commonly used complexity measures as, for instance, the factor complexity. Indeed, whereas any infinite word with bound…

Average-case complexityDiscrete mathematicsFibonacci numberSettore INF/01 - InformaticaGeneral Mathematicscomplexity functionComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Function (mathematics)periodicitycritical factorization theoremCombinatoricsComplexity indexCombinatorics on wordsBounded functionComplexity functionComputer Science::Formal Languages and Automata TheoryWord (computer architecture)Combinatorics on wordMathematicsInternational Journal of Algebra and Computation
researchProduct

Background noise suppression for acoustic localization by means of an adaptive energy detection approach

2008

A microphone array can be employed to localize dominant acoustic sources in a given noisy environment. This capability is successfully used in good signal to noise ratio (SNR) conditions but its accuracy decreases considerably in the presence of other background noise sources. In order to counteract this effect, a novel approach that combines the information provided by a Gaussian energy detector (GED) with the approved localization method SRP-PHAT is presented in this paper. To evaluate the presented technique, several acoustic sources (speech and impulsive sounds) were considered in a variety of different scenarios to demonstrate the robustness and the accuracy of the system proposed.

Background noisesymbols.namesakeMicrophone arraySignal-to-noise ratioComputer Science::SoundComputer scienceRobustness (computer science)AcousticsGaussianSpeech recognitionDetectorsymbolsNoise control2008 IEEE International Conference on Acoustics, Speech and Signal Processing
researchProduct

Effects of tinnitus on postural control and stabilization: A pilot study

2015

Introduction: The aim of this study was to evaluate the tinnitus's impacts on postural control. Material and methods: Sixty-six subjects (age: 46,71 ± 15,12 years, height 166,32 ± 8,88 cm, weight 64,85 ± 12,57 kg) with idiopathic tinnitus were recruited for the study and were tested. Each subject underwent to ‘Romberg test’, ‘Static balance’ and ‘posture analysis’. Static balance and posture analysis were performed two times, with open and close eyes, and were measured through the FreeMed posturography system. Results: showed that subjects had worse Baropodometric performances respect to benchmarks; moreover according to literature the results show that these patients had significant differ…

Balance Tinnitus Speech motor controlSettore M-EDF/02 - Metodi E Didattiche Delle Attivita' SportiveSettore M-EDF/01 - Metodi E Didattiche Delle Attivita' Motorie
researchProduct

Decoding Children's Social Behavior

2013

We introduce a new problem domain for activity recognition: the analysis of children's social and communicative behaviors based on video and audio data. We specifically target interactions between children aged 1-2 years and an adult. Such interactions arise naturally in the diagnosis and treatment of developmental disorders such as autism. We introduce a new publicly-available dataset containing over 160 sessions of a 3-5 minute child-adult interaction. In each session, the adult examiner followed a semi-structured play interaction protocol which was designed to elicit a broad range of social behaviors. We identify the key technical challenges in analyzing these behaviors, and describe met…

Behavior Psychology Dataset Video analysis Speech Analysis AutismInter-action protocolsSocial and communicative behaviorInteraction protocol02 engineering and technologycomputer.software_genreAnnan data- och informationsvetenskapSession (web analytics)Activity recognitionTechnical challenges0202 electrical engineering electronic engineering information engineeringmedicineSocial behaviorAudio signal processingMultimediabusiness.industryDevelopmental disorders020207 software engineeringmedicine.diseaseSemi-structuredResearch questionsActivity recognitionProblem domainKey (cryptography)Autism020201 artificial intelligence & image processingArtificial intelligencePsychologybusinessOther Computer and Information SciencecomputerCognitive psychologySocial behavior2013 IEEE Conference on Computer Vision and Pattern Recognition
researchProduct

The Datafication of Hate: Expectations and Challenges in Automated Hate Speech Monitoring.

2020

Laaksonen, S-M.; Haapoja, J.; Kinnunen, T., Nelimarkka, M. & Pöyhtäri, R. (2020, accepted). . Frontiers in Big Data: Data Mining and Management / Critical Data and Algorithm Studies. doi:10.3389/fdata.2020.00003 Hate speech has been identified as a pressing problem in society and several automated approaches have been designed to detect and prevent it. This paper reports and reflects upon an action research setting consisting of multi-organizational collaboration conducted during Finnish municipal elections in 2017, wherein a technical infrastructure was designed to automatically monitor candidates' social media updates for hate speech. The setting allowed us to engage in a 2-fold investiga…

Big DataComputer sciencehate speechsocial media518 Media and communicationssosiaalinen mediamonitorointi050801 communication & media studiesSocial issues0508 media and communicationspolitiikkadatatiedeArtificial Intelligencealgoritmit050602 political science & public administrationComputer Science (miscellaneous)Social mediaalgorithmic systemvihapuheAction researchObjectivity (science)Original Researchlcsh:T58.5-58.64DataficationSocial phenomenonlcsh:Information technologytekstinlouhinta05 social sciencesCitizen journalism16. Peace & justice113 Computer and information sciencesData science0506 political sciencekoneoppiminenmachine learningNeutralitydata sciencepoliticsInformation Systems
researchProduct

2017

In continuous flash suppression (CFS), a dynamic noise masker, presented to one eye, suppresses conscious perception of a test stimulus, presented to the other eye, until the suppressed stimulus comes to awareness after few seconds. But what do we see breaking the dominance of the masker in the transition period? We addressed this question with a dual-task in which observers indicated (i) whether the test object was left or right of the fixation mark (localization) and (ii) whether it was a face or a house (categorization). As done recently (Stein et al., 2011), we used two experimental varieties to rule out confounds with decisional strategy. In the terminated mode, stimulus and masker wer…

Binocular rivalrygenetic structuresConscious perceptionSpeech recognitionStimulus (physiology)Test object050105 experimental psychology03 medical and health sciencesBehavioral Neuroscience0302 clinical medicineContinuous flash suppression0501 psychology and cognitive sciencesComputer visionDynamic noiseBiological Psychiatrybusiness.industry05 social sciencesCognitive neuroscience of visual object recognitionPsychiatry and Mental healthNeuropsychology and Physiological PsychologyNeurologyCategorizationArtificial intelligencePsychologybusiness030217 neurology & neurosurgeryFrontiers in Human Neuroscience
researchProduct

Quantitative comparison of motion history image variants for video-based depression assessment

2017

Abstract Depression is the most prevalent mood disorder and a leading cause of disability worldwide. Automated video-based analyses may afford objective measures to support clinical judgments. In the present paper, categorical depression assessment is addressed by proposing a novel variant of the Motion History Image (MHI) which considers Gabor-inhibited filtered data instead of the original image. Classification results obtained with this method on the AVEC’14 dataset are compared to those derived using (a) an earlier MHI variant, the Landmark Motion History Image (LMHI), and (b) the original MHI. The different motion representations were tested in several combinations of appearance-based …

BiometricsComputer scienceSpeech recognitionlcsh:TK7800-836002 engineering and technologyConvolutional neural networkMotion (physics)[SPI]Engineering Sciences [physics]Image processingMachine learning0502 economics and business[ SPI ] Engineering Sciences [physics]0202 electrical engineering electronic engineering information engineeringElectrical and Electronic EngineeringCategorical variableComputingMilieux_MISCELLANEOUSLandmarkbusiness.industrylcsh:Electronics05 social sciencesAffective computingFacial image analysisPattern recognitionMotion history imageMoodSignal ProcessingPattern recognition (psychology)Depression assessment020201 artificial intelligence & image processingArtificial intelligenceF1 scorebusiness050203 business & managementInformation SystemsEURASIP Journal on Image and Video Processing
researchProduct

Polarization Modulation Instability in All-Normal Dispersion Microstructured Optical Fibers with Quasi-Continuous 1064 nm Pump

2019

Polarization modulation instability (PMI) is a form of modulation instability that can exist in weakly birefringent optical fibers [1]. Sidebands can be generated by this effect when a polarization mode of the birefringent fiber is excited with an intense optical pump. The polarization state of the sidebands is orthogonal to the polarization of the pump signal. PMI has been observed in microstructured optical fibers (MOFs). PMI was reported in a large-air-filling fraction MOF that was pumped in the normal dispersion regime with visible light [2]. The coherent degradation of femtosecond supercontinuum light generated in all-normal dispersion (ANDi) MOFs due to PMI was recently investigated […

BirefringenceOptical fiberMaterials sciencebusiness.industryComputer Science::Information RetrievalPhysics::OpticsComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Polarization (waves)Supercontinuumlaw.inventionOptical pumpinglawPicosecondExcited stateFemtosecondOptoelectronicsbusiness2019 Conference on Lasers and Electro-Optics Europe & European Quantum Electronics Conference (CLEO/Europe-EQEC)
researchProduct