Search results for "speech recognition"

showing 10 items of 357 documents

Boosting Hankel matrices for face emotion recognition and pain detection

2017

HighligthsDynamics of face expression descriptors are modeled for emotion recognition.A set of Hankel matrices is built upon several multi-scale face representations.Boosting and random subspace projection are used for dynamics selection.Dynamics of Haar-like features and Gabor Energies are compared.Fine-grained dynamics of subtle expressions can be modeled at small spatial scales. Studies in psychology have shown that the dynamics of emotional expressions play an important role in face emotion recognition in humans. Motivated by these studies, in this paper the dynamics of face expressions are modeled and used for automatic emotion recognition and pain detection.Given a temporal sequence o…

EmotionLTI systemSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniFacial expressionSignal processingBoosting (machine learning)business.industrySpeech recognition020207 software engineeringHankel matrix02 engineering and technologyBoostingSoftwareSignal Processing0202 electrical engineering electronic engineering information engineeringFace processing020201 artificial intelligence & image processingEmotional expressionComputer Vision and Pattern RecognitionbusinessClassifier (UML)Hankel matrixSubspace topologySoftwareMathematics
researchProduct

Ensemble of Hankel Matrices for Face Emotion Recognition

2015

In this paper, a face emotion is considered as the result of the composition of multiple concurrent signals, each corresponding to the movements of a specific facial muscle. These concurrent signals are represented by means of a set of multi-scale appearance features that might be correlated with one or more concurrent signals. The extraction of these appearance features from a sequence of face images yields to a set of time series. This paper proposes to use the dynamics regulating each appearance feature time series to recognize among different face emotions. To this purpose, an ensemble of Hankel matrices corresponding to the extracted time series is used for emotion classification withi…

EmotionLTI systemSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniMajority ruleComputer sciencebusiness.industrySpeech recognitionEmotion classificationComputer Science (all)Hankel matrixPattern recognitionClassificationTheoretical Computer Sciencek-nearest neighbors algorithmSchema (psychology)Face processingArtificial intelligenceEmotion recognitionbusinessHankel matrix
researchProduct

Mismatches between objective parameters and measured perception assessment in room acoustics: a holistic approach

2014

Psychoacoustic research in the field of concert halls has revealed that many aspects concerning listening perception have yet to be totally understood. On the one hand, the objective room acoustics of performance spaces are reflected in parameters, some standardized and some not, but these are related to a limited number of perceptual attributes of human response. In general, these objective parameters cannot accurately describe the acoustic details due to their inherent simplification. Under these premises, impulse responses (576 receivers) are measured in 16 concert halls, according to standard procedures, and the perception and satisfaction of the occupants of the rooms are evaluated by …

EngineeringEnvironmental EngineeringSpeech recognitionmedia_common.quotation_subjectGeography Planning and DevelopmentPerceptive acoustic evaluationAcoustic qualityMachine learningcomputer.software_genreConcert-goers responsesField (computer science)CorrelationPerceptionActive listeningPsychoacousticsMultidimensional scalingConcert hallCivil and Structural Engineeringmedia_commonbusiness.industryBuilding and ConstructionRoom acousticsHierarchical clusteringFISICA APLICADAArtificial intelligencebusinessMATEMATICA APLICADAcomputerRoom acousticsMultidimensional scaling
researchProduct

A real time electromyostimulator linked with EMG analysis device

2013

International audience; In this study, a new system composed of two modules (electromyostimulation + electromyography recording) is presented. It can analyze in real time EMG signals during electromyostimulation. In addition, we propose a new method based on wavelet decomposition to analyze changes in M-wave characteristics. It leads to introduce a new index related to muscular fatigue.

EngineeringIndex (economics)medicine.diagnostic_testbusiness.industrySpeech recognition0206 medical engineeringBiomedical EngineeringBiophysics02 engineering and technologyElectromyography020601 biomedical engineering[SPI.TRON] Engineering Sciences [physics]/Electronics[ SPI.TRON ] Engineering Sciences [physics]/Electronics[SPI.TRON]Engineering Sciences [physics]/Electronics03 medical and health sciences0302 clinical medicineWavelet decompositionMuscular fatiguemedicinebusiness030217 neurology & neurosurgery
researchProduct

An open access database for the evaluation of heart sound algorithms

2016

In the past few decades, analysis of heart sound signals (i.e. the phonocardiogram or PCG), especially for automated heart sound segmentation and classification, has been widely studied and has been reported to have the potential value to detect pathology accurately in clinical applications. However, comparative analyses of algorithms in the literature have been hindered by the lack of high-quality, rigorously validated, and standardized open databases of heart sound recordings. This paper describes a public heart sound database, assembled for an international competition, the PhysioNet/Computing in Cardiology (CinC) Challenge 2016. The archive comprises nine different heart sound databases…

EngineeringResearch groupsDatabases FactualPhysiologySpeech recognition0206 medical engineeringphonocardiogram (PCG)Biomedical EngineeringBiophysicsMEDLINE02 engineering and technologycomputer.software_genreArticleheart soundAccess to InformationTECNOLOGIA ELECTRONICACoronary artery diseasePhysioNet/CinC Challenge[INFO.INFO-TS]Computer Science [cs]/Signal and Image ProcessingPhysiology (medical)heart sound classification0202 electrical engineering electronic engineering information engineeringmedicineHumansSegmentationHeart valveSound (geography)databasePhonocardiogramgeographygeography.geographical_feature_categoryDatabasebusiness.industryPhonocardiographySignal Processing Computer-Assistedmedicine.disease020601 biomedical engineeringHeart Soundsmedicine.anatomical_structureheart sound segmentationHeart sounds020201 artificial intelligence & image processingbusinessAlgorithmcomputerAlgorithms
researchProduct

Stereo to Wave-Field Synthesis music up-mixing: An objective and subjective evaluation

2008

Sound source separation techniques are known to be very useful in many applications. High fidelity and audio oriented applications are a challenging issue in this topic, however, existing algorithms are far from performing with such a high quality. In this paper, a subjective and objective evaluation are carried out for several algorithms designed for dealing with stereo music mixtures. The performance of these algorithms applied to acoustic scene resynthesis in a Wave Field Synthesis system is discussed.

EngineeringWave field synthesisbusiness.industrySpeech recognitionSound source separationmedia_common.quotation_subjectField (computer science)High fidelitySource separationQuality (business)Computer visionArtificial intelligenceObjective evaluationbusinessMixing (physics)media_common2008 3rd International Symposium on Communications, Control and Signal Processing
researchProduct

Does Affective Content of Sounds Affect Auditory Time-to-collision Estimation?

2021

EstimationTime to collisionApplied MathematicsSpeech recognitionContent (measure theory)PsychologyAffect (psychology)Auditory Perception & Cognition
researchProduct

Mathematical modeling and parameters estimation of a car crash using data-based regressive model approach

2011

Author's version of an article in the journal: Applied Mathematical Modelling. Also available from the publisher at: http://dx.doi.org/10.1016/j.apm.2011.04.024 n this paper we present the application of regressive models to simulation of car-to-pole impacts. Three models were investigated: RARMAX, ARMAX and AR. Their suitability to estimate physical system parameters as well as to reproduce car kinematics was examined. It was found out that they not only estimate the one quantity which was used for their creation (car acceleration) but also describe the car's acceleration, velocity and crush. A virtual experiment was performed to obtain another set of data for use in further research. An A…

Estimationregressive models parameters estimation viscoelastic modeling virtual experimentComputer sciencebusiness.industrySpeech recognitionApplied MathematicsVDP::Technology: 500::Mechanical engineering: 570CrashMachine learningcomputer.software_genreVDP::Mathematics and natural science: 400::Mathematics: 410Modeling and SimulationModelling and SimulationVirtual experimentArtificial intelligencebusinesscomputerApplied Mathematical Modelling
researchProduct

Effects of Global and Local Contexts on Harmonic Expectancy

1998

Several psycholinguistic studies have investigated the influence of local and global semantic contexts on word processing. The first aim of the present study was to examine local and global level contributions to harmonic priming. The second was to test a spreading-activation account of harmonic context effects (Bharucha, 1987). The expectations for the last chord (the target) of eight-chord sequences were varied by simultaneously manipulating the harmonic relationship of the target to the first six chords (global context) and to the seventh chord (local context). Human performances demonstrated that harmonic expectancies are derived from both the global and local levels of musical structur…

Expectancy theoryConnectionismContext effectComputer scienceSpeech recognitionWord processingChord (music)SchematicMusicMusical formCognitive psychologyMusic Perception
researchProduct

Using Hankel matrices for dynamics-based facial emotion recognition and pain detection

2015

This paper proposes a new approach to model the temporal dynamics of a sequence of facial expressions. To this purpose, a sequence of Face Image Descriptors (FID) is regarded as the output of a Linear Time Invariant (LTI) system. The temporal dynamics of such sequence of descriptors are represented by means of a Hankel matrix. The paper presents different strategies to compute dynamics-based representation of a sequence of FID, and reports classification accuracy values of the proposed representations within different standard classification frameworks. The representations have been validated in two very challenging application domains: emotion recognition and pain detection. Experiments on…

FOS: Computer and information sciencesComputer Science - Artificial IntelligenceComputer Vision and Pattern Recognition (cs.CV)Speech recognitionFeature extractionComputer Science - Computer Vision and Pattern RecognitionPainLTI system theoryComputer Science - RoboticsLinear time invariant systemRepresentation (mathematics)Hidden Markov modelMathematicsEmotionSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniSequencebusiness.industryPattern recognitiondynamicsClassificationSupport vector machineArtificial Intelligence (cs.AI)Face (geometry)Artificial intelligencebusinessRobotics (cs.RO)Hankel matrix2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
researchProduct