Search results for "speech"

showing 10 items of 1281 documents

Words and Patterns

2002

In this paper some new ideas, problems and results on patterns are proposed. In particular, motivated by questions concerning avoidability, we first study the set of binary patterns that can occur in one infinite binary word, comparing it with the set of factors of the word. This suggests a classification of infinite words in terms of the "difference" between the set of its patterns and the set of its factors. The fact that each factor in an infinite word can give rise to several distinct patterns leads us to study the set of patterns of a single finite word. This set, endowed with a natural order relation, defines a poset: we investigate the relationships between the structure of such a poset…

Set (abstract data type); Discrete mathematics; Structure (mathematical logic); Regular language; Relation (database); Binary number; Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing); Natural order; Partially ordered set; Computer Science::Formal Languages and Automata Theory; Word (computer architecture); Mathematics

researchProduct

Modeling musical attributes to characterize ensemble recordings using rhythmic audio features

2011

In this paper, we present the results of a pre-study on music performance analysis of ensemble music. Our aim is to implement a music classification system for the description of live recordings, for instance to help musicologists and musicians to analyze improvised ensemble performances. The main problem we deal with is the extraction of a suitable set of audio features from the recorded instrument tracks. Our approach is to extract rhythm-related audio features and to apply them for regression-based modeling of eight more general musical attributes. The model based on Partial Least-Squares Regression without preceding Principal Component Analysis performed best for all of the eight attribu…

Set (abstract data type); Sound recording and reproduction; Musicology; Computer science; Speech recognition; Feature extraction; Musical; Audio signal processing; computer.software_genre; computer; 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

researchProduct

A Medium Level Language for Pyramid Architectures

1989

In this paper, a Parallel C Language for pyramid architectures is described. The concept of context is introduced in order to handle concurrency between processes in massively parallel machines. Features of the implementation on the PAPIA machine are given.

Settore INF/01 - Informatica; Computer science; Speech recognition; Concurrency; Pyramid; Feature (machine learning); Concurrence; Context (language use); Parallel computing; Parallel languages Concurrency Image Analysis Pyramids.

researchProduct

A calculus for robot inner speech and self-awareness

2019

Inner speech is the common mental experience humans have when they talk to themselves. It is widely acknowledged that inner speech is related to awareness and self-awareness. Inner speech reproduces and expands, in the mind, social and physical sources of awareness. In this preliminary work, a calculus based on a first-order modal logic to automate inner speech is presented. It attempts to make existing inner speech theories suitable for robots. By enabling a robot to talk to itself, it becomes possible to analyze the role of inner speech in robot awareness and self-awareness, opening new research scenarios not yet investigated.

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; Artificial intelligence; Speech theory; Robot; Modal logic; Speech; Self-awarene

researchProduct

Efficient FPGA Implementation of a Knowledge-Based Automatic Speech Classifier

2005

Speech recognition has become common in many application domains, from dictation systems for professional practices to vocal user interfaces for people with disabilities or hands-free system control. However, so far the performance of Automatic Speech Recognition (ASR) systems is comparable to that of Human Speech Recognition (HSR) only under very strict working conditions, and is in general far lower. Incorporating acoustic-phonetic knowledge into ASR design has been proven a viable approach to raise ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as dete…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; Artificial neural network; Dictation; Computer science; business.industry; Speech recognition; Field programmable gate arrays (FPGA); artificial neural; Perceptron; Manner of articulation; Knowledge base; User interface; business; Field-programmable gate array; Classifier (UML); Neural networks

researchProduct

Application of EαNets to Feature Recognition of Articulation Manner in Knowledge-Based Automatic Speech Recognition

2006

Speech recognition has become common in many application domains. Incorporating acoustic-phonetic knowledge into Automatic Speech Recognition (ASR) systems design has been proven a viable approach to raise ASR accuracy. Manner of articulation attributes such as vowel, stop, fricative, approximant, nasal, and silence are examples of such knowledge. Neural networks have already been used successfully as detectors for manner of articulation attributes starting from representations of speech signal frames. In this paper, a set of six detectors for the above-mentioned attributes is designed based on the E-αNet model of neural networks. This model was chosen for its capability to learn hidden acti…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; Artificial neural network; Generalization; Computer science; Speech recognition; SIGNAL (programming language); cognitive architecture; Feature recognition; neural networks speech recognition; Anthropomorphic robots; Manner of articulation; Systems design; Set (psychology); Articulation (phonetics); Robots

researchProduct

An Emotional Talking Head for a Humoristic Chatbot

2011

Interest in enhancing the interface usability of applications and entertainment platforms has increased in recent years. Research in human-computer interaction on conversational agents, also called chatbots, and natural language dialogue systems equipped with audio-video interfaces has grown as well. One of the most pursued goals is to enhance the realism of interaction with such systems. For this reason they are provided with catchy interfaces using humanlike avatars capable of adapting their behavior according to the conversation content. Agents of this kind can vocally interact with users by using Automatic Speech Recognition (ASR) and Text To Speech (TTS) systems; besides they can c…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; Computer science; talking head chatbot emotional AIML; Speech synthesis; Animation; computer.software_genre; Cyberware; ASCII; Chatbot; Intelligent agent; Human–computer interaction; Graphics; computer; Natural language; ComputingMethodologies_COMPUTERGRAPHICS

researchProduct

Robot’s Inner Speech Effects on Human Trust and Anthropomorphism

2023

Inner Speech is an essential but also elusive human psychological process that refers to an everyday covert internal conversation with oneself. We argued that programming a robot with an overt self-talk system that simulates human inner speech could enhance both human trust and users’ perception of the robot’s anthropomorphism, animacy, likeability, intelligence and safety. For this reason, we planned a pre-test/post-test control group design. Participants were divided into two different groups, one experimental group and one control group. Participants in the experimental group interacted with the robot Pepper equipped with an overt inner speech system whereas participants in the control …

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; General Computer Science; Social Psychology; Robot; Inner speech; Anthropomorphism; Trust; Human–robot interaction; Human-Computer Interaction; Philosophy; Settore M-PSI/04 - Psicologia Dello Sviluppo E Psicologia Dell'Educazione; Control and Systems Engineering; Electrical and Electronic Engineering; Self-talk

researchProduct

Exploiting Correlation between Body Gestures and Spoken Sentences for Real-time Emotion Recognition

2017

Humans communicate their affective states through different media, both verbal and non-verbal, often used at the same time. The knowledge of the emotional state plays a key role in providing personalized and context-related information and services. This is the main reason why several algorithms have been proposed in the last few years for automatic emotion recognition. In this work we exploit the correlation between one's affective state and the simultaneous body expressions in terms of speech and gestures. Here we propose a system for real-time emotion recognition from gestures. In a first step, the system builds a trusted dataset of association pairs (motion data -> emotion pattern), a…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; Ground truth; Settore INF/01 - Informatica; Exploit; K-nearest neighbor; business.industry; Speech recognition; computer.software_genre; Motion (physics); Correlation; Dynamic Time Warping Emotion Recognition K-nearest neighbor; Emotion Recognition; Key (cryptography); Artificial intelligence; State (computer science); business; Association (psychology); Psychology; computer; Natural language processing; Gesture; Dynamic Time Warping

researchProduct

InspirationWall

2015

Collaborative idea generation leverages social interactions and knowledge sharing to spark diverse associations and produce creative ideas. Information exploration systems expand the current context by suggesting novel but related concepts. In this paper we introduce InspirationWall, an unobtrusive display that leverages speech recognition and information exploration to enhance an ongoing idea generation session with automatically retrieved concepts that relate to the conversation. We evaluated the system in six 20-minute idea generation sessions with small groups of two people. Preliminary results suggest that InspirationWall counteracts the decay of idea productivity over time and can t…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; Information Exploration; Settore INF/01 - Informatica; Computer science; media_common.quotation_subject; Context (language use); Automatic Speech Recognition; Ideation; Idea generation; Session (web analytics); Knowledge sharing; SPARK (programming language); Human–computer interaction; Conversation; Information exploration; computer; computer.programming_language; media_common; Proceedings of the 2015 ACM SIGCHI Conference on Creativity and Cognition

researchProduct