Learning high-level tasks through imitation
This paper presents the cognitive architecture Con-SCIS (Conceptual Space based Cognitive Imitation System), which tightly links low-level data processing with knowledge representation in the context of imitation learning. We use the word imitate to refer to the paradigm of program-level imitation: we are interested in the final effects of actions on objects, and not on the particular kinematic or dynamic properties of the motion. The same architecture is used both to analyze and represent the task to be imitated, and to perform the imitation by generalizing in novel and different circumstances. The implemented experimental scenario is a simplified two-dimensional world populated with vario…
Sign Languages Recognition Based on Neural Network Architecture
In the last years, many steps forward have been made in speech and natural languages recognition and were developed many virtual assistants such as Apple’s Siri, Google Now and Microsoft Cortana. Unfortunately, not everyone can use voice to communicate to other people and digital devices. Our system is a first step for extending the possibility of using virtual assistants to speech impaired people by providing an artificial sign languages recognition based on neural network architecture.
Social robots and therapeutic adherence: a new challenge in pediatric asthma?
Social Robots are used in different contexts and, in healthcare, they are better known as Socially Assistive Robots. In the context of asthma, the use of Socially Assistive Robots has the potential to increase motivation and engagement to treatment. Other positive roles proposed for Socially Assistive Robots are to provide education, training regarding treatments, and feedback to patients. This review evaluates emerging interventions for improving treatment adherence in pediatric asthma, focusing on the possible future role of social robots in the clinical practice.
A System for Simultaneous People Tracking and Posture Recognition in the context of Human-Computer Interaction
The paper deals with an artificial-vision based system for simultaneous people tracking and posture recognition In the context of human-computer Interaction. We adopt no particular assumptions on the movement of a person and on Its appearance, making the system suitable to several real-world applications. The system can be roughly subdivided Into two highly correlated phases: tracking and recognition. The tracking phase Is concerned with establishing coherent relations of the same subject between frames. We adopted the Condensation algorithm due to Its robustness In highly cluttered environments. The recognition phase adopts a modified elgenspace technique In order to classify between sever…
Fast Volumetric Reconstruction of Human Body through Superquadrics
This paper describes a technique to reconstruct the volumes of the human body. For this purpose, are introduced mathematical objects able to represent 3d shapes, called super quadrics. These objects are positioned in the space according the captures made by a Microsoft Kinect device and are composed to represent the volumes of the human body. The employment of quaternions provides a relevant speedup for the rotation of the volumes and allows to follow the human movements in real time and reduced computational cost.
A framework for sign language sentence recognition by common sense context
This correspondence proposes a complete framework for sign language recognition that integrates a commonsense engine in order to deal with sentence recognition. The proposed system is based on a multilevel architecture that allows modeling and managing of the knowledge of the recognition process in a simple and robust way. The final abstraction level of this architecture introduces the semantic context and the analysis of the correctness of a sentence given in a sequence of recognized signs. Experimentations are presented using a set of signs from the Italian sign language (LIS) for domotic applications. The implemented system maintains a high recognition rate when the set of signs grows, c…
An automatic system for humanoid dance creation
Abstract The paper describes a novel approach to allow a robot to dance following musical rhythm. The proposed system generates a dance for a humanoid robot through the combination of basic movements synchronized with the music. The system made up of three parts: the extraction of features from audio file, estimation of movements through the Hidden Markov Models and, finally, the generation of dance. Starting from a set of given movements, the robot choices sequence of movements a suitable Hidden Markov Model, and synchronize them processing musical input. The proposed approach has the advantage that movement execution probabilities could be changed according evaluation of the dance executi…
A vision system for symbolic interpretation of dynamic scenes using arsom
We describe an artificial high-level vision system for the symbolic interpretation of data coming from a video camera that acquires the image sequences of moving scenes. The system is based on ARSOM neural networks that learn to generate the perception-grounded predicates obtained by image sequences. The ARSOM neural networks also provide a three-dimensional estimation of the movements of the relevant objects in the scene. The vision system has been employed in two scenarios: the monitoring of a robotic arm suitable for space operations, and the surveillance of an electronic data processing (EDP) center.
Architectural Scenes Reconstruction from Uncalibrated Photos and Map Based Model Knowledge
In this paper we consider the problem of reconstructing architectural scenes from multiple photographs taken from arbitrarily viewpoints. The original contribution of this work is the use of a map as a source of a priori knowledge and geometric constraints in order to obtain in a fast and simple way a detailed model of a scene. We suppose images are uncalibrated and have at least one planar structure as a facade for exploiting the planar homography induced between world plane and image to calculate a first estimation of the projection matrix. Estimations are improved by using correspondences between images and map. We show how these simple constraints can be used to calibrate the cameras, t…
Creation and cognition for humanoid live dancing
Abstract Computational creativity in dancing is a recent and challenging research field in Artificial Intelligence and Robotics. We present a cognitive architecture embodied in a humanoid robot capable to create and perform dances driven by the perception of music. The humanoid robot is able to suitably move, to react to human mate dancers and to generate novel and appropriate sequences of movements. The approach is based on a cognitive architecture that integrates Hidden Markov Models and Genetic Algorithms. The system has been implemented on a NAO robot and tested in public setting-up live performances, obtaining positive feedbacks from the audience.
Visual Control of a Robotic Hand
The paper deals with the design and implementation of a visual control of a robotic system composed of a dexterous hand and stereo cameras. The aim of the proposed system is to reproduce the movements of a human hand in order to learn complex manipulation tasks. A novelty algorithm for a robust and fast fingertips localization and tracking is presented. Moreover, a simulator is integrated in the system to give useful feedbacks to the users during operations, and provide robust testing framework for real experiments (see video).
A cooperating strategy for objects recognition
The paper describes an object recognition system, based on the co-operation of several visual modules (early vision, object detector, and object recognizer). The system is active because the behavior of each module is tuned on the results given by other modules and by the internal models. This solution allows to detect inconsistencies and to generate a feedback process. The proposed strategy has shown good performance especially in case of complex scene analysis, and it has been included in the visual system of the DAISY robotics system. Experimental results on real data are also reported.
A cognitive approach to goal-level imitation
Imitation in robotics is seen as a powerful means to reduce the complexity of robot programming. It allows users to instruct robots by simply showing them how to execute a given task. Through imitation robots can learn from their environment and adapt to it just as human newborns do. Despite different facets of imitative behaviours observed in humans and higher primates, imitation in robotics has usually been implemented as a process of copying demonstrated actions onto the movement apparatus of the robot. While the results being reached are impressive, we believe that a shift towards a higher expression of imitation, namely the comprehension of human actions and inference of its intentions…
Anchoring by Imitation Learning in Conceptual Spaces
In order to have a robotic system able to effectively learn by imitation, and not merely reproduce the movements of a human teacher, the system should have the capabilities of deeply understanding the perceived actions to be imitated. This paper deals with the development of a cognitive architecture for learning by imitation in which a rich conceptual representation of the observed actions is built. The purpose of the following discussion is to show how the same conceptual representation can be used both in a bottom-up approach, in order to learn sequences of actions by imitation learning paradigm, and in a top-down approach, in order to anchor the symbolical representations to the perceptu…
Imitation Learning and Anchoring through Conceptual Spaces
In order to have a robotic system able to effectively learn by imitation and not merely reproduce the movements of a human teacher, the system should have the capability to deeply understand the perceived actions to be imitated. This paper deals with the development of a cognitive architecture for learning by imitation in which a rich conceptual representation of the observed actions is built. The purpose of the following discussion is to show how the same conceptual representation can be used both in a bottom-up approach, in order to learn sequences of actions by imitation learning paradigm, and in a top-down approach, in order to anchor the symbolical representations to the perceptual act…
Learning high-level manipulative tasks through imitation
This paper presents ConSCIS, Conceptual Space based Cognitive Imitation System, which tightly links low-level data processing with knowledge representation in the context of robot imitation. Our focus is on the program-level imitation: we are interested in the final effects of actions on objects, and not on the particular kinematic or dynamic properties of the motion. The same architecture is used both to analyze and represent the task to be imitated, and to perform the imitation by generalizing in novel and different circumstances. The implemented experimental scenario is a two dimensional world populated with various objects in which observation/imitation takes place. To validate our appr…
An Artificial Soft Somatosensory System for a Cognitive Robot
The paper proposes an artificial somatosensory system loosely inspired by human beings' biology and embedded in a cognitive architecture (CA). It enables a robot to receive the stimulation from its embodiment, and use these sensations, we called roboceptions, to behave according to both the external environment and the internal robot status. In such a way, the robot is aware of its body and able to interpret physical sensations can be more effective in the task while maintaining its well being. The robot's physiological urges are tightly bound to the specific physical state of the robot. Positive and negative physical information can, therefore, be processed and let the robot behave in a mo…
A system for sign language sentence recognition based on common sense context
The paper proposes a complete framework for sign language recognition that integrates common sense in order to deal with sentences. The proposed system is based on a cognitive architecture allows modeling and managing the knowledge of the recognition process in a simple and robust way. The final abstraction level of this architecture introduces the semantic context and the analysis of the correctness of a sentence given a sequence of recognized signs. Experimentations are presented using the Italian sign language (LIS), and shows that the system maintains the recognition rate high when set of sign grows, correcting erroneous recognized single sign using the context
Representation, Recognition and Generation of Actions in the Context of Imitation Learning
The paper deals with the development of a cognitive architecture for learning by imitation in which a rich conceptual representation of the observed actions is built. We adopt the paradigm of conceptual spaces, in which static and dynamic entities are employed to efficiently organize perceptual data, to recognize positional relations, to learn movements from human demonstration and to generate complex actions by combining and sequencing simpler ones. The aim is to have a robotic system able to effectively learn by imitation and which has the capabilities of deeply understanding the perceived actions to be imitated. Experimentation has been performed on a robotic system composed of a PUMA 20…
Artificial Pleasure and Pain Antagonism Mechanism in a Social Robot
The goal of the work is to build some Python modules that allow the Nao robot to emulate a somatosensorial system similar to the human one. Assuming it can perceive some feelings similar to the ones recognized by the human system, it will be possible to make it react appropriately to the external stimuli. The idea is to have a group of software sensors working simultaneously, providing some feedback to show how the robot is feeling at a particular time. It will be able to feel articular pain and stress, to perceive people in his surroundings (and in a future work to react according to the knowledge of them with face recognition), feel pleasure by recognizing caresses on his head and respond…
Image Segmentation through a Hierarchy of Minimum Spanning Trees
Many approaches have been adopted to solve the problem of image segmentation. Among them a noticeable part is based on graph theory casting the pixels as nodes in a graph. This paper proposes an algorithm to select clusters in the images (corresponding to relevant segments in the image) corresponding to the areas induced in the images through the search of the Minimum Spanning Tree (MST). In particular is is based on a clustering algorithm that extracts clusters computing a hierarchy of Minimum Spanning Trees. The main drawback of this previous algorithm is that the dimension of the cluster is not predictable and a relevant portion of found clusters can be composed by micro-clusters that ar…
Composition of SIFT features for robust image representation
In this paper we propose a novel feature based on SIFT (Scale Invariant Feature Transform) algorithm1 for the robust representation of local visual contents. SIFT features have raised much interest for their power of description of visual content characterizing punctual information against variation of luminance and change of viewpoint and they are very useful to capture local information. For a single image hundreds of keypoints are found and they are particularly suitable for tasks dealing with image registration or image matching. In this work we stretched the spatial coverage of descriptors creating a novel feature as composition of keypoints present in an image region while maintaining…
A Cognitive Framework for Imitation Learning
Abstract In order to have a robotic system able to effectively learn by imitation, and not merely reproduce the movements of a human teacher, the system should have the capabilities of deeply understanding the perceived actions to be imitated. This paper deals with the development of cognitive architecture for learning by imitation in which a rich conceptual representation of the observed actions is built. The purpose of the following discussion is to show how this Conceptual Area can be employed to efficiently organize perceptual data, to learn movement primitives from human demonstration and to generate complex actions by combining and sequencing simpler ones. The proposed architecture ha…
Roboception and adaptation in a cognitive robot
In robotics, perception is usually oriented at understanding what is happening in the external world, while few works pay attention to what is occurring in the robot’s body. In this work, we propose an artificial somatosensory system, embedded in a cognitive architecture, that enables a robot to perceive the sensations from its embodiment while executing a task. We called these perceptions roboceptions, and they let the robot act according to its own physical needs in addition to the task demands. Physical information is processed by the robot to behave in a balanced way, determining the most appropriate trade-off between the achievement of the task and its well being. The experiments show …
Experiences with CiceRobot, a Museum Guide Cognitive Robot
The paper describes CiceRobot, a robot based on a cognitive architecture for robot vision and action. The aim of the architecture is to integrate visual perception and actions with knowledge representation, in order to let the robot to generate a deep inner understanding of its environment. The principled integration of perception, action and of symbolic knowledge is based on the introduction of an intermediate representation based on Gardenfors conceptual spaces. The architecture has been tested on a RWI B21 autonomous robot on tasks related with guided tours in the Archaeological Museum of Agrigento. Experimental results are presented.
A Rapid and Efficient Method for Determination of Fruit Peel Color
Peel color is a critical index of external fruit quality and consumer appreciation level. Traditional methods for determination of peel color are based on visual analysis or punctual measurements by colorimeter. In this study we present a method based on digital image analysis that integrates the accuracy of an interactive measurement and the efficacy of an image analysis that descibes entire sides of the fruit. A sample of apple, mandarin, grape, and peach fruit was photographed (each fruit on two opposite sides) with a digital camera for determination of peel color. Digital images were converted from RGB to CIE L*a*b* format, and color characteristics were indexed and quantified. The impl…
Recognition of Human Actions Through Deep Neural Networks for Multimedia Systems Interaction
Nowadays, interactive multimedia systems are part of everyday life. The most common way to interact and control these devices is through remote controls or some sort of touch panel. In recent years, due to the introduction of reliable low-cost Kinect-like sensing technology, more and more attention has been dedicated to touchless interfaces. A Kinect-like devices can be positioned on top of a multimedia system, detect a person in front of the system and process skeletal data, optionally with RGBd data, to determine user gestures. The gestures of the person can then be used to control, for example, a media device. Even though there is a lot of interest in this area, currently, no consumer sy…
A Cognitive Architecture for Robotic Hand Posture Learning
This paper deals with the design and implementation of a visual control of a robotic system composed of a dexterous hand and video camera. The aim of the proposed system is to reproduce the movements of a human hand in order to learn complex manipulation tasks or to interact with the user. A novel algorithm for robust and fast fingertips localization and tracking is presented. A suitable kinematic hand model is adopted to achieve a fast and acceptable solution to an inverse kinematics problem. The system is part of a cognitive architecture for posture learning that integrates the perceptions by a high-level representation of the scene and of the observed actions. The anthropomorphic robotic…
A Posture Sequence Learning System for an Anthropomorphic Robotic Hand
The paper presents a cognitive architecture for posture learning of an anthropomorphic robotic hand. Our approach is aimed to allow the robotic system to perform complex perceptual operations, to interact with an human user and to integrate the perceptions by a cognitive representation of the scene and the observed actions. The anthropomorphic robotic hand imitates the gestures acquired by the vision system in order to learn meaningful movements, to build its knowledge by different conceptual spaces and to perform complex interaction with the human operator.