6533b7d8fe1ef96bd126b884

RESEARCH PRODUCT

A probabilistic approach to learning a visually grounded language model through human-robot interaction

Haris DindoDaniele Zambuto

subject

Robotics Machine Learning Human-Robot InteractionComputer sciencebusiness.industryProbabilistic logicLanguage acquisitionSemanticscomputer.software_genreHuman–robot interactionHuman–computer interactionArtificial intelligenceLanguage modelSet (psychology)Hidden Markov modelbusinesscomputerMotor skillHumanoid robotNatural language processingNatural language

description

A Language is among the most fascinating and complex cognitive activities that develops rapidly since the early months of infants' life. The aim of the present work is to provide a humanoid robot with cognitive, perceptual and motor skills fundamental for the acquisition of a rudimentary form of language. We present a novel probabilistic model, inspired by the findings in cognitive sciences, able to associate spoken words with their perceptually grounded meanings. The main focus is set on acquiring the meaning of various perceptual categories (e. g. red, blue, circle, above, etc.), rather than specific world entities (e. g. an apple, a toy, etc.). Our probabilistic model is based on a variant of multi-instance learning technique, and it enables a robotic platform to learn grounded meanings of adjective/noun terms. The systems could be used to understand and generate appropriate natural language descriptions of real objects in a scene, and it has been successfully tested on the NAO humanoid robotic platform.

https://doi.org/10.1109/iros.2010.5654440