Search results for " Mach"
showing 10 items of 1388 documents
Recognition of Human Actions Through Deep Neural Networks for Multimedia Systems Interaction
2019
Nowadays, interactive multimedia systems are part of everyday life. The most common way to interact and control these devices is through remote controls or some sort of touch panel. In recent years, due to the introduction of reliable low-cost Kinect-like sensing technology, more and more attention has been dedicated to touchless interfaces. A Kinect-like devices can be positioned on top of a multimedia system, detect a person in front of the system and process skeletal data, optionally with RGBd data, to determine user gestures. The gestures of the person can then be used to control, for example, a media device. Even though there is a lot of interest in this area, currently, no consumer sy…
Human Activity Recognition Process Using 3-D Posture Data
2015
In this paper, we present a method for recognizing human activities using information sensed by an RGB-D camera, namely the Microsoft Kinect. Our approach is based on the estimation of some relevant joints of the human body by means of the Kinect; three different machine learning techniques, i.e., K-means clustering, support vector machines, and hidden Markov models, are combined to detect the postures involved while performing an activity, to classify them, and to model each activity as a spatiotemporal evolution of known postures. Experiments were performed on Kinect Activity Recognition Dataset, a new dataset, and on CAD-60, a public dataset. Experimental results show that our solution o…
Super-resolution-based magnification of endothelium cells from biomicroscope videos of the cornea
2018
We present a practical, robust, and effective pipeline to compute a high-resolution (HR) image of the corneal endothelium starting from a low-resolution (LR) video sequence obtained with a general purpose slit lamp biomicroscope. An image quality typical of dedicated and more expensive confocal microscopes is achieved via software magnification by exploiting information redundancy in the video sequence. In particular, the HR image is generated from the best LR frames, obtained by identifying the most suitable endothelium video subsequence using a support vector machine-based learning approach, followed by a robust graph-based frame registration. Results on long, real sequences show that the…
Bayesian Network Based Classification of Mammography Structured Reports
2013
In modern medical domain, documents are created directly in electronic form and stored on huge databases containing documents, text in integral form and images. Retrieving right informations from these servers is challenging and, sometimes, this is very time consuming. Current medical technology do not provide a smart methodology classification of such documents based on their content. In this work the radiological structured reports are analysed classified and assigning an appropriate label. The text classifier is used to label a mammographic structured report. The experimental data are real clinical report coming from a hospital server. Analysing the structured report content, the classif…
Software Design of an AGI System Based on Perception Loop
2010
According to the externalist approach, subjective experience hypothesizes a processual unity between the activity in the brain and the perceived event in the external world. A perception loop therefore occurs among the brain's activitie8 and the external world. In our work the metaphor of test is employed to create a software de8ign methodology for implementing an AGI system endowed with the perception loop. Preliminary experiments with a NAO humanoid robots are reported.
A Multimodal People Recognition System for an Intelligent Environment
2011
In this paper, a multimodal system for recognizing people in intelligent environments is presented. Users are identified and tracked by detecting and recognizing voices and faces through cameras and microphones spread around the environment. This multimodal approach has been chosen to develop a flexible and cheap though reliable system, implemented through consumer electronics. Voice features are extracted through a short time spectrum analysis, while face features are extracted using the eigenfaces technique. The recognition task is achieved through the use of some Support Vector Machines, one per modality, that learn and classify the features of each person, while bindings between modalit…
McRock at SemEval-2022 Task 4: Patronizing and Condescending Language Detection using Multi-Channel CNN, Hybrid LSTM, DistilBERT and XLNet
2022
In this paper we propose four deep learning models for the task of detecting and classifying Patronizing and Condescending Language (PCL) using a corpus of over 13,000 annotated paragraphs in English. The task, hosted at SemEval-2022, consists of two different subtasks. The Subtask 1 is a binary classification problem. Namely, given a paragraph, a system must predict whether or not it contains any form of PCL. The Subtask 2 is a multi-label classification task. Given a paragraph, a system must identify which PCL categories express the condescension. A paragraph might contain one or more categories of PCL. To face with the first subtask we propose a multi-channel Convolutional Neural Network…
A machine learning approach for user localization exploiting connectivity data
2016
The growing popularity of Location-Based Services (LBSs) has boosted research on cheaper and more pervasive localization systems, typically relying on such monitoring equipment as Wireless Sensor Networks (WSNs), which allow to re-use the same instrumentation both for monitoring and for localization without requiring lengthy off-line training. This work addresses the localization problem, exploiting knowledge acquired in sample environments, and extensible to areas not considered in advance. Localization is turned into a learning problem, solved by a statistical algorithm. Additionally, parameter tuning is fully automated thanks to its formulation as an optimization problem based only on co…
An Explorable Immersive Panorama
2012
The immersive panoramas are widely used to provide virtual tours of real scene. Their use covers a wide field of applications: art, industry, space research, topography, forensic investigation and all those systems requiring the exploration of a virtual environment which simulates a real one. Often sophisticated devices are used to perform the panorama acquisition. In this paper, we present an image based immersive panorama requiring low cost devices for the acquisition task and provides an innovative human-computer interaction approach. Many panoramic images of the same location are captured. The visualization system changes the panorama in a transparent way with respect to the user intera…
On Representing Concepts in High-dimensional Linear Spaces
2017
Producing a mathematical model of concepts is a very important issue in artificial intelligence, because if such a model were found this, besides being a very interesting result in its own right, would also contribute to the emergence of what we could call the ‘mathematics of thought.’ One of the most interesting attempts made in this direction is P. Gardenfors’ theory of conceptual spaces, a ¨ theory which is mostly presented by its author in an informal way. The main aim of the present article is contributing to Gardenfors’ theory of conceptual spaces ¨ by discussing some of the advantages which derive from the possibility of representing concepts in high-dimensional linear spaces.