Search results for "feature"

showing 10 items of 4091 documents

PerceptNet: A Human Visual System Inspired Neural Network for Estimating Perceptual Distance

2019

Traditionally, the vision community has devised algorithms to estimate the distance between an original image and images that have been subject to perturbations. Inspiration was usually taken from the human visual perceptual system and how the system processes different perturbations in order to replicate to what extent it determines our ability to judge image quality. While recent works have presented deep neural networks trained to predict human perceptual quality, very few borrow any intuitions from the human visual system. To address this, we present PerceptNet, a convolutional neural network where the architecture has been chosen to reflect the structure and various stages in the human…

FOS: Computer and information sciences; Computer Science - Machine Learning; Visual perception; Computer science; Image quality; media_common.quotation_subject; Feature extraction; Machine Learning (stat.ML); 02 engineering and technology; 01 natural sciences; Convolutional neural network; human visual system; Machine Learning (cs.LG); 010309 optics; Statistics - Machine Learning; Perception; 0103 physical sciences; 0202 electrical engineering electronic engineering information engineering; FOS: Electrical engineering electronic engineering information engineering; perceptual distance; media_common; Artificial neural network; business.industry; Deep learning; Image and Video Processing (eess.IV); Pattern recognition; Electrical Engineering and Systems Science - Image and Video Processing; neural networks; Human visual system model; 020201 artificial intelligence & image processing; Artificial intelligence; business
researchProduct

Deep Generative Model-Driven Multimodal Prostate Segmentation in Radiotherapy

2019

Deep learning has shown unprecedented success in a variety of applications, such as computer vision and medical image analysis. However, there is still potential to improve segmentation in multimodal images by embedding prior knowledge via learning-based shape modeling and registration to learn the modality-invariant anatomical structure of organs. For example, in radiotherapy, automatic prostate segmentation from T2-weighted MR or CT images is essential in prostate cancer diagnosis, therapy, and post-therapy assessment. In this paper, we present a fully automatic deep generative model-driven multimodal prostate segmentation method using a convolutional neural network (DGMNet). The novelty of …

FOS: Computer and information sciences; Computer science; Computer Vision and Pattern Recognition (cs.CV); medicine.medical_treatment; Prostate segmentation; Feature extraction; Computer Science - Computer Vision and Pattern Recognition; ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION; Convolutional neural network; [SDV.IB.MN]Life Sciences [q-bio]/Bioengineering/Nuclear medicine; Convolutional neural network; 030218 nuclear medicine & medical imaging; 03 medical and health sciences; Prostate cancer; 0302 clinical medicine; FOS: Electrical engineering electronic engineering information engineering; medicine; Segmentation; Artificial neural network; business.industry; Deep learning; Image and Video Processing (eess.IV); Novelty; Deep learning; Pattern recognition; Electrical Engineering and Systems Science - Image and Video Processing; medicine.disease; Transfer learning; 3. Good health; Radiation therapy; Generative model; 030220 oncology & carcinogenesis; Embedding; Artificial intelligence; business; CT; MRI
researchProduct

Learning Structures in Earth Observation Data with Gaussian Processes

2020

Gaussian Processes (GPs) have experienced tremendous success in geoscience in general and in bio-geophysical parameter retrieval in recent years. GPs constitute a solid Bayesian framework to formulate many function approximation problems consistently. This paper reviews the main theoretical GP developments in the field. We review new algorithms that respect the signal and noise characteristics, that provide feature rankings automatically, and that allow applicability of associated uncertainty intervals to transport GP models in space and time. All these developments are illustrated in the field of geoscience and remote sensing at local and global scales through a set of illustrative exa…

FOS: Computer and information sciences; Earth observation; 010504 meteorology & atmospheric sciences; Computer science; 0211 other engineering and technologies; FOS: Physical sciences; Machine Learning (stat.ML); 02 engineering and technology; Applied Physics (physics.app-ph); computer.software_genre; 01 natural sciences; Field (computer science); Physics::Geophysics; Set (abstract data type); Physics - Geophysics; symbols.namesake; Statistics - Machine Learning; Feature (machine learning); Gaussian process; 021101 geological & geomatics engineering; 0105 earth and related environmental sciences; business.industry; Physics - Applied Physics; Geophysics (physics.geo-ph); Function approximation; symbols; Global Positioning System; Noise (video); Data mining; business; computer
researchProduct

At Your Service: Coffee Beans Recommendation From a Robot Assistant

2020

With advances in the field of machine learning, specifically algorithms for recommendation systems, robot assistants are envisioned to become more present in the hospitality industry. Additionally, the COVID-19 pandemic has highlighted the need to have more service robots in our everyday lives, to minimise the risk of human-to-human transmission. One such example would be coffee shops, which have become intrinsic to our everyday lives. However, serving an excellent cup of coffee is not a trivial feat, as a coffee blend typically comprises rich aromas, indulgent and unique flavours, and a lingering aftertaste. Our work addresses this by proposing a computational model which recommends optima…

FOS: Computer and information sciences; Service (systems architecture); business.industry; Computer science; Feature vector; Supervised learning; Computer Science - Human-Computer Interaction; ComputingMilieux_PERSONALCOMPUTING; 02 engineering and technology; Recommender system; Machine learning; computer.software_genre; Field (computer science); GeneralLiterature_MISCELLANEOUS; Computer Science - Information Retrieval; Personalization; Human-Computer Interaction (cs.HC); 0202 electrical engineering electronic engineering information engineering; Robot; Unsupervised learning; 020201 artificial intelligence & image processing; Artificial intelligence; business; computer; Information Retrieval (cs.IR)
researchProduct

Acoustic Scene Classification with Squeeze-Excitation Residual Networks

2020

Acoustic scene classification (ASC) is a problem in the field of machine listening whose objective is to classify/tag an audio clip with a predefined label describing a scene location (e.g., park, airport, etc.). Many state-of-the-art solutions to ASC incorporate data augmentation techniques and model ensembles. However, considerable improvements can also be achieved solely by modifying the architecture of convolutional neural networks (CNNs). In this work we propose two novel squeeze-excitation blocks to improve the accuracy of a CNN-based ASC framework based on residual learning. The main idea of squeeze-excitation blocks is to learn spatial and channel-wise feature maps independently…

FOS: Computer and information sciences; Sound (cs.SD); Computer Science - Machine Learning; General Computer Science; Calibration (statistics); Computer science; Residual; Convolutional neural network; Field (computer science); Computer Science - Sound; Machine Learning (cs.LG); 030507 speech-language pathology & audiology; 03 medical and health sciences; Audio and Speech Processing (eess.AS); Acoustic scene classification; Feature (machine learning); FOS: Electrical engineering electronic engineering information engineering; General Materials Science; Block (data storage); Artificial neural network; business.industry; pattern recognition; General Engineering; deep learning; Pattern recognition; machine listening; squeeze-excitation; Artificial intelligence; lcsh:Electrical engineering. Electronics. Nuclear engineering; 0305 other medical science; business; lcsh:TK1-9971; Electrical Engineering and Systems Science - Audio and Speech Processing
researchProduct

Dimensionality Reduction via Regression in Hyperspectral Imagery

2015

This paper introduces a new unsupervised method for dimensionality reduction via regression (DRR). The algorithm belongs to the family of invertible transforms that generalize Principal Component Analysis (PCA) by using curvilinear instead of linear features. DRR identifies the nonlinear features through multivariate regression to ensure the reduction in redundancy between the PCA coefficients, the reduction of the variance of the scores, and the reduction in the reconstruction error. More importantly, unlike other nonlinear dimensionality reduction methods, the invertibility, volume-preservation, and straightforward out-of-sample extension make DRR interpretable and easy to apply. The pro…

FOS: Computer and information sciences; business.industry; Dimensionality reduction; Computer Vision and Pattern Recognition (cs.CV); Feature extraction; Nonlinear dimensionality reduction; Diffusion map; Computer Science - Computer Vision and Pattern Recognition; Pattern recognition; Machine Learning (stat.ML); Collinearity; Reduction (complexity); Statistics - Machine Learning; Signal Processing; Principal component analysis; Artificial intelligence; Electrical and Electronic Engineering; business; Mathematics; Curse of dimensionality
researchProduct

Audio-video people recognition system for an intelligent environment

2011

In this paper, an audio-video system for intelligent environments with the capability to recognize people is presented. Users are tracked inside the environment, and their positions and activities can be logged. Users' identities are assessed through a multimodal approach by detecting and recognizing voices and faces through the different cameras and microphones installed in the environment. This approach has been chosen in order to create a flexible and cheap but reliable system, implemented using consumer electronics. Voice features are extracted by a short-time cepstrum analysis, and face features are extracted using the eigenfaces technique. The recognition task is solved using the same Su…

Face feature; Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; business.industry; Computer science; Intelligent environment; People recognition; Feature extraction; Reliable system; Set-up phase; Single sensor; Facial recognition system; Selection principle; Support vector machine; Software; Eigenface; Multi-modal approach; Middleware; Cepstrum; Learning rule; Intelligent environment; Cepstrum analysis; Computer vision; Artificial intelligence; Eigenface; business
researchProduct

Probabilistic Corner Detection for Facial Feature Extraction

2009

After more than 35 years of research, face processing is nowadays considered one of the most important applications of image analysis. It can be considered as a collection of problems (i.e., face detection, normalization, recognition and so on), each of which can be treated separately. Some face detection and face recognition techniques have reached a certain level of maturity; however, facial feature extraction still represents the bottleneck of the entire process. In this paper we present a novel facial feature extraction approach that could be used for normalizing Viola-Jones detected faces and let them be recognized by an appearance-based face recognition method. For each observed featur…

Face hallucination; business.industry; Computer science; Feature extraction; Corner detection; Normalization (image processing); Pattern recognition; Face detection - face recognition - features extraction - CBIR; Facial recognition system; Object-class detection; Three-dimensional face recognition; Computer vision; Artificial intelligence; Face detection; business
researchProduct

Automatic Generation of Subject-Based Image Transitions

2011

This paper presents a novel approach for the automatic generation of image slideshows. In contrast to standard cross-fading, the idea is to perform the image transitions while keeping the subject in focus in the intermediate frames, by automatically identifying him/her and preserving the alignment of the face and facial features. This is done by using a novel Active Shape Model and time-series Image Registration. The final result is an aesthetically appealing slideshow which emphasizes the subject. The results have been evaluated with a users’ response survey. The outcomes show that end users widely prefer the proposed slideshow concept to standard image transitions.

Face processing; image morphing; image registration; Computer science; business.industry; ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION; Image registration; Subject (documents); Image processing; image morphing; Image (mathematics); image registration; Automatic image annotation; Active shape model; Face (geometry); Face processing; Computer vision; Artificial intelligence; business; Feature detection (computer vision)
researchProduct

Designing a framework for assisting depression severity assessment from facial image analysis

2015

Depression is one of the most common mental disorders, affecting millions of people worldwide. Developing adjunct tools aiding depression assessment is expected to improve overall health outcomes and reduce treatment costs. To this end, platforms designed for automatic and non-invasive depression assessment could help in detecting signs of the disease on a regular basis, without requiring the physical presence of a mental health professional. Despite the different approaches that can be found in the literature, both in terms of methods and algorithms, a fully satisfactory system for the automatic assessment of depression severity has not yet been presented. This paper describes a propose…

Facial expression; Computer science; Process (engineering); business.industry; Feature extraction; Feature selection; Machine learning; computer.software_genre; Mental health; Curvelet; Artificial intelligence; business; Hidden Markov model; computer; Depression (differential diagnoses); 2015 IEEE International Conference on Signal and Image Processing Applications (ICSIPA)
researchProduct