Search results for " Computer vision"

showing 10 items of 352 documents

Graph Embedding via High Dimensional Model Representation for Hyperspectral Images

2021

Learning the manifold structure of remote sensing images is of paramount relevance for modeling and understanding processes, as well as to encapsulate the high dimensionality in a reduced set of informative features for subsequent classification, regression, or unmixing. Manifold learning methods have shown excellent performance to deal with hyperspectral image (HSI) analysis but, unless specifically designed, they cannot provide an explicit embedding map readily applicable to out-of-sample data. A common assumption to deal with the problem is that the transformation between the high-dimensional input space and the (typically low) latent space is linear. This is a particularly strong assump…

FOS: Computer and information sciencesComputer Science - Machine LearningI.5.2Computer Vision and Pattern Recognition (cs.CV)G.1.6I.5.4Image and Video Processing (eess.IV)0211 other engineering and technologiesComputer Science - Computer Vision and Pattern RecognitionI.4.702 engineering and technologyElectrical Engineering and Systems Science - Image and Video ProcessingI.4.10; I.5.2; G.1.6; I.4.7; I.5.4I.4.10Machine Learning (cs.LG)FOS: Electrical engineering electronic engineering information engineeringGeneral Earth and Planetary SciencesElectrical and Electronic Engineering021101 geological & geomatics engineering

researchProduct

Brima: Low-Overhead Browser-Only Image Annotation Tool (Preprint)

2021

Image annotation and large annotated datasets are crucial parts within the Computer Vision and Artificial Intelligence this http URL the same time, it is well-known and acknowledged by the research community that the image annotation process is challenging, time-consuming and hard to scale. Therefore, the researchers and practitioners are always seeking ways to perform the annotations easier, faster, and at higher quality. Even though several widely used tools exist and the tools' landscape evolved considerably, most of the tools still require intricate technical setups and high levels of technical savviness from its operators and crowdsource contributors. In order to address such challenge…

FOS: Computer and information sciencesComputer Science - Machine LearningLow overheadProcess (engineering)Computer scienceComputer Vision and Pattern Recognition (cs.CV)Scale (chemistry)media_common.quotation_subjectComputer Science - Computer Vision and Pattern RecognitionMachine Learning (cs.LG)World Wide WebCrowdsourceAutomatic image annotationResearch communityQuality (business)Preprintmedia_common2021 IEEE International Conference on Image Processing (ICIP)

researchProduct

Unsupervised Anomaly and Change Detection With Multivariate Gaussianization

2022

Anomaly detection (AD) is a field of intense research in remote sensing (RS) image processing. Identifying low probability events in RS images is a challenging problem given the high dimensionality of the data, especially when no (or little) information about the anomaly is available a priori. While a plenty of methods are available, the vast majority of them do not scale well to large datasets and require the choice of some (very often critical) hyperparameters. Therefore, unsupervised and computationally efficient detection methods become strictly necessary, especially now with the data deluge problem. In this article, we propose an unsupervised method for detecting anomalies and changes …

FOS: Computer and information sciencesComputer Science - Machine LearningMultivariate statisticsComputer sciencebusiness.industryComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionFOS: Physical sciencesImage processingPattern recognitionMultivariate normal distributionComputational Physics (physics.comp-ph)Machine Learning (cs.LG)Methodology (stat.ME)Transformation (function)Robustness (computer science)General Earth and Planetary SciencesAnomaly detectionArtificial intelligenceElectrical and Electronic EngineeringbusinessPhysics - Computational PhysicsStatistics - MethodologyChange detectionCurse of dimensionalityIEEE Transactions on Geoscience and Remote Sensing

researchProduct

Extracting Deformation-Aware Local Features by Learning to Deform

2021

Despite the advances in extracting local features achieved by handcrafted and learning-based descriptors, they are still limited by the lack of invariance to non-rigid transformations. In this paper, we present a new approach to compute features from still images that are robust to non-rigid deformations to circumvent the problem of matching deformable surfaces and objects. Our deformation-aware local descriptor, named DEAL, leverages a polar sampling and a spatial transformer warping to provide invariance to rotation, scale, and image deformations. We train the model architecture end-to-end by applying isometric non-rigid deformations to objects in a simulated environment as guidance to pr…

FOS: Computer and information sciencesComputer Science - Machine Learning[INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Computer Vision and Pattern Recognition (cs.CV)Computer Science::Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONComputer Science - Computer Vision and Pattern Recognition[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Machine Learning (cs.LG)ComputingMethodologies_COMPUTERGRAPHICS

researchProduct

Deep Non-Line-of-Sight Reconstruction

2020

The recent years have seen a surge of interest in methods for imaging beyond the direct line of sight. The most prominent techniques rely on time-resolved optical impulse responses, obtained by illuminating a diffuse wall with an ultrashort light pulse and observing multi-bounce indirect reflections with an ultrafast time-resolved imager. Reconstruction of geometry from such data, however, is a complex non-linear inverse problem that comes with substantial computational demands. In this paper, we employ convolutional feed-forward networks for solving the reconstruction problem efficiently while maintaining good reconstruction quality. Specifically, we devise a tailored autoencoder architect…

FOS: Computer and information sciencesComputer Science - Machine Learningbusiness.industryComputer scienceComputer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern RecognitionNonlinear optics020207 software engineering02 engineering and technologyIterative reconstructionInverse problemElectrical Engineering and Systems Science - Image and Video ProcessingAutoencoderRendering (computer graphics)Machine Learning (cs.LG)Non-line-of-sight propagation0202 electrical engineering electronic engineering information engineeringFOS: Electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingComputer visionArtificial intelligencebusiness

researchProduct

Ensemble of Hankel Matrices for Face Emotion Recognition

2015

In this paper, a face emotion is considered as the result of the composition of multiple concurrent signals, each corresponding to the movements of a specific facial muscle. These concurrent signals are represented by means of a set of multi-scale appearance features that might be correlated with one or more concurrent signals. The extraction of these appearance features from a sequence of face images yields to a set of time series. This paper proposes to use the dynamics regulating each appearance feature time series to recognize among different face emotions. To this purpose, an ensemble of Hankel matrices corresponding to the extracted time series is used for emotion classification withi…

FOS: Computer and information sciencesComputer Science - RoboticsComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionComputer Science - Human-Computer InteractionRobotics (cs.RO)Human-Computer Interaction (cs.HC)

researchProduct

Multi-Grid Redundant Bounding Box Annotation for Accurate Object Detection

2021

Modern leading object detectors are either two-stage or one-stage networks repurposed from a deep CNN-based backbone classifier network. YOLOv3 is one such very-well known state-of-the-art one-shot detector that takes in an input image and divides it into an equal-sized grid matrix. The grid cell having the center of an object is the one responsible for detecting the particular object. This paper presents a new mathematical approach that assigns multiple grids per object for accurately tight-fit bounding box prediction. We also propose an effective offline copy-paste data augmentation for object detection. Our proposed method significantly outperforms some current state-of-the-art object de…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognition

researchProduct

Eigen-Distortions of Hierarchical Representations

2017

We develop a method for comparing hierarchical image representations in terms of their ability to explain perceptual sensitivity in humans. Specifically, we utilize Fisher information to establish a model-derived prediction of sensitivity to local perturbations of an image. For a given image, we compute the eigenvectors of the Fisher information matrix with largest and smallest eigenvalues, corresponding to the model-predicted most- and least-noticeable image distortions, respectively. For human subjects, we then measure the amount of each distortion that can be reliably detected when added to the image. We use this method to test the ability of a variety of representations to mimic human p…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognition

researchProduct

MOISST: Multimodal Optimization of Implicit Scene for SpatioTemporal calibration

2023

With the recent advances in autonomous driving and the decreasing cost of LiDARs, the use of multimodal sensor systems is on the rise. However, in order to make use of the information provided by a variety of complimentary sensors, it is necessary to accurately calibrate them. We take advantage of recent advances in computer graphics and implicit volumetric scene representation to tackle the problem of multi-sensor spatial and temporal calibration. Thanks to a new formulation of the Neural Radiance Field (NeRF) optimization, we are able to jointly optimize calibration parameters along with scene representation based on radiometric and geometric measurements. Our method enables accurate and …

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognition

researchProduct

Surgical Visual Domain Adaptation: Results from the MICCAI 2020 SurgVisDom Challenge

2021

Surgical data science is revolutionizing minimally invasive surgery by enabling context-aware applications. However, many challenges exist around surgical data (and health data, more generally) needed to develop context-aware models. This work - presented as part of the Endoscopic Vision (EndoVis) challenge at the Medical Image Computing and Computer Assisted Intervention (MICCAI) 2020 conference - seeks to explore the potential for visual domain adaptation in surgery to overcome data privacy concerns. In particular, we propose to use video from virtual reality (VR) simulations of surgical exercises in robotic-assisted surgery to develop algorithms to recognize tasks in a clinical-like sett…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognition

researchProduct