Search results for "VISION"

showing 10 items of 5066 documents

Incorporating depth information into few-shot semantic segmentation

2021

International audience; Few-shot segmentation presents a significant challengefor semantic scene understanding under limited supervision.Namely, this task targets at generalizing the segmentationability of the model to new categories given a few samples.In order to obtain complete scene information, we extend theRGB-centric methods to take advantage of complementary depthinformation. In this paper, we propose a two-stream deep neuralnetwork based on metric learning. Our method, known as RDNet,learns class-specific prototype representations within RGB anddepth embedding spaces, respectively. The learned prototypesprovide effective semantic guidance on the corresponding RGBand depth query ima…

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]Artificial neural networkComputer sciencebusiness.industry[INFO.INFO-TS] Computer Science [cs]/Signal and Image ProcessingComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]020206 networking & telecommunications02 engineering and technologyImage segmentationSemanticsVisualization[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI][INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV][INFO.INFO-TS]Computer Science [cs]/Signal and Image ProcessingMetric (mathematics)0202 electrical engineering electronic engineering information engineeringEmbeddingRGB color modelSegmentationComputer visionArtificial intelligencebusiness
researchProduct

Leveraging Uncertainty Estimates to Improve Segmentation Performance in Cardiac MR

2021

International audience; In medical image segmentation, several studies have used Bayesian neural networks to segment and quantify the uncertainty of the images. These studies show that there might be an increased epistemic uncertainty in areas where there are semantically and visually challenging pixels. The uncertain areas of the image can be of a great interest as they can possibly indicate the regions of incorrect segmentation. To leverage the uncertainty information, we propose a segmentation model that incorporates the uncertainty into its learning process. Firstly, we generate the uncertainty estimate (sample variance) using Monte-Carlo dropout during training. Then we incorporate it …

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]Bayesian deep learningCardiac MRI Segmentation[INFO.INFO-IM] Computer Science [cs]/Medical ImagingComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONUncertainty[INFO.INFO-IM]Computer Science [cs]/Medical ImagingMyocardial scar[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
researchProduct

Sub-optimal waypoints, UAV path planning and mosaicing application

2016

International audience; Create a complete system of video surveillance using camera mounted on a robot like UAV to maintain optimized vast area coverage and reconstruct an image by using mosaicing techniques. This paper demonstrated the efficiency of using one UAV to cover vast area using optimized positions.

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]Cover (telecommunications)Computer scienceComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION010103 numerical & computational mathematics01 natural sciencesUnmanned aerial vehicles[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI][INFO.INFO-RB]Computer Science [cs]/Robotics [cs.RO]Computer visionMotion planning0101 mathematics[ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]Genetic Algorithmbusiness.industry[ INFO.INFO-RB ] Computer Science [cs]/Robotics [cs.RO][INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO][SPI.TRON] Engineering Sciences [physics]/Electronics[ SPI.TRON ] Engineering Sciences [physics]/Electronics[SPI.TRON]Engineering Sciences [physics]/Electronics010101 applied mathematicsCoverage path planningArea coverageRobotArtificial intelligencebusiness
researchProduct

Application of LSTM architectures for next frame forecasting in Sentinel-1 images time series

2020

L'analyse prédictive permet d'estimer les tendances des évènements futurs. De nos jours, les algorithmes Deep Learning permettent de faire de bonnes prédictions. Cependant, pour chaque type de problème donné, il est nécessaire de choisir l'architecture optimale. Dans cet article, les modèles Stack-LSTM, CNN-LSTM et ConvLSTM sont appliqués à une série temporelle d'images radar sentinel-1, le but étant de prédire la prochaine occurrence dans une séquence. Les résultats expérimentaux évalués à l'aide des indicateurs de performance tels que le RMSE et le MAE, le temps de traitement et l'index de similarité SSIM, montrent que chacune des trois architectures peut produire de bons résultats en fon…

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]FOS: Computer and information sciencesApprentissage profondComputer Science - Machine LearningImage and Video Processing (eess.IV)[INFO.INFO-NE] Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]PrévisionComputer Science - Neural and Evolutionary ComputingDeep Learning AlgorithmsPrédiction[INFO.INFO-NE]Computer Science [cs]/Neural and Evolutionary Computing [cs.NE]Electrical Engineering and Systems Science - Image and Video ProcessingLand cover change[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]Machine Learning (cs.LG)SARIMA[INFO.INFO-TI] Computer Science [cs]/Image Processing [eess.IV][INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]FOS: Electrical engineering electronic engineering information engineeringSatellite imagesNeural and Evolutionary Computing (cs.NE)LSTMPredictionForecastingImages satellitaires
researchProduct

hidden markov random fields and cuckoo search method for medical image segmentation

2020

Segmentation of medical images is an essential part in the process of diagnostics. Physicians require an automatic, robust and valid results. Hidden Markov Random Fields (HMRF) provide powerful model. This latter models the segmentation problem as the minimization of an energy function. Cuckoo search (CS) algorithm is one of the recent nature-inspired meta-heuristic algorithms. It has shown its efficiency in many engineering optimization problems. In this paper, we use three cuckoo search algorithm to achieve medical image segmentation.

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]FOS: Computer and information sciencesComputer Science - Machine LearningComputer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)FOS: Electrical engineering electronic engineering information engineeringComputer Science - Computer Vision and Pattern RecognitionElectrical Engineering and Systems Science - Image and Video Processing[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]Machine Learning (cs.LG)
researchProduct

Unsupervised learning of category-specific symmetric 3D keypoints from point sets

2020

Lecture Notes in Computer Science, 12370

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]FOS: Computer and information sciencesComputer sciencePlane symmetryComputer Vision and Pattern Recognition (cs.CV)Point cloudComputer Science - Computer Vision and Pattern Recognition02 engineering and technology010501 environmental sciences01 natural sciences[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI][INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Linear basis0202 electrical engineering electronic engineering information engineeringComputingMilieux_COMPUTERSANDEDUCATIONPoint (geometry)0105 earth and related environmental sciencesbusiness.industryCategory specific[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Pattern recognition16. Peace & justiceBenchmark (computing)Unsupervised learning020201 artificial intelligence & image processingArtificial intelligenceSymmetry (geometry)business
researchProduct

3D landmark detection for augmented reality based otologic procedures

2019

International audience; Ear consists of the smallest bones in the human body and does not contain significant amount of distinct landmark points that may be used to register a preoperative CT-scan with the surgical video in an augmented reality framework. Learning based algorithms may be used to help the surgeons to identify landmark points. This paper presents a convolutional neural network approach to landmark detection in preoperative ear CT images and then discusses an augmented reality system that can be used to visualize the cochlear axis on an otologic surgical video.

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]FOS: Computer and information sciences[INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Computer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern Recognition[INFO.INFO-IM] Computer Science [cs]/Medical ImagingFOS: Electrical engineering electronic engineering information engineering[INFO.INFO-IM]Computer Science [cs]/Medical Imaging[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Electrical Engineering and Systems Science - Image and Video Processing[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
researchProduct

Improving Video Object Detection by Seq-Bbox Matching

2019

International audience

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]Matching (statistics)business.industryComputer science02 engineering and technology010501 environmental sciences01 natural sciencesObject detection[INFO.INFO-ES] Computer Science [cs]/Embedded Systems[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingComputer vision[INFO.INFO-ES]Computer Science [cs]/Embedded SystemsArtificial intelligencebusinessComputingMilieux_MISCELLANEOUS0105 earth and related environmental sciences
researchProduct

Enhancement and assessment of WKS variance parameter for intelligent 3D shape recognition and matching based on MPSO

2016

This paper presents an improved wave kernel signature (WKS) using the modified particle swarm optimization (MPSO)-based intelligent recognition and matching on 3D shapes. We select the first feature vector from WKS, which represents the 3D shape over the first energy scale. The choice of this vector is to reinforce robustness against non-rigid 3D shapes. Furthermore, an optimized WKS-based method for extracting key-points from objects is introduced. Due to its discriminative power, the associated optimized WKS values with each point remain extremely stable, which allows for efficient salient features extraction. To assert our method regarding its robustness against topological deformations,…

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI][ INFO ] Computer Science [cs]Matching (graph theory)Feature vectorComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION02 engineering and technology[INFO] Computer Science [cs][ INFO.INFO-CV ] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV][INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]Kernel (linear algebra)[INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Discriminative modelRobustness (computer science)0202 electrical engineering electronic engineering information engineeringFeature (machine learning)[INFO]Computer Science [cs][ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]ComputingMilieux_MISCELLANEOUSMathematicsbusiness.industryParticle swarm optimization[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]020207 software engineeringPattern recognition020201 artificial intelligence & image processingArtificial intelligencebusinessEnergy (signal processing)
researchProduct

Repérage précis de caméras multispectrales et de scanners 3D pour le recalage de données multicapteurs appliqué à l'étude du patrimoine

2012

Session "Atelier V3DPAT"; National audience; Nos travaux portent sur le recalage de données multi-capteurs et spécifiquement sur la projection de textures 2D sur des modèles 3D d'objet du patrimoine en pierre. Nous nous intéressons particulièrement aux textures acquises par imagerie multispectrale mais notre technique est également adaptée à d'autres systèmes optiques d'acquisition tels que l'imagerie thermique. Les modèles 3D, eux, sont acquis par un système de projection de franges. La difficulté du recalage multicapteur vient principalement de la variation de la représentation de l'objet. Ainsi, les points saillants d'un jeu de données ne correspondent pas forcément à ceux d'une autre re…

[INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI][INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV][INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV][ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI][ INFO.INFO-CV ] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV][INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
researchProduct