Search results for "VISION"
showing 10 items of 5066 documents
Incorporating depth information into few-shot semantic segmentation
2021
International audience; Few-shot segmentation presents a significant challenge for semantic scene understanding under limited supervision. Namely, this task aims to generalize the segmentation ability of the model to new categories given a few samples. In order to obtain complete scene information, we extend the RGB-centric methods to take advantage of complementary depth information. In this paper, we propose a two-stream deep neural network based on metric learning. Our method, known as RDNet, learns class-specific prototype representations within RGB and depth embedding spaces, respectively. The learned prototypes provide effective semantic guidance on the corresponding RGB and depth query ima…
Leveraging Uncertainty Estimates to Improve Segmentation Performance in Cardiac MR
2021
International audience; In medical image segmentation, several studies have used Bayesian neural networks to segment and quantify the uncertainty of the images. These studies show that there might be an increased epistemic uncertainty in areas where there are semantically and visually challenging pixels. The uncertain areas of the image can be of great interest, as they may indicate regions of incorrect segmentation. To leverage the uncertainty information, we propose a segmentation model that incorporates the uncertainty into its learning process. First, we generate the uncertainty estimate (sample variance) using Monte-Carlo dropout during training. Then we incorporate it …
Sub-optimal waypoints, UAV path planning and mosaicing application
2016
International audience; This work creates a complete video surveillance system using a camera mounted on a robot such as a UAV, maintaining optimized coverage of a vast area and reconstructing an image using mosaicing techniques. The paper demonstrates the efficiency of using a single UAV to cover a vast area from optimized positions.
Application of LSTM architectures for next frame forecasting in Sentinel-1 images time series
2020
Predictive analysis makes it possible to estimate trends in future events. Nowadays, deep learning algorithms can produce good predictions. However, for each type of problem, the optimal architecture must be chosen. In this article, the Stack-LSTM, CNN-LSTM and ConvLSTM models are applied to a time series of Sentinel-1 radar images, the goal being to predict the next occurrence in a sequence. The experimental results, evaluated using performance indicators such as RMSE and MAE, processing time, and the SSIM similarity index, show that each of the three architectures can produce good results depending on…
Hidden Markov random fields and cuckoo search method for medical image segmentation
2020
Segmentation of medical images is an essential part of the diagnostic process. Physicians require automatic, robust, and valid results. Hidden Markov Random Fields (HMRF) provide a powerful model, which casts the segmentation problem as the minimization of an energy function. The cuckoo search (CS) algorithm is one of the recent nature-inspired meta-heuristic algorithms. It has shown its efficiency in many engineering optimization problems. In this paper, we use three cuckoo search algorithms to achieve medical image segmentation.
Unsupervised learning of category-specific symmetric 3D keypoints from point sets
2020
Lecture Notes in Computer Science, 12370
3D landmark detection for augmented reality based otologic procedures
2019
International audience; The ear contains the smallest bones in the human body and does not contain a significant number of distinct landmark points that may be used to register a preoperative CT scan with the surgical video in an augmented reality framework. Learning-based algorithms may be used to help surgeons identify landmark points. This paper presents a convolutional neural network approach to landmark detection in preoperative ear CT images and then discusses an augmented reality system that can be used to visualize the cochlear axis on an otologic surgical video.
Improving Video Object Detection by Seq-Bbox Matching
2019
International audience
Enhancement and assessment of WKS variance parameter for intelligent 3D shape recognition and matching based on MPSO
2016
This paper presents an improved wave kernel signature (WKS), using modified particle swarm optimization (MPSO), for intelligent recognition and matching of 3D shapes. We select the first feature vector from WKS, which represents the 3D shape over the first energy scale. This vector is chosen to reinforce robustness to non-rigid 3D shapes. Furthermore, an optimized WKS-based method for extracting key-points from objects is introduced. Due to its discriminative power, the optimized WKS values associated with each point remain extremely stable, which allows for efficient extraction of salient features. To assess the robustness of our method against topological deformations,…
Repérage précis de caméras multispectrales et de scanners 3D pour le recalage de données multicapteurs appliqué à l'étude du patrimoine
2012
Session "Atelier V3DPAT"; National audience; Our work concerns the registration of multi-sensor data, and specifically the projection of 2D textures onto 3D models of stone heritage objects. We are particularly interested in textures acquired by multispectral imaging, but our technique is also suited to other optical acquisition systems such as thermal imaging. The 3D models are acquired with a fringe projection system. The difficulty of multi-sensor registration stems mainly from the variation in how the object is represented. Thus, the salient points of one dataset do not necessarily correspond to those of another re…