Search results for "GEP"

showing 10 items of 1017 documents

Learning With Context Feedback Loop for Robust Medical Image Segmentation

2021

Deep learning has successfully been leveraged for medical image segmentation. It employs convolutional neural networks (CNN) to learn distinctive image features from a defined pixel-wise objective function. However, this approach can lead to less output pixel interdependence producing incomplete and unrealistic segmentation results. In this paper, we present a fully automatic deep learning method for robust medical image segmentation by formulating the segmentation problem as a recurrent framework using two systems. The first one is a forward system of an encoder-decoder CNN that predicts the segmentation result from the input image. The predicted probabilistic output of the forward system …

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)Feature vectorComputer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONContext (language use)Convolutional neural networkMachine Learning (cs.LG)Feedback030218 nuclear medicine & medical imaging03 medical and health sciences0302 clinical medicineFOS: Electrical engineering electronic engineering information engineeringImage Processing Computer-Assisted[INFO.INFO-IM]Computer Science [cs]/Medical ImagingSegmentationElectrical and Electronic EngineeringComputingMilieux_MISCELLANEOUSRadiological and Ultrasound TechnologyPixelbusiness.industryDeep learningImage and Video Processing (eess.IV)Pattern recognitionImage segmentationElectrical Engineering and Systems Science - Image and Video ProcessingFeedback loopComputer Science ApplicationsFeature (computer vision)Neural Networks ComputerArtificial intelligencebusinessSoftware

researchProduct

Enforcing Perceptual Consistency on Generative Adversarial Networks by Using the Normalised Laplacian Pyramid Distance

2019

In recent years there has been a growing interest in image generation through deep learning. While an important part of the evaluation of the generated images usually involves visual inspection, the inclusion of human perception as a factor in the training process is often overlooked. In this paper we propose an alternative perceptual regulariser for image-to-image translation using conditional generative adversarial networks (cGANs). To do so automatically (avoiding visual inspection), we use the Normalised Laplacian Pyramid Distance (NLPD) to measure the perceptual similarity between the generated image and the original image. The NLPD is based on the principle of normalising the value of…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceImage qualitymedia_common.quotation_subjectComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONMachine Learning (stat.ML)Translation (geometry)Image (mathematics)Machine Learning (cs.LG)Consistency (database systems)Statistics - Machine LearningPerceptionFOS: Electrical engineering electronic engineering information engineeringmedia_commonbusiness.industryDeep learningImage and Video Processing (eess.IV)Contrast (statistics)Pattern recognitionGeneral MedicineImage segmentationElectrical Engineering and Systems Science - Image and Video ProcessingGenerative Adversarial NetworkPerceptionArtificial intelligencebusiness

researchProduct

Extracting Deformation-Aware Local Features by Learning to Deform

2021

Despite the advances in extracting local features achieved by handcrafted and learning-based descriptors, they are still limited by the lack of invariance to non-rigid transformations. In this paper, we present a new approach to compute features from still images that are robust to non-rigid deformations to circumvent the problem of matching deformable surfaces and objects. Our deformation-aware local descriptor, named DEAL, leverages a polar sampling and a spatial transformer warping to provide invariance to rotation, scale, and image deformations. We train the model architecture end-to-end by applying isometric non-rigid deformations to objects in a simulated environment as guidance to pr…

FOS: Computer and information sciencesComputer Science - Machine Learning[INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Computer Vision and Pattern Recognition (cs.CV)Computer Science::Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONComputer Science - Computer Vision and Pattern Recognition[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Machine Learning (cs.LG)ComputingMethodologies_COMPUTERGRAPHICS

researchProduct

Combination of Hidden Markov Random Field and Conjugate Gradient for Brain Image Segmentation

2017

Image segmentation is the process of partitioning the image into significant regions easier to analyze. Nowadays, segmentation has become a necessity in many practical medical imaging methods as locating tumors and diseases. Hidden Markov Random Field model is one of several techniques used in image segmentation. It provides an elegant way to model the segmentation process. This modeling leads to the minimization of an objective function. Conjugate Gradient algorithm (CG) is one of the best known optimization techniques. This paper proposes the use of the Conjugate Gradient algorithm (CG) for image segmentation, based on the Hidden Markov Random Field. Since derivatives are not available fo…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)Computer Science::Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONComputer Science - Computer Vision and Pattern Recognition

researchProduct

Visual Illusions Also Deceive Convolutional Neural Networks: Analysis and Implications

2019

Visual illusions allow researchers to devise and test new models of visual perception. Here we show that artificial neural networks trained for basic visual tasks in natural images are deceived by brightness and color illusions, having a response that is qualitatively very similar to the human achromatic and chromatic contrast sensitivity functions, and consistent with natural image statistics. We also show that, while these artificial networks are deceived by illusions, their response might be significantly different to that of humans. Our results suggest that low-level illusions appear in any system that has to perform basic visual tasks in natural environments, in line with error minimiz…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONComputer Science - Computer Vision and Pattern Recognition

researchProduct

Degraded Historical Documents Images Binarization Using a Combination of Enhanced Techniques

2019

Document image binarization is the initial step and a crucial in many document analysis and recognition scheme. In fact, it is still a relevant research subject and a fundamental challenge due to its importance and influence. This paper provides an original multi-phases system that hybridizes various efficient image thresholding methods in order to get the best binarization output. First, to improve contrast in particularly defective images, the application of CLAHE algorithm is suggested and justified. We then use a cooperative technique to segment image into two separated classes. At the end, a special transformation is applied for the purpose of removing scattered noise and of correcting…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONComputer Science - Computer Vision and Pattern Recognition

researchProduct

A Psychophysically Oriented Saliency Map Prediction Model

2020

Visual attention is one of the most significant characteristics for selecting and understanding the outside redundancy world. The human vision system cannot process all information simultaneously due to the visual information bottleneck. In order to reduce the redundant input of visual information, the human visual system mainly focuses on dominant parts of scenes. This is commonly known as visual saliency map prediction. This paper proposed a new psychophysical saliency prediction architecture, WECSF, inspired by multi-channel model of visual cortex functioning in humans. The model consists of opponent color channels, wavelet transform, wavelet energy map, and contrast sensitivity function…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONComputer Science - Computer Vision and Pattern Recognition

researchProduct

On the Reliability of the PNU for Source Camera Identification Tasks

2020

The PNU is an essential and reliable tool to perform SCI and, during the years, became a standard de-facto for this task in the forensic field. In this paper, we show that, although strategies exist that aim to cancel, modify, replace the PNU traces in a digital camera image, it is still possible, through our experimental method, to find residual traces of the noise produced by the sensor used to shoot the photo. Furthermore, we show that is possible to inject the PNU of a different camera in a target image and trace it back to the source camera, but only under the condition that the new camera is of the same model of the original one used to take the target image. Both cameras must fall wi…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)FOS: Electrical engineering electronic engineering information engineeringComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONComputer Science - Computer Vision and Pattern RecognitionElectrical Engineering and Systems Science - Image and Video Processing

researchProduct

A Region-based Gauss-Newton Approach to Real-Time Monocular Multiple Object Tracking

2018

We propose an algorithm for real-time 6DOF pose tracking of rigid 3D objects using a monocular RGB camera. The key idea is to derive a region-based cost function using temporally consistent local color histograms. While such region-based cost functions are commonly optimized using first-order gradient descent techniques, we systematically derive a Gauss-Newton optimization scheme which gives rise to drastically faster convergence and highly accurate and robust tracking performance. We furthermore propose a novel complex dataset dedicated for the task of monocular object pose tracking and make it publicly available to the community. To our knowledge, it is the first to address the common and…

FOS: Computer and information sciencesComputer scienceComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION02 engineering and technologyArtificial IntelligenceHistogram0202 electrical engineering electronic engineering information engineeringComputer visionPoseMonocularbusiness.industryApplied MathematicsImage segmentationObject detectionComputational Theory and MathematicsVideo trackingComputer Science::Computer Vision and Pattern RecognitionRGB color model020201 artificial intelligence & image processingComputer Vision and Pattern RecognitionArtificial intelligencebusinessGradient descentSoftware

researchProduct

Deep Generative Model-Driven Multimodal Prostate Segmentation in Radiotherapy

2019

Deep learning has shown unprecedented success in a variety of applications, such as computer vision and medical image analysis. However, there is still potential to improve segmentation in multimodal images by embedding prior knowledge via learning-based shape modeling and registration to learn the modality invariant anatomical structure of organs. For example, in radiotherapy automatic prostate segmentation is essential in prostate cancer diagnosis, therapy, and post-therapy assessment from T2-weighted MR or CT images. In this paper, we present a fully automatic deep generative model-driven multimodal prostate segmentation method using convolutional neural network (DGMNet). The novelty of …

FOS: Computer and information sciencesComputer scienceComputer Vision and Pattern Recognition (cs.CV)medicine.medical_treatmentProstate segmentationFeature extractionComputer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONConvolutional neural network[SDV.IB.MN]Life Sciences [q-bio]/Bioengineering/Nuclear medicineConvolutional neural network030218 nuclear medicine & medical imaging03 medical and health sciencesProstate cancer0302 clinical medicineFOS: Electrical engineering electronic engineering information engineeringmedicineSegmentationArtificial neural networkbusiness.industryDeep learningImage and Video Processing (eess.IV)NoveltyDeep learningPattern recognitionElectrical Engineering and Systems Science - Image and Video Processingmedicine.diseaseTransfer learning3. Good healthRadiation therapyGenerative model030220 oncology & carcinogenesisEmbeddingArtificial intelligencebusinessCTMRI

researchProduct