Search results for " Computer Vision"

showing 10 items of 352 documents

Disentangling the Link Between Image Statistics and Human Perception

2023

In the 1950s Horace Barlow and Fred Attneave suggested a connection between sensory systems and how they are adapted to the environment: early vision evolved to maximise the information it conveys about incoming signals. Following Shannon's definition, this information was described using the probability of the images taken from natural scenes. Previously, direct accurate predictions of image probabilities were not possible due to computational limitations. Despite the exploration of this idea being indirect, mainly based on oversimplified models of the image density or on system design methods, these methods had success in reproducing a wide range of physiological and psychophysical phenom…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer Vision and Pattern Recognition (cs.CV)FOS: Biological sciencesQuantitative Biology - Neurons and CognitionComputer Science - Computer Vision and Pattern RecognitionNeurons and Cognition (q-bio.NC)ArticleMachine Learning (cs.LG)

researchProduct

Local-Area-Learning Network: Meaningful Local Areas for Efficient Point Cloud Analysis

2020

Research in point cloud analysis with deep neural networks has made rapid progress in recent years. The pioneering work PointNet offered a direct analysis of point clouds. However, due to its architecture PointNet is not able to capture local structures. To overcome this drawback, the same authors have developed PointNet++ by applying PointNet to local areas. The local areas are defined by center points and their neighbors. In PointNet++ and its further developments the center points are determined with a Farthest Point Sampling (FPS) algorithm. This has the disadvantage that the center points in general do not have meaningful local areas. In this paper, we introduce the neural Local-Area-L…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern RecognitionFOS: Electrical engineering electronic engineering information engineeringElectrical Engineering and Systems Science - Image and Video ProcessingMachine Learning (cs.LG)

researchProduct

Requirement analysis for an artificial intelligence model for the diagnosis of the COVID-19 from chest X-ray data

2021

There are multiple papers published about different AI models for the COVID-19 diagnosis with promising results. Unfortunately according to the reviews many of the papers do not reach the level of sophistication needed for a clinically usable model. In this paper I go through multiple review papers, guidelines, and other relevant material in order to generate more comprehensive requirements for the future papers proposing a AI based diagnosis of the COVID-19 from chest X-ray data (CXR). Main findings are that a clinically usable AI needs to have an extremely good documentation, comprehensive statistical analysis of the possible biases and performance, and an explainability module.

FOS: Computer and information sciencesComputer Science - Machine LearningComputer Vision and Pattern Recognition (cs.CV)tilastomenetelmätImage and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern RecognitionCOVID-19ennusteetlääketiedetekoälydiagnostiikkaElectrical Engineering and Systems Science - Image and Video Processingartificial intelligenceMachine Learning (cs.LG)data modelsclinical diagnosisstatistical analysisFOS: Electrical engineering electronic engineering information engineeringtilastolliset mallittietomallittietojärjestelmät2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

researchProduct

Road scenes analysis in adverse weather conditions by polarization-encoded images and adapted deep learning

2019

International audience; Object detection in road scenes is necessary to develop both autonomous vehicles and driving assistance systems. Even if deep neural networks for recognition task have shown great performances using conventional images, they fail to detect objects in road scenes in complex acquisition situations. In contrast, polarization images, characterizing the light wave, can robustly describe important physical properties of the object even under poor illumination or strong reflections. This paper shows how non-conventional polarimetric imaging modality overcomes the classical methods for object detection especially in adverse weather conditions. The efficiency of the proposed …

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONMachine Learning (stat.ML)02 engineering and technology010501 environmental sciences01 natural sciencesMachine Learning (cs.LG)[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI][SPI.GCIV.IT]Engineering Sciences [physics]/Civil Engineering/Infrastructures de transportStatistics - Machine Learning0202 electrical engineering electronic engineering information engineeringComputer vision0105 earth and related environmental sciencesAdverse weatherbusiness.industryDeep learningPolarization (waves)Object detectionRGB color model020201 artificial intelligence & image processingArtificial intelligencebusiness

researchProduct

Learning With Context Feedback Loop for Robust Medical Image Segmentation

2021

Deep learning has successfully been leveraged for medical image segmentation. It employs convolutional neural networks (CNN) to learn distinctive image features from a defined pixel-wise objective function. However, this approach can lead to less output pixel interdependence producing incomplete and unrealistic segmentation results. In this paper, we present a fully automatic deep learning method for robust medical image segmentation by formulating the segmentation problem as a recurrent framework using two systems. The first one is a forward system of an encoder-decoder CNN that predicts the segmentation result from the input image. The predicted probabilistic output of the forward system …

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)Feature vectorComputer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONContext (language use)Convolutional neural networkMachine Learning (cs.LG)Feedback030218 nuclear medicine & medical imaging03 medical and health sciences0302 clinical medicineFOS: Electrical engineering electronic engineering information engineeringImage Processing Computer-Assisted[INFO.INFO-IM]Computer Science [cs]/Medical ImagingSegmentationElectrical and Electronic EngineeringComputingMilieux_MISCELLANEOUSRadiological and Ultrasound TechnologyPixelbusiness.industryDeep learningImage and Video Processing (eess.IV)Pattern recognitionImage segmentationElectrical Engineering and Systems Science - Image and Video ProcessingFeedback loopComputer Science ApplicationsFeature (computer vision)Neural Networks ComputerArtificial intelligencebusinessSoftware

researchProduct

Nonlinear Cook distance for Anomalous Change Detection

2020

In this work we propose a method to find anomalous changes in remote sensing images based on the chronochrome approach. A regressor between images is used to discover the most {\em influential points} in the observed data. Typically, the pixels with largest residuals are decided to be anomalous changes. In order to find the anomalous pixels we consider the Cook distance and propose its nonlinear extension using random Fourier features as an efficient nonlinear measure of impact. Good empirical performance is shown over different multispectral images both visually and quantitatively evaluated with ROC curves.

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)Multispectral imageComputer Science - Computer Vision and Pattern Recognition0211 other engineering and technologies02 engineering and technologyMeasure (mathematics)Machine Learning (cs.LG)Kernel (linear algebra)symbols.namesake0502 economics and businessCook's distance021101 geological & geomatics engineering050208 financePixelbusiness.industry05 social sciencesPattern recognitionNonlinear systemFourier transformKernel (image processing)Computer Science::Computer Vision and Pattern RecognitionsymbolsArtificial intelligencebusinessChange detection

researchProduct

Enforcing Perceptual Consistency on Generative Adversarial Networks by Using the Normalised Laplacian Pyramid Distance

2019

In recent years there has been a growing interest in image generation through deep learning. While an important part of the evaluation of the generated images usually involves visual inspection, the inclusion of human perception as a factor in the training process is often overlooked. In this paper we propose an alternative perceptual regulariser for image-to-image translation using conditional generative adversarial networks (cGANs). To do so automatically (avoiding visual inspection), we use the Normalised Laplacian Pyramid Distance (NLPD) to measure the perceptual similarity between the generated image and the original image. The NLPD is based on the principle of normalising the value of…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceImage qualitymedia_common.quotation_subjectComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONMachine Learning (stat.ML)Translation (geometry)Image (mathematics)Machine Learning (cs.LG)Consistency (database systems)Statistics - Machine LearningPerceptionFOS: Electrical engineering electronic engineering information engineeringmedia_commonbusiness.industryDeep learningImage and Video Processing (eess.IV)Contrast (statistics)Pattern recognitionGeneral MedicineImage segmentationElectrical Engineering and Systems Science - Image and Video ProcessingGenerative Adversarial NetworkPerceptionArtificial intelligencebusiness

researchProduct

Improving prostate whole gland segmentation in t2-weighted MRI with synthetically generated data

2021

Whole gland (WG) segmentation of the prostate plays a crucial role in detection, staging and treatment planning of prostate cancer (PCa). Despite promise shown by deep learning (DL) methods, they rely on the availability of a considerable amount of annotated data. Augmentation techniques such as translation and rotation of images present an alternative to increase data availability. Nevertheless, the amount of information provided by the transformed data is limited due to the correlation between the generated data and the original. Based on the recent success of generative adversarial networks (GAN) in producing synthetic images for other domains as well as in the medical domain, we present…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer sciencePipeline (computing)Computer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognition02 engineering and technology030218 nuclear medicine & medical imagingMachine Learning (cs.LG)03 medical and health sciencesProstate cancer0302 clinical medicineProstate020204 information systems0202 electrical engineering electronic engineering information engineeringmedicineFOS: Electrical engineering electronic engineering information engineeringSegmentationbusiness.industryDeep learningImage and Video Processing (eess.IV)Pattern recognitionImage segmentationElectrical Engineering and Systems Science - Image and Video Processingmedicine.diseaseData availabilitymedicine.anatomical_structureArtificial intelligencebusinessT2 weighted

researchProduct

Deep Learning Based Cardiac MRI Segmentation: Do We Need Experts?

2021

Deep learning methods are the de facto solutions to a multitude of medical image analysis tasks. Cardiac MRI segmentation is one such application, which, like many others, requires a large number of annotated data so that a trained network can generalize well. Unfortunately, the process of having a large number of manually curated images by medical experts is both slow and utterly expensive. In this paper, we set out to explore whether expert knowledge is a strict requirement for the creation of annotated data sets on which machine learning can successfully be trained. To do so, we gauged the performance of three segmentation models, namely U-Net, Attention U-Net, and ENet, trained with dif…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceProcess (engineering)GeneralizationIndustrial engineering. Management engineeringComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognitionheartannotated data setT55.4-60.8Machine learningcomputer.software_genre030218 nuclear medicine & medical imagingTheoretical Computer ScienceMachine Learning (cs.LG)Set (abstract data type)03 medical and health sciences0302 clinical medicineFOS: Electrical engineering electronic engineering information engineeringSegmentationNumerical AnalysisArtificial neural networkbusiness.industryDeep learningsegmentationImage and Video Processing (eess.IV)deep learningQA75.5-76.95Electrical Engineering and Systems Science - Image and Video ProcessingComputational MathematicsHausdorff distanceComputational Theory and MathematicsIndex (publishing)Electronic computers. Computer scienceArtificial intelligencebusinesscomputer030217 neurology & neurosurgeryMRI

researchProduct

Warped Gaussian Processes in Remote Sensing Parameter Estimation and Causal Inference

2018

This letter introduces warped Gaussian process (WGP) regression in remote sensing applications. WGP models output observations as a parametric nonlinear transformation of a GP. The parameters of such a prior model are then learned via standard maximum likelihood. We show the good performance of the proposed model for the estimation of oceanic chlorophyll content from multispectral data, vegetation parameters (chlorophyll, leaf area index, and fractional vegetation cover) from hyperspectral data, and in the detection of the causal direction in a collection of 28 bivariate geoscience and remote sensing causal problems. The model consistently performs better than the standard GP and the more a…

FOS: Computer and information sciencesComputer Science - Machine LearningHeteroscedasticityRemote sensing applicationComputer scienceComputer Vision and Pattern Recognition (cs.CV)Maximum likelihoodComputer Science - Computer Vision and Pattern Recognition0211 other engineering and technologies02 engineering and technologyBivariate analysis010501 environmental sciences01 natural sciencesMachine Learning (cs.LG)Data modelingsymbols.namesakeElectrical and Electronic EngineeringGaussian process021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensingParametric statisticsEstimation theoryHyperspectral imagingGeotechnical Engineering and Engineering GeologyConfidence intervalCausal inferencesymbolsIEEE Geoscience and Remote Sensing Letters

researchProduct