Search results for "video processing"

showing 10 items of 56 documents

Requirement analysis for an artificial intelligence model for the diagnosis of the COVID-19 from chest X-ray data

2021

There are multiple papers published about different AI models for the COVID-19 diagnosis with promising results. Unfortunately according to the reviews many of the papers do not reach the level of sophistication needed for a clinically usable model. In this paper I go through multiple review papers, guidelines, and other relevant material in order to generate more comprehensive requirements for the future papers proposing a AI based diagnosis of the COVID-19 from chest X-ray data (CXR). Main findings are that a clinically usable AI needs to have an extremely good documentation, comprehensive statistical analysis of the possible biases and performance, and an explainability module.

FOS: Computer and information sciencesComputer Science - Machine LearningComputer Vision and Pattern Recognition (cs.CV)tilastomenetelmätImage and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern RecognitionCOVID-19ennusteetlääketiedetekoälydiagnostiikkaElectrical Engineering and Systems Science - Image and Video Processingartificial intelligenceMachine Learning (cs.LG)data modelsclinical diagnosisstatistical analysisFOS: Electrical engineering electronic engineering information engineeringtilastolliset mallittietomallittietojärjestelmät2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
researchProduct

Learning With Context Feedback Loop for Robust Medical Image Segmentation

2021

Deep learning has successfully been leveraged for medical image segmentation. It employs convolutional neural networks (CNN) to learn distinctive image features from a defined pixel-wise objective function. However, this approach can lead to less output pixel interdependence producing incomplete and unrealistic segmentation results. In this paper, we present a fully automatic deep learning method for robust medical image segmentation by formulating the segmentation problem as a recurrent framework using two systems. The first one is a forward system of an encoder-decoder CNN that predicts the segmentation result from the input image. The predicted probabilistic output of the forward system …

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)Feature vectorComputer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONContext (language use)Convolutional neural networkMachine Learning (cs.LG)Feedback030218 nuclear medicine & medical imaging03 medical and health sciences0302 clinical medicineFOS: Electrical engineering electronic engineering information engineeringImage Processing Computer-Assisted[INFO.INFO-IM]Computer Science [cs]/Medical ImagingSegmentationElectrical and Electronic EngineeringComputingMilieux_MISCELLANEOUSRadiological and Ultrasound TechnologyPixelbusiness.industryDeep learningImage and Video Processing (eess.IV)Pattern recognitionImage segmentationElectrical Engineering and Systems Science - Image and Video ProcessingFeedback loopComputer Science ApplicationsFeature (computer vision)Neural Networks ComputerArtificial intelligencebusinessSoftware
researchProduct

Enforcing Perceptual Consistency on Generative Adversarial Networks by Using the Normalised Laplacian Pyramid Distance

2019

In recent years there has been a growing interest in image generation through deep learning. While an important part of the evaluation of the generated images usually involves visual inspection, the inclusion of human perception as a factor in the training process is often overlooked. In this paper we propose an alternative perceptual regulariser for image-to-image translation using conditional generative adversarial networks (cGANs). To do so automatically (avoiding visual inspection), we use the Normalised Laplacian Pyramid Distance (NLPD) to measure the perceptual similarity between the generated image and the original image. The NLPD is based on the principle of normalising the value of…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceImage qualitymedia_common.quotation_subjectComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern RecognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONMachine Learning (stat.ML)Translation (geometry)Image (mathematics)Machine Learning (cs.LG)Consistency (database systems)Statistics - Machine LearningPerceptionFOS: Electrical engineering electronic engineering information engineeringmedia_commonbusiness.industryDeep learningImage and Video Processing (eess.IV)Contrast (statistics)Pattern recognitionGeneral MedicineImage segmentationElectrical Engineering and Systems Science - Image and Video ProcessingGenerative Adversarial NetworkPerceptionArtificial intelligencebusiness
researchProduct

Improving prostate whole gland segmentation in t2-weighted MRI with synthetically generated data

2021

Whole gland (WG) segmentation of the prostate plays a crucial role in detection, staging and treatment planning of prostate cancer (PCa). Despite promise shown by deep learning (DL) methods, they rely on the availability of a considerable amount of annotated data. Augmentation techniques such as translation and rotation of images present an alternative to increase data availability. Nevertheless, the amount of information provided by the transformed data is limited due to the correlation between the generated data and the original. Based on the recent success of generative adversarial networks (GAN) in producing synthetic images for other domains as well as in the medical domain, we present…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer sciencePipeline (computing)Computer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognition02 engineering and technology030218 nuclear medicine & medical imagingMachine Learning (cs.LG)03 medical and health sciencesProstate cancer0302 clinical medicineProstate020204 information systems0202 electrical engineering electronic engineering information engineeringmedicineFOS: Electrical engineering electronic engineering information engineeringSegmentationbusiness.industryDeep learningImage and Video Processing (eess.IV)Pattern recognitionImage segmentationElectrical Engineering and Systems Science - Image and Video Processingmedicine.diseaseData availabilitymedicine.anatomical_structureArtificial intelligencebusinessT2 weighted
researchProduct

Deep Learning Based Cardiac MRI Segmentation: Do We Need Experts?

2021

Deep learning methods are the de facto solutions to a multitude of medical image analysis tasks. Cardiac MRI segmentation is one such application, which, like many others, requires a large number of annotated data so that a trained network can generalize well. Unfortunately, the process of having a large number of manually curated images by medical experts is both slow and utterly expensive. In this paper, we set out to explore whether expert knowledge is a strict requirement for the creation of annotated data sets on which machine learning can successfully be trained. To do so, we gauged the performance of three segmentation models, namely U-Net, Attention U-Net, and ENet, trained with dif…

FOS: Computer and information sciencesComputer Science - Machine LearningComputer scienceProcess (engineering)GeneralizationIndustrial engineering. Management engineeringComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognitionheartannotated data setT55.4-60.8Machine learningcomputer.software_genre030218 nuclear medicine & medical imagingTheoretical Computer ScienceMachine Learning (cs.LG)Set (abstract data type)03 medical and health sciences0302 clinical medicineFOS: Electrical engineering electronic engineering information engineeringSegmentationNumerical AnalysisArtificial neural networkbusiness.industryDeep learningsegmentationImage and Video Processing (eess.IV)deep learningQA75.5-76.95Electrical Engineering and Systems Science - Image and Video ProcessingComputational MathematicsHausdorff distanceComputational Theory and MathematicsIndex (publishing)Electronic computers. Computer scienceArtificial intelligencebusinesscomputer030217 neurology & neurosurgeryMRI
researchProduct

Graph Embedding via High Dimensional Model Representation for Hyperspectral Images

2021

Learning the manifold structure of remote sensing images is of paramount relevance for modeling and understanding processes, as well as to encapsulate the high dimensionality in a reduced set of informative features for subsequent classification, regression, or unmixing. Manifold learning methods have shown excellent performance to deal with hyperspectral image (HSI) analysis but, unless specifically designed, they cannot provide an explicit embedding map readily applicable to out-of-sample data. A common assumption to deal with the problem is that the transformation between the high-dimensional input space and the (typically low) latent space is linear. This is a particularly strong assump…

FOS: Computer and information sciencesComputer Science - Machine LearningI.5.2Computer Vision and Pattern Recognition (cs.CV)G.1.6I.5.4Image and Video Processing (eess.IV)0211 other engineering and technologiesComputer Science - Computer Vision and Pattern RecognitionI.4.702 engineering and technologyElectrical Engineering and Systems Science - Image and Video ProcessingI.4.10; I.5.2; G.1.6; I.4.7; I.5.4I.4.10Machine Learning (cs.LG)FOS: Electrical engineering electronic engineering information engineeringGeneral Earth and Planetary SciencesElectrical and Electronic Engineering021101 geological & geomatics engineering
researchProduct

PerceptNet: A Human Visual System Inspired Neural Network for Estimating Perceptual Distance

2019

Traditionally, the vision community has devised algorithms to estimate the distance between an original image and images that have been subject to perturbations. Inspiration was usually taken from the human visual perceptual system and how the system processes different perturbations in order to replicate to what extent it determines our ability to judge image quality. While recent works have presented deep neural networks trained to predict human perceptual quality, very few borrow any intuitions from the human visual system. To address this, we present PerceptNet, a convolutional neural network where the architecture has been chosen to reflect the structure and various stages in the human…

FOS: Computer and information sciencesComputer Science - Machine LearningVisual perceptionComputer scienceImage qualitymedia_common.quotation_subjectFeature extractionMachine Learning (stat.ML)02 engineering and technology01 natural sciencesConvolutional neural networkhuman visual systemMachine Learning (cs.LG)010309 opticsStatistics - Machine LearningPerception0103 physical sciences0202 electrical engineering electronic engineering information engineeringFOS: Electrical engineering electronic engineering information engineeringperceptual distancemedia_commonArtificial neural networkbusiness.industryDeep learningImage and Video Processing (eess.IV)Pattern recognitionElectrical Engineering and Systems Science - Image and Video Processingneural networksHuman visual system model020201 artificial intelligence & image processingArtificial intelligencebusiness
researchProduct

Deep Non-Line-of-Sight Reconstruction

2020

The recent years have seen a surge of interest in methods for imaging beyond the direct line of sight. The most prominent techniques rely on time-resolved optical impulse responses, obtained by illuminating a diffuse wall with an ultrashort light pulse and observing multi-bounce indirect reflections with an ultrafast time-resolved imager. Reconstruction of geometry from such data, however, is a complex non-linear inverse problem that comes with substantial computational demands. In this paper, we employ convolutional feed-forward networks for solving the reconstruction problem efficiently while maintaining good reconstruction quality. Specifically, we devise a tailored autoencoder architect…

FOS: Computer and information sciencesComputer Science - Machine Learningbusiness.industryComputer scienceComputer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern RecognitionNonlinear optics020207 software engineering02 engineering and technologyIterative reconstructionInverse problemElectrical Engineering and Systems Science - Image and Video ProcessingAutoencoderRendering (computer graphics)Machine Learning (cs.LG)Non-line-of-sight propagation0202 electrical engineering electronic engineering information engineeringFOS: Electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingComputer visionArtificial intelligencebusiness
researchProduct

Fully automated analysis of muscle architecture from B-mode ultrasound images with deep learning

2020

B-mode ultrasound is commonly used to image musculoskeletal tissues, but one major bottleneck is data interpretation, and analyses of muscle thickness, pennation angle and fascicle length are often still performed manually. In this study we trained deep neural networks (based on U-net) to detect muscle fascicles and aponeuroses using a set of labelled musculoskeletal ultrasound images. We then compared neural network predictions on new, unseen images to those obtained via manual analysis and two existing semi/automated analysis approaches (SMA and Ultratrack). With a GPU, inference time for a single image with the new approach was around 0.7s, compared to 4.6s with a CPU. Our method detects…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern RecognitionFOS: Electrical engineering electronic engineering information engineeringElectrical Engineering and Systems Science - Image and Video Processing
researchProduct

Segmentation-Free Estimation of Aortic Diameters from MRI Using Deep Learning

2020

Accurate and reproducible measurements of the aortic diameters are crucial for the diagnosis of cardiovascular diseases and for therapeutic decision making. Currently, these measurements are manually performed by healthcare professionals, being time consuming, highly variable, and suffering from lack of reproducibility. In this work we propose a supervised deep-learning method for the direct estimation of aortic diameters. The approach is devised and tested over 100 magnetic resonance angiography scans without contrast agent. All data was expert-annotated at six aortic locations typically used in clinical practice. Our approach makes use of a 3D+2D convolutional neural network (CNN) that ta…

FOS: Computer and information sciencesComputer Vision and Pattern Recognition (cs.CV)Image and Video Processing (eess.IV)Computer Science - Computer Vision and Pattern RecognitionFOS: Electrical engineering electronic engineering information engineeringR Medicina (General)Electrical Engineering and Systems Science - Image and Video Processing
researchProduct