Search results for "computer vision"

showing 10 items of 2353 documents

Zero-shot Semantic Segmentation using Relation Network

2021

Zero-shot learning (ZSL) is widely studied in recent years to solve the problem of lacking annotations. Currently, most studies on ZSL are for image classification and object detection. But, zero-shot semantic segmentation, pixel level classification, is still at its early stage. Therefore, this work proposes to extend a zero-shot image classification model, Relation Network (RN), to semantic segmentation tasks. We modified the structure of RN based on other state-of-the-arts semantic segmentation models (i.e. U-Net and DeepLab) and utilizes word embeddings from Caltech-UCSD Birds 200-2011 attributes and natural language processing models (i.e. word2vec and fastText). Because meta-learning …

hahmontunnistus (tietotekniikka)Meta learning (computer science)Computer scienceSemanticscomputer visionlcsh:Telecommunicationmeta-learninglcsh:TK5101-6720SegmentationWord2veczero-shot semantic segmentationkonenäközero-shot learningimage segmentationContextual image classificationbusiness.industrydeep learningPattern recognitionImage segmentationsemantic segmentationObject detectionkoneoppiminenrelation networkArtificial intelligencebusinessWord (computer architecture)

researchProduct

Time Unification on Local Binary Patterns Three Orthogonal Planes for Facial Expression Recognition

2019

International audience; Machine learning has known a tremendous growth within the last years, and lately, thanks to that, some computer vision algorithms started to access what is difficult or even impossible to perceive by the human eye. While deep learning based computer vision algorithms have made themselves more and more present in the recent years, more classical feature extraction methods, such as the ones based on Local Binary Patterns (LBP), still present a non negligible interest, especially when dealing with small datasets. Furthermore, this operator has proven to be quite useful for facial emotions and human gestures recognition in general. Micro-Expression (ME) classification is…

human eyeHistogramsgeometryUnificationComputer scienceLocal binary patternsoptimisationFeature extraction02 engineering and technologyhuman gestures recognitionFacial recognition systemcomputer visionVideos[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]time unification method03 medical and health sciences0302 clinical medicineMathematical modelLBPemotion recognition0202 electrical engineering electronic engineering information engineeringfacial emotionsfacial expression recognitionlocal binary patternsFace recognitionContextual image classificationArtificial neural networkbusiness.industryDeep learningdeep learning[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Pattern recognitionComputational modelingmicroexpression classificationInterpolationorthogonal planesneural netsmachine learning[INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]Micro expressionFeature extraction020201 artificial intelligence & image processinglearning (artificial intelligence)Artificial intelligencebusiness030217 neurology & neurosurgeryGestureimage classification

researchProduct

WATCHING PEOPLE: ALGORITHMS TO STUDY HUMAN MOTION AND ACTIVITIES

2020

Nowadays human motion analysis is one of the most active research topics in Computer Vision and it is receiving an increasing attention from both the industrial and scientific communities. The growing interest in human motion analysis is motivated by the increasing number of promising applications, ranging from surveillance, human–computer interaction, virtual reality to healthcare, sports, computer games and video conferencing, just to name a few. The aim of this thesis is to give an overview of the various tasks involved in visual motion analysis of the human body and to present the issues and possible solutions related to it. In this thesis, visual motion analysis is categorized into thr…

human motionSettore ING-INF/05 - Sistemi Di Elaborazione Delle Informazionihuman actvity recognitionaction recognition360°360 cameradeep learningtracking 360°computer vision360° camerahuman behaviors segmentation360-degreepedestrian trackinghuman motion trackingtime serietime-serie

researchProduct

Restoration and Enhancement of Historical Stereo Photos

2021

Restoration of digital visual media acquired from repositories of historical photographic and cinematographic material is of key importance for the preservation, study and transmission of the legacy of past cultures to the coming generations. In this paper, a fully automatic approach to the digital restoration of historical stereo photographs is proposed, referred to as Stacked Median Restoration plus (SMR+). The approach exploits the content redundancy in stereo pairs for detecting and fixing scratches, dust, dirt spots and many other defects in the original images, as well as improving contrast and illumination. This is done by estimating the optical flow between the images, and using it …

image denoisingComputer sciencemedia_common.quotation_subjectNoise reductionComputer applications to medicine. Medical informaticsR858-859.7ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONOptical flow02 engineering and technologyimage restorationArticleoptical flowgradient filteringPhotography0202 electrical engineering electronic engineering information engineeringRedundancy (engineering)historical photosContrast (vision)Radiology Nuclear Medicine and imagingComputer visionimage enhancementElectrical and Electronic EngineeringTR1-1050stereo matchingImage restorationmedia_commonSettore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioniguided supersamplingImage fusionSettore INF/01 - Informaticabusiness.industry020206 networking & telecommunicationsSupersamplingQA75.5-76.95stacked medianComputer Graphics and Computer-Aided DesignTransmission (telecommunications)Electronic computers. Computer science020201 artificial intelligence & image processingComputer Vision and Pattern RecognitionArtificial intelligencebusinessimage denoising image restoration image enhancement stereo matching optical flow gradient filtering stacked median guided supersampling historical photosJournal of Imaging

researchProduct

Space-Frequency Quantization for Image Compression With Directionlets

2007

The standard separable 2-D wavelet transform (WT) has recently achieved a great success in image processing because it provides a sparse representation of smooth images. However, it fails to efficiently capture 1-D discontinuities, like edges or contours. These features, being elongated and characterized by geometrical regularity along different directions, intersect and generate many large magnitude wavelet coefficients. Since contours are very important elements in the visual perception of images, to provide a good visual quality of compressed images, it is fundamental to preserve good reconstruction of these directional features. In our previous work, we proposed a construction of critic…

image orientation analysisMultiresolution analysisVideo RecordingComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONImage processingnonseparable transformsmultiresolution analysisRate–distortion theoryWaveletDVMsImage Interpretation Computer-AssistedComputer GraphicsComputer visionQuantization (image processing)image codingimage segmentationMathematicsbusiness.industryWavelet transformNumerical Analysis Computer-AssistedSignal Processing Computer-AssistedWTsData CompressionImage EnhancementComputer Graphics and Computer-Aided Designwavelet transformsdirectional vanishing momentsdirectional transformsArtificial intelligencebusinessAlgorithmsSoftwareImage compressionData compressionIEEE Transactions on Image Processing

researchProduct

A Cognitive Architecture for Robotic Hand Posture Learning

2005

This paper deals with the design and implementation of a visual control of a robotic system composed of a dexterous hand and video camera. The aim of the proposed system is to reproduce the movements of a human hand in order to learn complex manipulation tasks or to interact with the user. A novel algorithm for robust and fast fingertips localization and tracking is presented. A suitable kinematic hand model is adopted to achieve a fast and acceptable solution to an inverse kinematics problem. The system is part of a cognitive architecture for posture learning that integrates the perceptions by a high-level representation of the scene and of the observed actions. The anthropomorphic robotic…

imitation learningInverse kinematicsgesture recognitionbusiness.industryMachine visionComputer scienceCognitive architectureKinematicsAnthropometryCognitive architectureHuman–robot interactionComputer Science ApplicationsHuman-Computer Interactionrobotic visionControl and Systems EngineeringGesture recognitionRobot handComputer visionArtificial intelligenceElectrical and Electronic Engineeringbusinesshuman-robot interfaceSoftwareInformation SystemsGesture

researchProduct

Effects of menu structure and touch screen scrolling style on the variability of glance durations during in-vehicle visual search tasks.

2011

The effects of alternative navigation device display features on drivers' visual sampling efficiency while searching forpoints of interest were studied in two driving simulation experiments with 40 participants. Given that the number of display items was sufficient, display features that facilitate resumption of visual search following interruptions were expected to lead to more consistent in-vehicle glance durations. As predicted, compared with a grid-style menu, searching information in a list-style menu while driving led to smaller variance in durations of in-vehicle glances, in particular with nine item displays. Kinetic touch screen scrolling induced a greater number of very short in-v…

in-vehicle information systemAdultMaleEngineeringAutomobile DrivingVisual perceptionresumabilityInformationSystems_INFORMATIONINTERFACESANDPRESENTATION(e.g.HCI)Poison controlPhysical Therapy Sports Therapy and RehabilitationHuman Factors and Ergonomicsinterrupted visual searchajoneuvotietojärjestelmänäyttöStyle (sociolinguistics)User-Computer InterfaceYoung AdultInformation display systemsDistractionHumansComputer visionAttentionComputer Simulationta113Visual searchStructure (mathematical logic)Analysis of Variancebusiness.industryhäiriövaikutusvisual sampling strategydisplaykeskeytetty visuaalinen hakuScrollingData DisplayGeographic Information SystemsFemaleArtificial intelligencebusinesstiedon poimintastrategiadistractionErgonomics

researchProduct

Semi-blind Source Extraction Methods: Application to the measurement of non-contact physiological signs

2018

Non-contact physiological measurements are highlydesirable in many biomedical fields such asdiagnosis of infants, geriartic patients, patients withextreme physical trauma, and fitness and well-being.Remote photoplethysmography is increasingly beingused for non-contact measurement of heart rate fromvideos which is one of the most common biomedicalproperty required for most medical diagnosis. Oneof the common techniques for performing remotephotoplethysmography involves using Blind SourceSeparation (BSS) methods to extract the cardiacsignal from video data.In this context, the objective of this thesis is todevelop different methods in the field of extractionand separation of sources by improv…

integration of biophysical constraints[INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]L’analyse de composantes indépendantes contraint[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing[INFO.INFO-TS] Computer Science [cs]/Signal and Image ProcessingRemote photoplethysmographyL’analyse de composantes indépendantes[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Méthodes d’extraction semi-aveugleSemi-blind source extraction methodsIntègration des contraintes biophysiquesConstrained Independent Component AnalysisPhotopléthysmographie à distance

researchProduct

Editorial:Governance AI ethics

2022

intelligent systemslainsäädäntötekoälyorganizationartificial intelligenceethicsComputer Science ApplicationsHuman-Computer InteractionSociety 5.0AI ethicsgovernanceAIComputer Science (miscellaneous)Computer Vision and Pattern Recognitionetiikkalaw

researchProduct

Large-scale nonlinear dimensionality reduction for network intrusion detection

2017

International audience; Network intrusion detection (NID) is a complex classification problem. In this paper, we combine classification with recent and scalable nonlinear dimensionality reduction (NLDR) methods. Classification and DR are not necessarily adversarial, provided adequate cluster magnification occurring in NLDR methods like $t$-SNE: DR mitigates the curse of dimensionality, while cluster magnification can maintain class separability. We demonstrate experimentally the effectiveness of the approach by analyzing and comparing results on the big KDD99 dataset, using both NLDR quality assessment and classification rate for SVMs and random forests. Since data involves features of mixe…

intrusion detection[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV][ SPI.SIGNAL ] Engineering Sciences [physics]/Signal and Image processing[INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG][ INFO.INFO-CV ] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV][ INFO.INFO-LG ] Computer Science [cs]/Machine Learning [cs.LG][STAT.ML] Statistics [stat]/Machine Learning [stat.ML][INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]ComputingMethodologies_PATTERNRECOGNITION[STAT.ML]Statistics [stat]/Machine Learning [stat.ML][INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]Gower[SPI.SIGNAL]Engineering Sciences [physics]/Signal and Image processing[ STAT.ML ] Statistics [stat]/Machine Learning [stat.ML][SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processingdimensionality reduction

researchProduct