6533b860fe1ef96bd12c38d8

RESEARCH PRODUCT

Fully automatic face recognition system using a combined audio-visual approach

Alberto AlbiolEdward J. DelpLuis Torres

subject

Audio miningDynamic time warpingModalitiesComputer sciencebusiness.industryShot (filmmaking)Speech recognitionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONVideo sequenceFacial recognition systemVideo trackingSignal ProcessingFuse (electrical)Computer visionArtificial intelligenceElectrical and Electronic Engineeringbusiness

description

This paper presents a novel audio and video information fusion approach that greatly improves automatic recognition of people in video sequences. To that end, audio and video information is first used independently to obtain confidence values that indicate the likelihood that a specific person appears in a video shot. Finally, a post-classifier is applied to fuse audio and visual confidence values. The system has been tested on several news sequences and the results indicate that a significant improvement in the recognition rate can be achieved when both modalities are used together.

https://doi.org/10.1049/ip-vis:20045082