
RESEARCH PRODUCT

Deep Motion Model for Pedestrian Tracking in 360 Degrees Videos

Marco La Cascia, Liliana Lo Presti

subject

Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; 360 degree video; business.industry; Computer science; Tracking; Computer Science::Neural and Evolutionary Computation; ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION; 020206 networking & telecommunications; 02 engineering and technology; Pedestrian; Tracking (particle physics); Convolutional neural network; Motion (physics); Motion; 0202 electrical engineering electronic engineering information engineering; 020201 artificial intelligence & image processing; Computer vision; Artificial intelligence; business; CNN; equirectangular; ComputingMethodologies_COMPUTERGRAPHICS

description

This paper proposes a deep convolutional neural network (CNN) for pedestrian tracking in 360° videos based on the target's motion. The tracking algorithm takes advantage of a virtual Pan-Tilt-Zoom (vPTZ) camera simulated by means of the 360° video. The CNN takes as input a motion image, i.e. the difference between two images captured by the vPTZ camera at different times with the same pan, tilt and zoom parameters. The CNN predicts the adjustments to the vPTZ camera parameters required to keep the target at the center of the vPTZ camera view. Cross-validation experiments on a publicly available dataset demonstrate that the learned motion model generalizes, and that the proposed tracking algorithm achieves state-of-the-art performance.
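The two steps that surround the CNN in the description above, building the motion image and applying the predicted parameter adjustments, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `motion_image` and `update_vptz`, the use of a signed difference, the angle units (degrees), the additive zoom update and the zoom range are all assumptions made for illustration. The CNN itself and the rendering of a vPTZ view from the equirectangular frame are left out.

```python
import numpy as np


def motion_image(view_prev, view_curr):
    """Difference of two vPTZ views rendered with the same pan, tilt and zoom.

    Assumption: a signed float difference; the description does not say
    whether the difference is signed or absolute.
    """
    a = np.asarray(view_prev, dtype=np.float32)
    b = np.asarray(view_curr, dtype=np.float32)
    if a.shape != b.shape:
        raise ValueError("both views must have the same shape")
    return b - a


def update_vptz(params, delta, zoom_range=(1.0, 10.0)):
    """Apply predicted adjustments (d_pan, d_tilt, d_zoom) to (pan, tilt, zoom).

    Assumptions: angles are in degrees, pan wraps around the full 360-degree
    panorama, tilt is limited to [-90, 90], and zoom is updated additively
    and clipped to a hypothetical range.
    """
    pan, tilt, zoom = params
    d_pan, d_tilt, d_zoom = delta
    pan = (pan + d_pan + 180.0) % 360.0 - 180.0  # wrap to [-180, 180)
    tilt = float(np.clip(tilt + d_tilt, -90.0, 90.0))
    zoom = float(np.clip(zoom + d_zoom, zoom_range[0], zoom_range[1]))
    return (pan, tilt, zoom)
```

In a tracking loop under these assumptions, each new frame would be rendered with the current parameters, differenced against the previous view, passed to the trained network, and the predicted adjustments fed to `update_vptz` so that the target stays near the center of the view.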

https://doi.org/10.1007/978-3-030-30642-7_4