Deep neural networks and maximum likelihood search for approximate nearest neighbor in video-based image recognition


如何引用文章

全文:

开放存取 开放存取
受限制的访问 ##reader.subscriptionAccessGranted##
受限制的访问 订阅存取

详细

We analyzed the way to increase computational efficiency of video-based image recognition methods with matching of high dimensional feature vectors extracted by deep convolutional neural networks. We proposed an algorithm for approximate nearest neighbor search. At the first step, for a given video frame the algorithm verifies a reference image obtained when recognizing the previous frame. After that the frame is compared with a few number of reference images. Each next examined reference image is chosen so that to maximize conditional probability density of distances to the reference instances tested at previous steps. To decrease the required memory space we beforehand calculate only distances from all the images to small number of instances (pivots). When experimenting with either face photos from Labeled Faces in the Wild and PubFig83 datasets or with video data from YouTube Faces we showed that our algorithm allows accelerating the recognition procedure by 1.4–4 times comparing with known approximate nearest neighbor methods.

作者简介

A. Savchenko

National Research University Higher School of Economics, Laboratory of Algorithms and Technologies for Network Analysis

编辑信件的主要联系方式.
Email: avsavchenko@hse.ru
俄罗斯联邦, Nizhny Novgorod

补充文件

附件文件
动作
1. JATS XML

版权所有 © Allerton Press, Inc., 2017