Image and Video Analysis

To aggregate or not to aggregate: selective match kernels for image search

International Conference on Computer Vision (Oral), December 2013.

This paper considers a family of metrics to compare images based on their local descriptors. It encompasses the VLAD descriptor and matching techniques such as Hamming Embedding. Making the bridge between these approaches leads us to propose a match kernel that takes the best of existing techniques by combining an aggregation procedure with a selective match kernel. Finally, the representation underpinning this kernel is approximated, providing a large scale image search both precise and scalable, as shown by our experiments on several benchmarks.

[ Bibtex ] [ PDF ]