Definition
An image descriptor is a vector representing concisely the content of an image. A similarity measure is a function estimating the similarity between two objects, usually represented by vectors.
Background
Image descriptors are an ubiquitous tool in computer vision. By representing the content of an image or an image region in a compact and robust way, they make matching problems more efficient, as shown in Figs. 1 and 2. Typically, a descriptor is used to query a set of descriptors, looking for the most similar descriptor in the set. Efficient algorithms, such as hashing, can be used to make this search extremely fast, even for large databases, when the Euclidean distance can be used as similarity measure. Applications range from simultaneous localization and mapping (SLAM) and Structure from Motion (SfM) to image retrieval and object recognition.
References
Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. Int J Comput Vis 60:63–86
Arandjelović R, Gronat P, Torii A, Pajdla T, Sivic J (2016) NetVLAD: CNN architecture for weakly supervised place recognition. In: Proceedings of the conference on computer vision and pattern recognition
Lindeberg T (1998) Principles for automatic scale selection. Technical Report ISRN KTH NA/P–98/14–SE, KTH (Royal Institute of Technology)
Schmid C, Mohr R (1997) Local grayvalue invariants for image retrieval. IEEE Trans Pattern Anal Mach Intell 19(5):530–534
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 20(2): 91–110
Mikolajczyk K, Schmid C (2005) A performance evaluation of local descriptors. IEEE Trans Pattern Anal Mach Intell 10(27):1615–1630
Bay H, Ess A, Tuytelaars T, Van Gool L (2008) SURF: speeded up robust features. Comput Vis Image Underst 10(3):346–359
Tola E, Lepetit V, Fua P (2010) Daisy: an efficient dense descriptor applied to wide baseline stereo. IEEE Trans Pattern Anal Mach Intell 32(5):815–830
Lindeberg T, Garding J (1997) Shape-adapted smoothing in estimation of 3-D shape cues from affine deformations of local 2-D brightness structure. Image Vis Comput 15(6):415–434
Baumberg A (2000) Reliable feature matching across widely separated views. In: Proceedings of the conference on computer vision and pattern recognition, pp 774–781
Zabih R, Woodfill J (1994) Non parametric local transforms for computing visual correspondences. In: Proceedings of the European conference on computer vision, pp 151–158, May 1994
Ojala T, Pietikäinen M, Mäenpää T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987
Calonder M, Lepetit V, Ozuysal M, Trzcinski T, Strecha C, Fua P (2012) BRIEF: computing a local binary descriptor very fast. IEEE Trans Pattern Anal Mach Intell 34(7):1281–1298
Rublee E, Rabaud V, Konolidge K, Bradski G (2011) ORB: an efficient alternative to SIFT or SURF. In: Proceedings of the international conference on computer vision
Balntas V, Lenc K, Vedaldi A, Mikolajczyk K (2017) HPatches: a benchmark and evaluation of handcrafted and learned local descriptors. In: Proceedings of the conference on computer vision and pattern recognition
Sivic J, Zisserman A (2003) Video Google: a text retrieval approach to object matching in videos. In: Proceedings of the international conference on computer vision
Jégou H, Perronnin F, Douze M, Sanchez J, Pérez P, Schmid C (2012) VLAD: aggregating local image descriptors into compact codes. IEEE Trans Pattern Anal Mach Intell 34(9):1704–1716
Arandjelović R, Zisserman A (2012) Three things everyone should know to improve object retrieval. In: Proceedings of the conference on computer vision and pattern recognition
Strecha C, Bronstein A, Bronstein M, Fua P (2012) LDAHash: improved matching with smaller descriptors. IEEE Trans Pattern Anal Mach Intell 34(1): 66–78
Bellet A, Habrard A, Sebban M (2013) A survey on metric learning for feature vectors and structured data. arXiv Preprint
Bromley J, Guyon I, LeCun Y, Säckinger E, Shah R (1993) Signature verification using a siamese time delay neural network. In: Advances in neural information processing systems. Morgan Kaufmann, San Mateo, pp 737–744
Zagoruyko S, Komodakis N (2015) Learning to compare image patches via convolutional neural networks. In: Proceedings of the conference on computer vision and pattern recognition
Author information
Authors and Affiliations
Corresponding author
Section Editor information
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this entry
Cite this entry
Lepetit, V. (2020). Image Descriptors and Similarity Measures. In: Computer Vision. Springer, Cham. https://doi.org/10.1007/978-3-030-03243-2_797-1
Download citation
DOI: https://doi.org/10.1007/978-3-030-03243-2_797-1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03243-2
Online ISBN: 978-3-030-03243-2
eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering