Just how to assess the similarity between two pictures?

Just how to assess the similarity between two pictures?

We have two team images for pet and dog. And every team have 2000 images for pet and dog correspondingly.

My objective is make an effort to cluster the pictures making use of k-means.

Assume image1 is x , and image2 is y .Here we have to gauge the similarity between any two images. what’s the way that is common determine between two pictures?

1 Response 1

Well, there a couple of therefore. lets go:

A – found in template matching:

Template Matching is linear and it is maybe perhaps not invariant to rotation (actually not really robust to it) however it is pretty simple and easy robust to sound for instance the people in photography taken with low lighting.

It is simple to implement these OpenCV Template that is using Matching. Bellow there are mathematical equations determining a few of the similarity measures (adapted for comparing 2 equal sized pictures) employed by cv2.matchTemplate:

1 – Sum Square Huge Difference

2 – Cross-Correlation

B – visual descriptors/feature detectors:

Numerous descriptors had been developed for pictures, their use that is main is register images/objects and seek out them various other scenes. But, still they provide lots of information on the image and had been utilized in student detection (A joint cascaded framework for simultaneous attention detection and attention state estimation) as well as seem it useful for lip reading (can not direct you to definitely it since I’m not yes it had been currently posted)

They detect points that may be regarded as features in images (appropriate points) the texture that is local of points and sometimes even their geometrical place to each other may be used as features.

You can easily get the full story if you want to keep research on Computer vision I recomend you check the whole course and maybe Rich Radke classes on Digital Image Processing and Computer Vision for Visual Effects, there is a lot of information there that can be useful for this hard working computer vision style you’re trying to take about it in Stanford’s Image Processing Classes (check handouts for classes 12,13 and 14)

1 – SIFT and SURF:

They are Scale Invariant practices, SURF is really a speed-up and version that is open of, SIFT is proprietary.


These are binary descriptors and they write my are really quick (primarily on processors by having a pop_count instruction) and may be utilized in a comparable option to SIFT and SURF. Additionally, i have utilized BRIEF features as substitutes on template matching for Facial Landmark Detection with a high gain on rate with no loss on precision for both the IPD in addition to KIPD classifiers, although i did not publish any one of it yet (and also this is simply an incremental observation from the future articles thus I don’t believe there is certainly harm in sharing).

3 – Histogram of Oriented Gradients (HoG):

This can be rotation invariant and is employed for face detection.

C – Convolutional networks that are neural

I am aware that you don’t wish to utilized NN’s but i do believe it really is reasonable to aim they’ve been REALLY POWERFULL, training a CNN with Triplet Loss may be actually good for learning a feature that is representative for clustering (and category).

Always check Wesley’s GitHub for an illustration of it is energy in facial recognition Triplet that is using Loss get features after which SVM to classify.

Additionally, if your condition with Deep Learning is computational expense, it is simple to find pre-trained levels with dogs and cats around.

D – check up on previous work:

This dogs and cats battle happens to be taking place for the time that is long. you should check solutions on Kaggle Competitions (Forum and Kernels), there have been 2 on cats and dogs this 1 and therefore One

E – Famous Measures:

  • SSIM Structural similarity Index
  • L2 Norm ( Or Euclidean Distance)
  • Mahalanobis Distance

F – check into other variety of features

Dogs and cats can be a simple to recognize by their ears and nose. size too but I experienced kitties as huge as dogs.

so not really that safe to utilize size.

You could take to segmenting the pictures into pets and back ground and try to do then area home analisys.

When you yourself have the full time, this guide right here: Feature Extraction & Image Processing for Computer Vision from Mark S. Nixon have much information about this sort of procedure

You can test Fisher Discriminant review and PCA to produce a mapping while the evaluate with Mahalanobis Distance or L2 Norm