Object recognition algorithm for multiple instances using the Generalized Hough Transform (GHT) and Scale-invariant feature transform (SIFT) descriptors.
The general idea behind the Hough Transform is that objects are found through a voting procedure in a configuration or Hough space. If evidence for a particular object configuration is found in the image, a vote is added for this configuration. In our case such evidence is a matched feature vector from the object image to the scene image. The location, orientation and scale of the feature with respect to the object's coordinate system is known from the object image. If the feature is also found somewhere in the scene image, it is possible to calculate the likely location, orientation and scale of the whole object and add a "vote" for that configuration. After all matched feature vectors have voted, clusters in the configuration space correspond to possible object instances