How to find related or duplicate items with IndexDen

Many applications and websites face the challenge of finding “related items”, such as:

  • Related articles in a blog
  • Related products in a shop
  • And so on

How do you determine a related item?
When comparing text objects, related items are determined by the percentage of text similarity. For instance, if text A is more than 80% similar to text B, the two text objects are considered related to each other.
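The threshold idea above can be sketched in a few lines of Python. The article does not say how the similarity percentage is measured, so this sketch assumes a simple word-overlap measure: the share of distinct words in text A that also occur in text B. The function names (`words`, `similarity`, `is_related`) are hypothetical and only illustrate the idea.

```python
import re


def words(text):
    """Lowercase the text and return its set of distinct words."""
    return set(re.findall(r"\w+", text.lower()))


def similarity(a, b):
    """Share of distinct words in `a` that also occur in `b` (0.0 to 1.0).

    Assumption: word overlap stands in for "text similarity" here;
    the article does not name a specific metric.
    """
    words_a, words_b = words(a), words(b)
    if not words_a:
        return 0.0
    return len(words_a & words_b) / len(words_a)


def is_related(a, b, threshold=0.8):
    """Texts are related when similarity is strictly above the threshold."""
    return similarity(a, b) > threshold


if __name__ == "__main__":
    text_a = "how to find related or duplicate items in a blog"
    text_b = "how to find related or duplicate items in a shop"
    print(similarity(text_a, text_b), is_related(text_a, text_b))
```

Note that this measure is not symmetric: comparing A to B can give a different value than comparing B to A when the texts differ in length.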

How can IndexDen help?

IndexDen recently added a new feature to its API: the quorum operator. With the quorum operator you can match documents that contain at least a given number of the words in a query.
For example, a search query like “the world is a wonderful place”/3 will match all documents that contain at least 3 of the 6 specified words.
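To make the quorum rule concrete, here is a small local simulation in Python. It does not call IndexDen and is not IndexDen's implementation; it only mimics the "at least N of the query words" rule described above. The tokenization (lowercasing, splitting on non-word characters) and the function name `quorum_match` are assumptions made for illustration.

```python
import re


def tokenize(text):
    """Lowercase the text and split it into words."""
    return re.findall(r"\w+", text.lower())


def quorum_match(query, threshold, document):
    """Return True if `document` contains at least `threshold`
    of the distinct words in `query`.

    Assumption: each query word is counted once, no matter how
    often it appears in the document.
    """
    query_words = set(tokenize(query))
    document_words = set(tokenize(document))
    return len(query_words & document_words) >= threshold


if __name__ == "__main__":
    query = "the world is a wonderful place"
    documents = [
        "What a wonderful world",   # shares: a, wonderful, world
        "A place to eat",           # shares: a, place
    ]
    for doc in documents:
        print(doc, "->", quorum_match(query, 3, doc))
```

With a threshold of 3, the first document matches because it shares three words with the query, while the second shares only two and does not.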

OK, let's try a real example with IndexDen.

