This is related to google print application. In other words, we have many books out of which some books titles were different but content same. We need to figure out such cases efficiently. How?
When curiosity outbursts …!!!
This is related to google print application. In other words, we have many books out of which some books titles were different but content same. We need to figure out such cases efficiently. How?
What we can do is to have more attributes listed with every book, like the number of chapters, the list of chapters(in order), and check for similarity in them. At the highest level I think this is possible.
What is your similarity function? So, that means you need to compare one to one i.e. O(n^2) algo ? isn’t it?
For an application like google print, there are some millions of books. If you run an n^2 algo, it will take some years to find similar books. isn’t it?
Pa1