2.1. LOOKING FOR BOOKS ON AMAZON

When Two Books are Related (according to Amazon)

People who shop at amazon.com are familiar with suggestions such as "You might like these books …" (when you first visit the site) or "People who bought this book also bought …" (when completing an order). This is all done by a program that keeps track of which books people buy and links books that were bought by the same person, as illustrated in Figure 2.1.1.
In the example of the figure, two different customers bought both the first and the last book, so these two books are considered related. Of course, the computer does not draw colored lines between customers and books, but it does the equivalent by keeping arrays of data. In reality things are a bit more complicated, because books are marked as related depending on the percentage of common sales rather than on the absolute number of sales. Here is one way to do such a calculation.
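As a rough illustration of linking books by the percentage of common sales, the sketch below counts, for each pair of books, the customers who bought both and divides by the customers who bought either one (the Jaccard ratio). The choice of this ratio, the function name `related_books`, the 0.5 threshold, and the sample purchase data are all assumptions made for the example; nothing here is known to be what amazon.com actually uses.

```python
from collections import defaultdict
from itertools import combinations


def related_books(purchases, threshold=0.5):
    """Return {(book_a, book_b): share} for pairs whose share of common
    buyers is at least `threshold`.

    purchases: dict mapping a customer to the set of books they bought.
    share = (customers who bought both) / (customers who bought either).
    This ratio is an illustrative assumption, not Amazon's formula.
    """
    # Invert the data: for each book, the set of customers who bought it.
    buyers = defaultdict(set)
    for customer, books in purchases.items():
        for book in books:
            buyers[book].add(customer)

    related = {}
    for a, b in combinations(sorted(buyers), 2):
        common = buyers[a] & buyers[b]
        either = buyers[a] | buyers[b]
        share = len(common) / len(either)
        if common and share >= threshold:
            related[(a, b)] = share
    return related


# Hypothetical data: three customers, four books.
purchases = {
    "customer 1": {"Book A", "Book D"},
    "customer 2": {"Book A", "Book B", "Book D"},
    "customer 3": {"Book B", "Book C"},
}

for pair, share in related_books(purchases).items():
    print(pair, f"{share:.0%}")
```

With this made-up data, Book A and Book D are linked because the same two customers bought both. Book B and Book C are also linked, even though only a single customer bought both, because Book C has so few buyers overall. A ratio like this is sensitive to small sales numbers.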
There are several other methods for making recommendations on the basis of customer preferences, and they are referred to as Collaborative Filtering. The "Collaborative" refers to the use of several kinds of information that are available from earlier customer actions. It is to be contrasted with Content-based Filtering, which relies on information about the content of various items: books, movies, music, etc. Content-based filtering requires human effort to add labels to the computer record of each item. Because human labor is more expensive, content-based filtering is used by few merchants. When you shop at Amazon, the important thing to remember is that customer actions, rather than any content analysis, determine, in the eyes of the seller, whether two books are on related subjects or not.

Here is a story that illustrates how such methods may fail. I am a member of a reading group that decided to read, in successive months, two books that were unrelated to each other and not particularly popular. As a result, the sales to our group were a significant part of the overall sales of the two books, and that led amazon.com to recommend each book as related to the other.