Stephen Baker of BusinessWeek has just published a book entitled The Numerati, and has a blog related to the book. The purpose of the book is to look how mathematicians are using data to to profile people in their shopping, voting, and even dating habits.
I am not exactly an unbiased reader of the book. I talked with Stephen during the writing of the book, and he asked me to review the two pages he wrote about “operations research” (I made a couple suggestions which didn’t make it into the final version: I guess this is my “cutting room floor” experience). He was kind enough to send me a review copy of the book, which I received a few weeks ago. He also accepted my invitation to speak here at CMU to the Tepper School Faculty and doctoral students.
The book is divided into chapters corresponding to the different uses of data: “Worker”, “Shopper”, “Voter”, “Terrorist”, “Patient” and “Lover”. For instance, in the “Voter” section, the emphasis is on predicting voter behavior. In the past (perhaps), geography and economics were very good predictors of voting behavior. Now, people seem much more in flux as to their behavior. Perhaps there are better predictors. Or perhaps there are useful clusterings of like-minded people that would respond to a particular pitch. If Barack Obama were to identify a cluster of “people who blog about obscure but important mathematical modeling methods” and would send a mailer (or email more likely) showing his deep understanding of operations research and a promise to use that phrase in his acceptance speech, then perhaps he would gain a crucial set of voters. Barack, are you listening?
I greatly enjoyed reading the book, and did so in one sitting. For someone like me who perhaps could be seen as one of the Numerati, there is not much technical depth to the book, but there are a number of good examples that could be used in the classroom or in conversation. There is a bit too much “The Numerati know much about you and can use it for good or EEEVVVIILLLL” for my taste, but perhaps I take comfort in understanding how poorly data mining and similar methods work in predicting individual behavior. The book is very much about modeling people, so essentially ignores the way operations research is used to automate business decisions and processes. This is a book primarily about what I would call data mining and clustering, so there are wide swathes of the “numerati” field that are not covered. But for a popular look on how our mathematics is used to characterize and predict human behavior, The Numerati is an extremely interesting book.