Unsupervised Machine Learning, Most Promising Ingredient Of Big Data
Orange (France Telecom), one of the largest mobile operators in the world, issued a challenge " Data for Development " by releasing a dataset of their subscribers in Ivory Coast. The dataset contained 2.5 billion records, calls and text messages exchanged between 5 million anonymous users in Ivory Coast, Africa. Various researchers got access to this dataset and submitted their proposals on how this data can be used for development purposes in Ivory Coast. It would be an understatement to say these proposals and projects were mind-blowing. I have never seen so many different ways of looking at the same data to accomplish so many different things. Here's a book [very large pdf] that contains all the proposals. My personal favorite is AllAborad where IBM researchers used the cell-phone data to redraw optimal bus routes . The researchers have used several algorithms including supervised and unsupervised machine learning to analyze the dataset resulting in a variety of scena...