As presented in the previous post, we have at our disposal a huge dataset containing the food purchases made in Tesco shops within the boundaries of London. Faced with all this data, we feel a little lost and ask ourselves where to begin. Then we suddenly remember that, during our Padawan training, we learned how to become a data vizard. We take out our finest tools and start making some visualizations to better understand our data. Many features are available for the purchases. We notice that the Tesco paper did not explore the different food categories in depth, so we decide to focus on them.
What we want to visualize
We wonder whether Tesco consumers buy the same products, in the same quantities, all over the city. In particular, we wonder:
- Are some food categories preferred in certain regions?
- Are others bought equally across the whole city?
- Can we draw a pattern from this visualization?
To try to answer these questions, we plot the distribution of each food category across London. For each ward, we take the mean fraction of purchases that fall into each food category.
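To make the per-ward mean concrete, here is a minimal sketch in plain Python. It assumes that each ward has several records (for example one per time period), each giving the fraction of purchases per food category. The records, ward names, category names and the helper `mean_fraction_per_ward` are all hypothetical and only illustrate the idea; they are not the actual Tesco columns or our exact pipeline.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical toy records: one row per (ward, period), holding the fraction
# of purchases that fall into each food category. Names and values are made up.
records = [
    {"ward": "Ward A", "fruit_veg": 0.25, "sweets": 0.15},
    {"ward": "Ward A", "fruit_veg": 0.35, "sweets": 0.05},
    {"ward": "Ward B", "fruit_veg": 0.10, "sweets": 0.30},
]


def mean_fraction_per_ward(rows, categories):
    """Return {ward: {category: mean fraction over that ward's rows}}."""
    grouped = defaultdict(lambda: defaultdict(list))
    for row in rows:
        for category in categories:
            grouped[row["ward"]][category].append(row[category])
    return {
        ward: {category: mean(values) for category, values in per_category.items()}
        for ward, per_category in grouped.items()
    }


ward_means = mean_fraction_per_ward(records, ["fruit_veg", "sweets"])
for ward, fractions in ward_means.items():
    print(ward, fractions)
```

The resulting dictionary holds one value per ward and per category, which is the kind of quantity that can then be used to colour each ward on a map.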
Visualizations
Each plot below shows, for every ward, the mean proportion of purchases belonging to a given food category. To see all the plots, you can click on this link. In addition, by moving your mouse over the map, you can see the exact mean proportion of purchases for that food category, along with the mean and median income of each ward.