STAT 29000

Project 4

due September 30, 2015

Put all of your solutions into one text file for your group.
For example, the file for project 4, group 1, should be called: p04g1.Rmd and
it should be stored in the folder: /proj/gpproj15/p04g1/

PREAMBLE

Please submit your solutions in R Markdown (i.e., with an Rmd file) at the end of the project. It is not necessary to use R Markdown while you are initially solving the problems, unless you want to… but please format the final submission for the project in R Markdown, by the time that you are finished.

NOTE: As in the previous project, in questions 8, 9, 10, it will be helpful to use {r cache=TRUE, cache.lazy=FALSE} instead of {r}.

This project is all about visualizing data.

Question 1

  1. Check out the website Many Eyes (sponsored by IBM). Find 3 (or more) separate plots on Many Eyes (please give links to each of these plots) that violate the concepts of effective data visualization that are discussed in the handouts from class (e.g., in Cleveland’s book and Robbins’s book, and in the paper “How to display data badly”). Write a paragraph about each plot, with a critique of what aspects of the plotting could be improved. Imagine, for instance, that you were going to correspond with the people who designed the plot, and give them guidance about how to make a more effective depiction of the data. (Your discussion of these 3 plots should be about 1/3 of a page per plot, i.e., about 1 page altogether; more than 1 page altogether is certainly allowed.) Each student should write about at least 1 plot.

Question 2

  1. Revisit the website Many Eyes (sponsored by IBM). Find 3 (or more) separate plots (again, with links to the plots) on Many Eyes that do an overall good job of effective data visualization. Justify the reasons why you think that the plots are effective. (Again, please write at least 1/3 of a page for each plot, i.e., one page total, justifying the reasons that you think each plot is effective.) Each student should write about at least 1 plot.

Question 3

  1. Check out the website Information Is Beautiful. Find 3 (or more) separate plots on Information Is Beautiful (please give links to each of these plots) that violate the concepts of effective data visualization. Write a paragraph about each plot, with a critique of what aspects of the plotting could be improved. Imagine you were going to correspond with the people who designed the plot, and give them guidance about how to make a more effective depiction of the data. Your constructive criticism should be at least 1/3 of a page per plot, i.e., at least 1 page altogether.

Question 4

  1. The Wealth and Health of Nations is a fun depiction of data. On the other hand, as with many depictions of data, it violates some of the techniques of effective data display. Please write an explanation of which techniques of effective data display are violated. If you imagine you are writing a constructive criticism to the authors of this animation, please make suggestions for how the depiction of data (for the health and wealth, over the years displayed) could have been done more effectively. Please make sure your explanation is at least 1 page long.

Question 5

  1. Describe (at least!) 3 very significant ways that the poster winner “Congestion in the sky” (from the Data Expo 2009 poster competition results) could be significantly improved, using the concepts of effective data visualization. Write a constructive criticism (of at least 1 page) that gives suggestions for improvement on each aspect that you criticize.

Question 6

  1. For the other posters (do not use the winner, “Congestion in the sky”, since it was discussed already in question 5), find a total of at least 3 significant ways that some of the other posters can be improved. You can analyze several different posters, that is OK. Your constructive critique should be at least 1 page.

Question 7

  1. Which of the posters in the Data Expo 2009 do you think should be the winner? Why? (It is OK if you choose the poster that actually won, or any of the other posters.) Thoroughly justify your answer, using the techniques of effective data visualization, to justify your answer, with an explanation that is at least 1 page long.

Questions 8, 9, 10

8., 9., 10. Imagine that you are going to enter the Data Expo 2009. Rather than having to organize your information into a poster, prepare 3 pages of analysis, exploring some aspects of the airline data set that are interesting to you, and which you think might be of broad interest to potential readers too. Your discussion and plots should be at least 3 pages long.