AE 08: Data science ethics - Misrepresentation

Application exercise
Important

Go to the course GitHub organization and locate the repo titled ae-08-YOUR_GITHUB_USERNAME to get started.

This AE is due Friday, Oct 7 at 11:59pm.

Packages

library(tidyverse)

Part 1 - People’s Poll

GB News tweeted the following on Aug 26, 2022.

  • Question: What is wrong with the visualization above?

Add your response here.

  • Your turn (5 minutes): The data from this poll are at data/gbpoll.csv. First, load the data and confirm that the number of responses matches the number mentioned in the tweet.

    # add code here

    Then, confirm that the proportions of intended votes match those mentioned in the tweet.

    # add code here
  • Demo: Recreate the visualization from the tweet. You do not need to match the colors precisely, but your bars should be correctly scaled.

    # add code here
  • Your turn (10 minutes): Improve the visualization. State the improvements you made and why you made them. Discuss how these improvements help make the plot less misleading.

    # add code here
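As a rough sketch of the Part 1 workflow, the code below counts responses, converts the counts to proportions, and draws a bar chart whose bars start at zero. It uses a small made-up tibble in place of data/gbpoll.csv. The column name `vote`, the category labels, and the object names are all assumptions for illustration and may not match the real file.

```r
library(tidyverse)

# Hypothetical stand-in for data/gbpoll.csv: one row per respondent.
# The column name `vote` and the category labels are assumptions.
toy_poll <- tibble(
  vote = c(rep("Option A", 6), rep("Option B", 3), rep("Don't know", 1))
)

# Number of responses (compare this with the sample size stated in the tweet)
n_responses <- nrow(toy_poll)

# Count each answer and convert the counts to proportions
vote_props <- toy_poll |>
  count(vote) |>
  mutate(prop = n / sum(n))

# geom_col() draws bars from zero, so bar heights stay proportional to the data
p_poll <- ggplot(vote_props, aes(x = vote, y = prop)) +
  geom_col() +
  scale_y_continuous(labels = scales::label_percent(), limits = c(0, 1)) +
  labs(x = NULL, y = "Share of respondents")

p_poll
```

With the real data, the same steps would start from `read_csv("data/gbpoll.csv")` and use whatever column actually holds the intended vote.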

Part 2 - Private sector

The following chart was shared by @GraphCrimes on Twitter on September 3, 2022.

  • Question: What is misleading about this graph?

Add your response here.

  • Your turn (6 minutes): If you needed to recreate this plot, with improvements to avoid its misleading pitfalls, what data would you need? How many variables? How many observations? Can you find the data online? Try looking for it for at least 3 minutes with a partner.

Add your response here.

  • Demo: Load the data for this survey from data/survation.csv. First, confirm that the data match the percentages from the visualization. Then, recreate the visualization and improve it. Does the improved visualization look different from the original? Does it send a different message at first glance?

    # add code here
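One possible way to check percentages within groups and plot them on a shared scale is sketched below. It uses a made-up tibble rather than data/survation.csv. The columns `item` and `response`, their values, and the object names are hypothetical, and the real file may be structured differently.

```r
library(tidyverse)

# Hypothetical stand-in for data/survation.csv: one row per answer.
# The columns `item` and `response` and all values are assumptions.
toy_survey <- tibble(
  item = c(rep("Item 1", 4), rep("Item 2", 4)),
  response = c("Yes", "Yes", "Yes", "No", "Yes", "No", "No", "No")
)

# Percentage of each response within each item
response_pcts <- toy_survey |>
  count(item, response) |>
  group_by(item) |>
  mutate(pct = 100 * n / sum(n)) |>
  ungroup()

# Stacked bars on a common 0-100 scale, so bar lengths are comparable across items
p_survey <- ggplot(response_pcts, aes(x = pct, y = item, fill = response)) +
  geom_col() +
  labs(x = "Percent of respondents", y = NULL, fill = "Response")

p_survey
```

Because every bar spans the same 0 to 100 range, a reader can compare segments across items by length alone, which is the property a misleading chart often breaks.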