library(tidyverse)
AE 08: Data science ethics - Misrepresentation
Packages
Part 1 - People’s Poll
GB News tweeted the following on Aug 26, 2022.
- Question: What is wrong with the visualization above?
Add your response here.
Your turn (5 minutes): The data from this poll are at
data/gbpoll.csv
. First, load the data and confirm the number of responses match those mentioned in the tweet.# add code here
Then, confirm that the proportions of intended votes match those mentioned in the tweet.
# add code here
Demo: Recreate the visualization from the tweet. You do not need to worry about matching the colors precisely and your bars should be correctly scaled.
# add code here
Your turn (10 minutes): Improve the visualization. State the improvements you made and why you made them. Discuss how these improvements help make the plot less misleading.
# add code here
Part 2 - Private sector
The following chart was shared by @GraphCrimes on Twitter on September 3, 2022.
- Question: What is misleading about this graph?
Add your response here.
- Your turn (6 minutes): If you needed to recreate this plot, with improvements to avoid its misleading pitfalls, what data do you need? How many variables? How many observations? Can you find the data online? Try looking for it for at least 3 minutes with a partner.
Add your response here.
- Demo: Load the data for this survey from
data/survation.csv
. First confirm that the data match the percentages from the visualization. Then, recreate the visualization, and improve it. Does the improved visualization look different than the original? Does it send a different message at a first glance?
# add code here