1.10. Exercises

Question 1

The data shown here are the number of visits to a university website for a particular statistics course. There are 90 students in the class.

What are the names (type) of the 2 plots shown?
List any 2 interesting features in these data.

Question 2

What are the names of the axes on a bar plot?

Question 3

Which types of features can the human eye easily pick out of a time series plot?

Question 4

Why is the principle of minimizing “data ink” so important in an effective visualization? Give a scientific or engineering example of why this is important.

Question 5

Describe the main difference(s) between a bar chart and a histogram.

Question 6

Write out a list of any features that can turn a plot into a poor visualization. Think carefully about plots you have encountered in textbooks and scientific publications, or in the lab reports you might have recently created for a university or college course.

Question 7

This question is an extension to visualizing more than 3 variables. Investigate on your own the term “scatterplot matrix”, and draw one for the Food texture data set. See the car library in R, which provides the scatterplotMatrix function for creating an effective scatterplot matrix. List some bullet points that interpret the plot.

Question 8

Using the Website traffic data set:

Create a chart that shows the variability in website traffic for each day of the week.
Use the same data set to describe any time-based trends that are apparent.
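
As a starting point for the first part, the grouping step behind a per-weekday box plot can be sketched as follows. This is only an illustration under stated assumptions: the visit counts and the start date are made up and are not taken from the Website traffic data set, and the helper names `group_by_weekday` and `spread_summary` are hypothetical.

```python
from collections import defaultdict
from datetime import date, timedelta
from statistics import quantiles

# Made-up daily visit counts for 4 consecutive weeks (illustrative only;
# these are NOT values from the Website traffic data set).
start = date(2024, 1, 1)
visits = [
    31, 28, 25, 27, 22, 9, 12,
    35, 26, 29, 24, 20, 11, 8,
    29, 30, 23, 28, 25, 7, 14,
    33, 27, 26, 30, 19, 10, 9,
]


def group_by_weekday(start_day, counts):
    """Collect the counts into one list per weekday name."""
    groups = defaultdict(list)
    for offset, count in enumerate(counts):
        day = start_day + timedelta(days=offset)
        groups[day.strftime("%A")].append(count)
    return dict(groups)


def spread_summary(values):
    """Five-number summary: the quantities a box plot displays."""
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")
    return {"min": min(values), "q1": q1, "median": q2,
            "q3": q3, "max": max(values)}


by_day = group_by_weekday(start, visits)
for day_name, counts in by_day.items():
    print(day_name, spread_summary(counts))
```

Each weekday's summary corresponds to one box in a side-by-side box plot; with the real data, the same grouping could be passed to whichever plotting tool you prefer.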

Question 9

Load the room temperature dataset into R, Python or MATLAB, or whichever software tool you prefer to plot with.

Plot the 4 trajectories, FrontLeft, FrontRight, BackLeft and BackRight on the same plot.
Comment on any features you observe in your plot.
Be specific: describe how sparklines of these same data would improve the message the data are showing.
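
If sparklines are unfamiliar, the idea can be sketched in text form: each series is drawn as a small, word-sized line with no axes, and the series are stacked one per row. The sketch below is a rough illustration only. The temperature readings are made up (they are not from the room temperature dataset), and the `sparkline` helper is hypothetical.

```python
# Unicode block characters, from lowest to highest.
BLOCKS = "▁▂▃▄▅▆▇█"


def sparkline(values):
    """Map each value to a block character, scaled to the series' own range."""
    lo, hi = min(values), max(values)
    span = hi - lo
    if span == 0:
        # A constant series is drawn as a flat line.
        return BLOCKS[0] * len(values)
    scale = len(BLOCKS) - 1
    return "".join(BLOCKS[round((v - lo) / span * scale)] for v in values)


# Made-up readings for illustration only (NOT from the room temperature dataset).
readings = {
    "FrontLeft":  [295.2, 296.0, 296.8, 297.5, 296.9, 296.1, 295.4],
    "FrontRight": [294.8, 295.1, 295.9, 296.4, 296.6, 296.0, 295.3],
    "BackLeft":   [293.9, 294.5, 295.6, 296.2, 295.8, 294.9, 294.1],
    "BackRight":  [294.1, 294.4, 295.0, 295.9, 296.3, 295.5, 294.6],
}

for name, series in readings.items():
    print(f"{name:<11}{sparkline(series)}")
```

Note the design choice here: each sparkline is scaled to its own minimum and maximum, which emphasizes the shape of each trajectory but hides differences in level between them. Scaling all rows to a common range is the alternative when levels matter.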

Question 10

Load the six point board thickness dataset, available from the datasets website.

Plot a boxplot of the first 100 rows of data to match the figure in these notes.
Explain why the thick center line in the box plot is not symmetrical with the outer edges of the box.
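
One way to explore the second part numerically is to compute the quantities that a box plot draws. The sketch below assumes a small, made-up, right-skewed sample (not values from the six point board thickness dataset) and uses the "inclusive" quartile convention from Python's `statistics` module; other software may use a slightly different quartile definition.

```python
from statistics import quantiles

# Made-up, right-skewed measurements for illustration only
# (NOT values from the six point board thickness dataset).
sample = [2, 3, 3, 4, 4, 5, 6, 9, 14, 20]

# The box spans the 1st to the 3rd quartile; the thick line is the median.
q1, med, q3 = quantiles(sample, n=4, method="inclusive")

lower_gap = med - q1  # distance from the lower box edge to the median line
upper_gap = q3 - med  # distance from the median line to the upper box edge

print(f"Q1={q1}, median={med}, Q3={q3}")
print(f"lower gap={lower_gap}, upper gap={upper_gap}")
```

Comparing the two gaps for this sample, and then for the actual data, is a useful prompt for thinking about what the position of the median within the box says about the shape of the distribution.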

Question 11

Read the short, clearly written article by Stephen Few on the pitfalls of pie charts: Save the pies for dessert, https://www.perceptualedge.com/articles/08-21-07.pdf. The article presents an easy-to-read argument against pie charts that will hopefully convince you.

Here’s a great example from the Canada Revenue Agency that proves his point.

Question 12

Enrichment:

Watch this 20-minute video that shows how a 2-dimensional plot comes alive to show 5 dimensions of data. What are the 5 dimensions?
A condensed version, this 4-minute YouTube video, shows Hans Rosling giving a new perspective on the same data. This Economist article has some interesting background on Dr. Rosling, as does this page, which gives a selection of his work.