Difference between revisions of "Visualizing process data"

From Statistics for Engineering
Jump to navigation Jump to search
Line 6: Line 6:
* How to effectively visualize up to 5 dimensions on a 2-D plot, as shown in a video by Hans Rosling.
* How to effectively visualize up to 5 dimensions on a 2-D plot, as shown in a video by Hans Rosling.
* Know the meaning of words like sparklines, data density, and chart junk.
* Know the meaning of words like sparklines, data density, and chart junk.
== Extended readings ==
* [http://en.wikipedia.org/wiki/Sankey_diagram Sankey diagrams] for example, would make a great way to show energy utilization in your company, or even a mass balance superimposed on a flowsheet.  Here's a [http://bost.ocks.org/mike/sankey/ great example applied to the UK energy supply and demand].
* [http://www.investopedia.com/articles/financial-theory/11/lie-with-financial-statistics.asp How To Lie With Financial Statistics], Investopedia, November 2011
* [http://vita.had.co.nz/papers/boxplots.pdf 40 years of boxplots]
* Why you should [http://www.perceptualedge.com/articles/08-21-07.pdf never have to use pie charts], an article by Stephen Few.
* This is one video you must watch for the course: [https://www.youtube.com/watch?v=jbkSRLYSojo Hans Rosling shows an incredible data visualization]


== Resources ==
== Resources ==
Line 21: Line 14:
* [https://docs.google.com/document/d/1rZXJY7aybeng7H_ZngzHDsbBxn-uipvRPF6r9JdH-9g Quiz] and [https://docs.google.com/document/d/1ew6UwKP8SBM19jvdYfMxgSnRVIvUNuwnOSNlJoZ3Tg4 Solution]
* [https://docs.google.com/document/d/1rZXJY7aybeng7H_ZngzHDsbBxn-uipvRPF6r9JdH-9g Quiz] and [https://docs.google.com/document/d/1ew6UwKP8SBM19jvdYfMxgSnRVIvUNuwnOSNlJoZ3Tg4 Solution]
* Complete steps 1, 2 ... 9 of the [http://learnche.mcmaster.ca/4C3/Software_tutorial software tutorial]
* Complete steps 1, 2 ... 9 of the [http://learnche.mcmaster.ca/4C3/Software_tutorial software tutorial]
== Extended readings ==
* [http://en.wikipedia.org/wiki/Sankey_diagram Sankey diagrams] for example, would make a great way to show energy utilization in your company, or even a mass balance superimposed on a flowsheet.  Here's a [http://bost.ocks.org/mike/sankey/ great example applied to the UK energy supply and demand].
* [http://www.investopedia.com/articles/financial-theory/11/lie-with-financial-statistics.asp How To Lie With Financial Statistics], Investopedia, November 2011
* [http://vita.had.co.nz/papers/boxplots.pdf 40 years of boxplots]
* Why you should [http://www.perceptualedge.com/articles/08-21-07.pdf never have to use pie charts], an article by Stephen Few.
* This is one video you must watch for the course: [https://www.youtube.com/watch?v=jbkSRLYSojo Hans Rosling shows an incredible data visualization]


== Class videos from prior years ==
== Class videos from prior years ==

Revision as of 07:36, 5 January 2016

Learning outcomes

  • Understand when it is appropriate to use scatter plots, bar plots, pie charts (hint: almost never), and even tables.
  • Learn an interesting, potentially new plot: the box plot, to summarize and compare data.
  • How to effectively visualize up to 5 dimensions on a 2-D plot, as shown in a video by Hans Rosling.
  • Know the meaning of words like sparklines, data density, and chart junk.

Resources

Extended readings

Class videos from prior years

Videos from 2015

07:31 | Download video | Download captions | Script
03:16 | Download video | Download captions | Script
04:51 | Download video | Download captions | Script
07:23 | Download video | Download captions | Script

Videos from 2014


Videos from 2013

Software codes for this section

Code to show how to superimpose plots

Try this code in a web-browser

# Run this code line-by-line (copy & paste) to understand the demonstration

data <- read.csv('http://openmv.net/file/raw-material-properties.csv')
summary(data)

# Single plot
plot(data$density1)

# Connect the dots
plot(data$density1, type='b')

# Another variable
plot(data$density2, type='b', col="red")

# Superimpose them?
plot(data$density1, type='b', col="blue")
lines(data$density2, type='b', col="red")  # where's density2 ?

# Superimpose them: limits
plot(data$density1, type='b', col="blue", ylim=c(10, 45))
lines(data$density2, type='b', col="red")  # now density2 shows up