Difference between revisions of "Visualizing process data"

From Statistics for Engineering
Jump to navigation Jump to search
 
(27 intermediate revisions by the same user not shown)
Line 1: Line 1:
<div class="noautonum">__TOC__</div>{{ClassSidebar
<div class="noautonum">__TOC__</div>  
| date =
| dates_alt_text =
| course_notes_PDF = 2015-4C3-6C3-Data-visualization.pdf
| course_notes_alt = Class slides
| overheads_PDF =
| assignment_alt = Reading
| assignment_instructions = [http://learnche.org/pid/data-visualization/data-visualization  Textbook Chapter 1]
| assignment_solutions =
}}
 
== Learning outcomes ==
== Learning outcomes ==


Line 16: Line 6:
* How to effectively visualize up to 5 dimensions on a 2-D plot, as shown in a video by Hans Rosling.
* How to effectively visualize up to 5 dimensions on a 2-D plot, as shown in a video by Hans Rosling.
* Know the meaning of words like sparklines, data density, and chart junk.
* Know the meaning of words like sparklines, data density, and chart junk.
== Resources ==
* [[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2015-4C3-6C3-Data-visualization.pdf]]  [[Media:2015-4C3-6C3-Data-visualization.pdf | Class notes 2015]]
* [[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2014-4C3-6C3-Visualizing-data.pdf]] [[Media:2014-4C3-6C3-Visualizing-data.pdf | Class notes 2014]]
* [http://learnche.org/pid/data-visualization/index Textbook, chapter 1]
* Check your knowledge with these quizzes:
** [[Media:Viz-quiz-01.pdf|Quiz 1]]  and [[Media:Viz-quiz-01-solution.pdf|Solution 1]]
** [[Media:Viz-quiz-02.pdf|Quiz 2]] and [[Media:Viz-quiz-02-solution.pdf|Solution 2]]
** [https://docs.google.com/document/d/1rZXJY7aybeng7H_ZngzHDsbBxn-uipvRPF6r9JdH-9g Quiz 3] and [https://docs.google.com/document/d/1ew6UwKP8SBM19jvdYfMxgSnRVIvUNuwnOSNlJoZ3Tg4 Solution 3]
* Complete steps 1, 2 ... 9 of the [[Software_tutorial| software tutorial]]


== Extended readings ==
== Extended readings ==
* [http://en.wikipedia.org/wiki/Sankey_diagram Sankey diagrams] for example, would make a great way to show energy utilization in your company, or even a mass balance superimposed on a flowsheet. <!--  Here's a [http://gnewton.ca/gn/sankey/sankey_canada_energy_flow_2007.html great example applied to Canada's energy use]. -->
* [http://en.wikipedia.org/wiki/Sankey_diagram Sankey diagrams] for example, would make a great way to show energy utilization in your company, or even a mass balance superimposed on a flowsheet.   Here's a [http://bost.ocks.org/mike/sankey/ great example applied to the UK energy supply and demand].
* [http://www.investopedia.com/articles/financial-theory/11/lie-with-financial-statistics.asp How To Lie With Financial Statistics], Investopedia, November 2011
* [http://vita.had.co.nz/papers/boxplots.pdf 40 years of boxplots]  
* [http://vita.had.co.nz/papers/boxplots.pdf 40 years of boxplots]  
* Why you should [http://www.perceptualedge.com/articles/08-21-07.pdf never have to use pie charts], an article by Stephen Few.
* Why you should [http://www.perceptualedge.com/articles/08-21-07.pdf never have to use pie charts], an article by Stephen Few.
* This is one video you must watch for the course: [https://www.youtube.com/watch?v=jbkSRLYSojo Hans Rosling shows an incredible data visualization]
* This is one video you must watch for the course: [https://www.youtube.com/watch?v=jbkSRLYSojo Hans Rosling shows an incredible data visualization]
== Test your knowledge ==
* [https://docs.google.com/document/d/1rZXJY7aybeng7H_ZngzHDsbBxn-uipvRPF6r9JdH-9g Quiz] and [https://docs.google.com/document/d/1ew6UwKP8SBM19jvdYfMxgSnRVIvUNuwnOSNlJoZ3Tg4 Solution]


== Class videos from prior years ==
== Class videos from prior years ==
===Videos from 2015===
===Videos from 2015===
{| class="wikitable"
{{VideoBox
|-
|ytid = aU6eZuiG8ck
! Watch video on YouTube (closed captioned)
|timing=07:31
! Script
|shortcut=vis-01
! Video download
}}
|-
{{VideoBox
| {{#widget:YouTube|id=aU6eZuiG8ck}}
|ytid = tb20hIQlEBU
| [http://yint.org/sfe/vis-01-script Script]
|timing=03:16
| [http://yint.org/sfe/vis-01.mp4 Download video]
|shortcut=vis-02
|-
}}
| {{#widget:YouTube|id=tb20hIQlEBU}}
{{VideoBox
| [http://yint.org/sfe/vis-02-script Script]
|ytid = LumUy2F_DRc
| [http://yint.org/sfe/vis-02.mp4 Download video]
|timing=04:51
|-
|shortcut=vis-03
| {{#widget:YouTube|id=LumUy2F_DRc}}
}}
| [http://yint.org/sfe/vis-03-script Script]
{{VideoBox
| [http://yint.org/sfe/vis-03.mp4 Download video]
|ytid = JB8UP1JWNXQ
|-
|timing=07:23
| {{#widget:YouTube|id=JB8UP1JWNXQ}}
|shortcut=vis-04
| [http://yint.org/sfe/vis-04-script Script]
}}
| [http://yint.org/sfe/vis-04.mp4 Download video]
 
|}
===Videos from 2014===
{{#widget:YouTube|id=ZDGhrLsprkU}}
{{#widget:YouTube|id=ervdpbipMjE}}


===Videos from 2013===
{{#widget:YouTube|id=FHt6fgwbT_c}}
{{#widget:YouTube|id=E7LM9fqY1cI}}


===Videos from 2014===
== Software codes for this section ==
{{#widget:Vimeo|id=83723241}}
 
{{#widget:Vimeo|id=83855005}}
=== Code to show how to superimpose plots  ===
[http://www.r-fiddle.org/#/fiddle?id=19PyJkpa&version=1 Run this code in a web-browser]
 
<html><div data-datacamp-exercise data-lang="r">
<code data-type="sample-code">
 
# Run this code line-by-line (copy & paste) to understand the demonstration
f <- 'http://openmv.net/file/raw-material-properties.csv'
data <- read.csv(f)
summary(data)
 
# Single plot
plot(data$density1)
 
# Connect the dots
plot(data$density1, type='b')
 
# Another variable
plot(data$density2, type='b', col="red")
 
# Superimpose them?
plot(data$density1, type='b', col="blue")
 
# Where's density2 ?
lines(data$density2, type='b', col="red") 
 
# Superimpose them: limits
plot(data$density1,
    type='b',
    col="blue",
    ylim=c(10, 45))
 
# Now density2 shows up
lines(data$density2,
      type='b',
      col="red")
 
</code>
</div></html>

Latest revision as of 19:11, 14 January 2019

Learning outcomes

  • Understand when it is appropriate to use scatter plots, bar plots, pie charts (hint: almost never), and even tables.
  • Learn an interesting, potentially new plot: the box plot, to summarize and compare data.
  • How to effectively visualize up to 5 dimensions on a 2-D plot, as shown in a video by Hans Rosling.
  • Know the meaning of words like sparklines, data density, and chart junk.

Resources

Extended readings

Class videos from prior years

Videos from 2015

07:31 | Download video | Download captions | Script
03:16 | Download video | Download captions | Script
04:51 | Download video | Download captions | Script
07:23 | Download video | Download captions | Script

Videos from 2014

Videos from 2013

Software codes for this section

Code to show how to superimpose plots

Run this code in a web-browser

# Run this code line-by-line (copy & paste) to understand the demonstration f <- 'http://openmv.net/file/raw-material-properties.csv' data <- read.csv(f) summary(data) # Single plot plot(data$density1) # Connect the dots plot(data$density1, type='b') # Another variable plot(data$density2, type='b', col="red") # Superimpose them? plot(data$density1, type='b', col="blue") # Where's density2 ? lines(data$density2, type='b', col="red") # Superimpose them: limits plot(data$density1, type='b', col="blue", ylim=c(10, 45)) # Now density2 shows up lines(data$density2, type='b', col="red")