Difference between revisions of "Visualizing process data"

From Statistics for Engineering
Jump to navigation Jump to search
 
(37 intermediate revisions by the same user not shown)
Line 1: Line 1:
__NOTOC__
<div class="noautonum">__TOC__</div>
== Learning outcomes ==


* Understand when it is appropriate to use scatter plots, bar plots, pie charts (hint: [http://www.perceptualedge.com/articles/08-21-07.pdf almost never]), and even tables.
* Learn an interesting, potentially new plot: the box plot, to summarize and compare data.
* How to effectively visualize up to 5 dimensions on a 2-D plot, as shown in a video by Hans Rosling.
* Know the meaning of words like sparklines, data density, and chart junk.


{{ClassSidebar
== Resources ==
| date =
 
| dates_alt_text =
* [[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2015-4C3-6C3-Data-visualization.pdf]]  [[Media:2015-4C3-6C3-Data-visualization.pdf | Class notes 2015]]
| course_notes_PDF = 2014-4C3-6C3-Visualizing-data.pdf
* [[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2014-4C3-6C3-Visualizing-data.pdf]] [[Media:2014-4C3-6C3-Visualizing-data.pdf | Class notes 2014]]
| course_notes_alt = Class slides
* [http://learnche.org/pid/data-visualization/index Textbook, chapter 1]
| overheads_PDF =
* Check your knowledge with these quizzes:
| assignment_alt = Course notes
** [[Media:Viz-quiz-01.pdf|Quiz 1]]  and [[Media:Viz-quiz-01-solution.pdf|Solution 1]]
| assignment_instructions = [http://learnche.org/pid/data-visualization/data-visualization Textbook Chapter 1]
** [[Media:Viz-quiz-02.pdf|Quiz 2]] and [[Media:Viz-quiz-02-solution.pdf|Solution 2]]
| assignment_solutions =  
** [https://docs.google.com/document/d/1rZXJY7aybeng7H_ZngzHDsbBxn-uipvRPF6r9JdH-9g Quiz 3] and [https://docs.google.com/document/d/1ew6UwKP8SBM19jvdYfMxgSnRVIvUNuwnOSNlJoZ3Tg4 Solution 3]
* Complete steps 1, 2 ... 9 of the [[Software_tutorial| software tutorial]]
 
== Extended readings ==
* [http://en.wikipedia.org/wiki/Sankey_diagram Sankey diagrams] for example, would make a great way to show energy utilization in your company, or even a mass balance superimposed on a flowsheet.  Here's a [http://bost.ocks.org/mike/sankey/ great example applied to the UK energy supply and demand].
* [http://vita.had.co.nz/papers/boxplots.pdf 40 years of boxplots]
* Why you should [http://www.perceptualedge.com/articles/08-21-07.pdf never have to use pie charts], an article by Stephen Few.
* This is one video you must watch for the course: [https://www.youtube.com/watch?v=jbkSRLYSojo Hans Rosling shows an incredible data visualization]
 
== Class videos from prior years ==
===Videos from 2015===
{{VideoBox
|ytid = aU6eZuiG8ck
|timing=07:31
|shortcut=vis-01
}}
}}
{{VideoBox
|ytid = tb20hIQlEBU
|timing=03:16
|shortcut=vis-02
}}
{{VideoBox
|ytid = LumUy2F_DRc
|timing=04:51
|shortcut=vis-03
}}
{{VideoBox
|ytid = JB8UP1JWNXQ
|timing=07:23
|shortcut=vis-04
}}
===Videos from 2014===
{{#widget:YouTube|id=ZDGhrLsprkU}}
{{#widget:YouTube|id=ervdpbipMjE}}
===Videos from 2013===
{{#widget:YouTube|id=FHt6fgwbT_c}}
{{#widget:YouTube|id=E7LM9fqY1cI}}
== Software codes for this section ==


=== Code to show how to superimpose plots  ===
[http://www.r-fiddle.org/#/fiddle?id=19PyJkpa&version=1 Run this code in a web-browser]


{| class="wikitable" style="text-align: center;"
<html><div data-datacamp-exercise data-lang="r">
|-
<code data-type="sample-code">
! Date
 
! Class number
# Run this code line-by-line (copy & paste) to understand the demonstration
! colspan="2"|Video  files
f <- 'http://openmv.net/file/raw-material-properties.csv'
! Reading (PID)
data <- read.csv(f)
! Slides
summary(data)
! Other materials
 
|-
# Single plot
| 2014
plot(data$density1)
| Vis-01
 
| [http://learnche.mcmaster.ca/media/2014-4C3-6C3-Class-01B.mp4 Video] (333 M)  
# Connect the dots
| rowspan="2"|[http://learnche.mcmaster.ca/pid/?source=Visualize Chapter 1]
plot(data$density1, type='b')
| rowspan="2"|[[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2014-4C3-6C3-Visualizing-data.pdf]] [[Media:2014-4C3-6C3-Visualizing-data.pdf|Slides for class]]
 
| rowspan="2"  style="text-align: left;"|
# Another variable
* [http://en.wikipedia.org/wiki/Sankey_diagram Sankey diagrams] for example, would make a great way to show energy utilization in your company, or even a mass balance superimposed on a flowsheet. <!--  Here's a [http://gnewton.ca/gn/sankey/sankey_canada_energy_flow_2007.html great example applied to Canada's energy use]. -->
plot(data$density2, type='b', col="red")
* [http://www.investopedia.com/articles/financial-theory/11/lie-with-financial-statistics.asp How To Lie With Financial Statistics], Investopedia, November 2011
 
* [http://vita.had.co.nz/papers/boxplots.pdf 40 years of boxplots]
# Superimpose them?
plot(data$density1, type='b', col="blue")
 
# Where's density2 ?
lines(data$density2, type='b', col="red") 


|-
# Superimpose them: limits
| 2014
plot(data$density1,
| Vis-02
    type='b',
| [http://learnche.mcmaster.ca/media/2014-4C3-6C3-Class-01C.mp4 Video] (298 M)  
    col="blue",
|}
    ylim=c(10, 45))


# Now density2 shows up
lines(data$density2,
      type='b',
      col="red")


{{ClassSidebar
</code>
| date = 2015
</div></html>
| dates_alt_text =
| vimeoID1 = 83723241
| vimeoID2 = 83855005
| course_notes_PDF = 2014-4C3-6C3-Visualizing-data.pdf
| course_notes_alt = Course slides
| overheads_PDF =
| assignment_instructions =
| assignment_solutions =
| video_download_link_MP4 = http://learnche.mcmaster.ca/media/2014-4C3-6C3-Class-01B.mp4
| video_download_link_MP4_size = 333 M
| video_notes1 =
| video_download_link2_MP4 = http://learnche.mcmaster.ca/media/2014-4C3-6C3-Class-01C.mp4
| video_download_link2_MP4_size = 298 M
| video_notes2 =
}}

Latest revision as of 19:11, 14 January 2019

Learning outcomes

  • Understand when it is appropriate to use scatter plots, bar plots, pie charts (hint: almost never), and even tables.
  • Learn an interesting, potentially new plot: the box plot, to summarize and compare data.
  • How to effectively visualize up to 5 dimensions on a 2-D plot, as shown in a video by Hans Rosling.
  • Know the meaning of words like sparklines, data density, and chart junk.

Resources

Extended readings

Class videos from prior years

Videos from 2015

07:31 | Download video | Download captions | Script
03:16 | Download video | Download captions | Script
04:51 | Download video | Download captions | Script
07:23 | Download video | Download captions | Script

Videos from 2014

Videos from 2013

Software codes for this section

Code to show how to superimpose plots

Run this code in a web-browser

# Run this code line-by-line (copy & paste) to understand the demonstration f <- 'http://openmv.net/file/raw-material-properties.csv' data <- read.csv(f) summary(data) # Single plot plot(data$density1) # Connect the dots plot(data$density1, type='b') # Another variable plot(data$density2, type='b', col="red") # Superimpose them? plot(data$density1, type='b', col="blue") # Where's density2 ? lines(data$density2, type='b', col="red") # Superimpose them: limits plot(data$density1, type='b', col="blue", ylim=c(10, 45)) # Now density2 shows up lines(data$density2, type='b', col="red")