Difference between revisions of "Univariate data analysis"

From Statistics for Engineering
Jump to navigation Jump to search
Line 14: Line 14:
* [[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2014-4C3-6C3-Visualizing-data.pdf]] [[Media:2014-4C3-6C3-Visualizing-data.pdf | Class notes 2014]]
* [[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2014-4C3-6C3-Visualizing-data.pdf]] [[Media:2014-4C3-6C3-Visualizing-data.pdf | Class notes 2014]]
* [http://learnche.org/pid/univariate-review/index Textbook, chapter 2]
* [http://learnche.org/pid/univariate-review/index Textbook, chapter 2]
<!--* [https://docs.google.com/document/d/1rZXJY7aybeng7H_ZngzHDsbBxn-uipvRPF6r9JdH-9g Quiz] and [https://docs.google.com/document/d/1ew6UwKP8SBM19jvdYfMxgSnRVIvUNuwnOSNlJoZ3Tg4 Solution]-->
* Quizzes (with solutions): attempt these after you have watched the videos
:{| class="wikitable"
|-
! Videos to watch first
! Quiz
! Solution
|-
| Watch videos 1, 2 and 3
| [https://docs.google.com/document/d/1UHoVz0Gi9eazCYpvtIzRlP79RmjtjxERS5xka5ZBkps Quiz]
| [https://docs.google.com/document/d/1lMSDnUVTZ6X-uIbC7iUc1UKk7QvQWrngm8LDm1r8RuI Solution]
|-
| Watch videos 4, 5 and 6
| [https://docs.google.com/document/d/19NnM1v7yut1q0HFaHMQmNg6zMH2UPN1rrHZl7l38qBI  Quiz]
| [https://docs.google.com/document/d/1YcF4PxxfQqnz4UqgGZMWv3zfFmmjB7uSkNfQ9XMe97A Solution]
|-
| Watch videos 7, 8 and 9
| [https://docs.google.com/document/d/1BySja0o1O8MtZOitV42BU2bnjWmzH_vARcXCbh5OviI Quiz]
| [https://docs.google.com/document/d/17jKKHwgVLkBT04R3OgUr9TqSzCGkxVYBP-fZtuiszig Solution]
|-
| Watch videos 10, 11, and 12
| [https://docs.google.com/document/d/1qMBqum4kdgLBK3dlzXncZJm42NKdp8pnsp9jxG_dKXA Quiz]
| [https://docs.google.com/document/d/19mCY3qCpVKLzvOBpa-c1t7JDzLTl0xagnivRpslnTD0 Solution]
|-
| Watch videos 13, 14
| [https://docs.google.com/document/d/1zxOUn30wTeg8xiJfIgfZ3WKgiUk3hqrnCV-sDTrlFiA Quiz]
| [https://docs.google.com/document/d/19RozNWVpmQo5oJOfw2SDhWZrJ2Sbpq2LuWFq8NsLN6g Solution]
|-
| Watch videos 15, 16
| [hhttps://docs.google.com/document/d/1ZJ1aZ-hvS11-k5btkBqb1X16RAihbFTDjX-fNX8iQm8 Quiz]
| [https://docs.google.com/document/d/10h2qavaw08AYC5qGAWyemXrgro0F-ARBaDnoE52S77Y Solution]
|}
* Complete steps 10, 11, 12 and 13 of the [http://learnche.mcmaster.ca/4C3/Software_tutorial software tutorial]
* Complete steps 10, 11, 12 and 13 of the [http://learnche.mcmaster.ca/4C3/Software_tutorial software tutorial]
* A [http://www.r-fiddle.org/#/fiddle?id=49eUbRMk demonstration of R]
* A [http://www.r-fiddle.org/#/fiddle?id=49eUbRMk demonstration of R]

Revision as of 18:53, 2 January 2016

Learning outcomes

  • The study of variability important to help answer: "what happened?"
  • Univariate tools such as the histogram, median, MAD, standard deviation, quartiles will be reviewed from prior courses (as a refresher)
  • The normal and t-distribution will be important in our work: what are they, how to interpret them, and use tables of these distributions
  • The central limit theorem will be explained conceptually: you cannot finish a course on stats without knowing the key result from this theorem.
  • Using and interpreting confidence intervals will be crucial in all the modules that follow.

Extended readings

  • New Boeing planes will generate 0.5 TB of data per flight. Read about this, and other sources of data: "every piece of that plane has an internet connection, from the engines to the flaps to the landing gear".
  • All students, but especially the 600-level students should read the article by Peter J. Rousseeuw, Tutorial to Robust Statistics it is easy to read, and contains so much useful content.

Resources

Videos to watch first Quiz Solution
Watch videos 1, 2 and 3 Quiz Solution
Watch videos 4, 5 and 6 Quiz Solution
Watch videos 7, 8 and 9 Quiz Solution
Watch videos 10, 11, and 12 Quiz Solution
Watch videos 13, 14 Quiz Solution
Watch videos 15, 16 [hhttps://docs.google.com/document/d/1ZJ1aZ-hvS11-k5btkBqb1X16RAihbFTDjX-fNX8iQm8 Quiz] Solution

Class videos from prior years

Videos from 2015

  1. Introduction [05:59]
  2. Histograms [04:50]
  3. Basic terminology [06:41]
  4. Outliers, medians and MAD [04:42]
  5. The central limit theorem [06:56]
  6. The normal distribution, and standardizing variables [05:54]
  7. Normal distribution notation and using tables and R [05:48]
  8. Checking if data are normally distributed [05:57]
  9. Introducing the idea of a confidence interval [covered in class]
  10. Confidence intervals when we don't know the variance [07:59]
  11. Interpreting the confidence interval [07:52]
  12. A worked example: calculating and interpreting the CI [03:37]
  13. A motivating example to see why tests for differences are important [08:29]
  14. The mathematical derivation for a confidence interval for differences [covered in class]
  15. Using the confidence interval to test for differences to solve the motivating example [covered in class]
  16. Confidence intervals for paired tests: theory and an example [11:59]
05:59 | Download video | Download captions | Script
04:50 | Download video | Download captions | Script
06:41 | Download video | Download captions | Script
04:42 | Download video | Download captions | Script
06:56 | Download video | Download captions | Script
05:54 | Download video | Download captions | Script
05:48 | Download video | Download captions | Script
05:57 | Download video | Download captions | Script
Covered in class | No video | Script
07:59 | Download video | Download captions | Script
07:52 | Download video | Download captions | Script
03:37 | Download video | Download captions | Script
08:29 | Download video | Download captions | Script
Audio only | No video | Script
Audio only | No video | Script
11:59 | Download video | Download captions | Script

Videos from 2014

See the webpage from 2014


Videos from 2013

See the webpage from 2013