Difference between revisions of "Univariate data analysis"
Jump to navigation
Jump to search
Kevin Dunn (talk | contribs) |
Kevin Dunn (talk | contribs) |
||
Line 14: | Line 14: | ||
* [[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2014-4C3-6C3-Visualizing-data.pdf]] [[Media:2014-4C3-6C3-Visualizing-data.pdf | Class notes 2014]] | * [[Image:Nuvola_mimetypes_pdf.png|20px|link=Media:2014-4C3-6C3-Visualizing-data.pdf]] [[Media:2014-4C3-6C3-Visualizing-data.pdf | Class notes 2014]] | ||
* [http://learnche.org/pid/univariate-review/index Textbook, chapter 2] | * [http://learnche.org/pid/univariate-review/index Textbook, chapter 2] | ||
* Quizzes (with solutions): attempt these after you have watched the videos | |||
:{| class="wikitable" | |||
|- | |||
! Videos to watch first | |||
! Quiz | |||
! Solution | |||
|- | |||
| Watch videos 1, 2 and 3 | |||
| [https://docs.google.com/document/d/1UHoVz0Gi9eazCYpvtIzRlP79RmjtjxERS5xka5ZBkps Quiz] | |||
| [https://docs.google.com/document/d/1lMSDnUVTZ6X-uIbC7iUc1UKk7QvQWrngm8LDm1r8RuI Solution] | |||
|- | |||
| Watch videos 4, 5 and 6 | |||
| [https://docs.google.com/document/d/19NnM1v7yut1q0HFaHMQmNg6zMH2UPN1rrHZl7l38qBI Quiz] | |||
| [https://docs.google.com/document/d/1YcF4PxxfQqnz4UqgGZMWv3zfFmmjB7uSkNfQ9XMe97A Solution] | |||
|- | |||
| Watch videos 7, 8 and 9 | |||
| [https://docs.google.com/document/d/1BySja0o1O8MtZOitV42BU2bnjWmzH_vARcXCbh5OviI Quiz] | |||
| [https://docs.google.com/document/d/17jKKHwgVLkBT04R3OgUr9TqSzCGkxVYBP-fZtuiszig Solution] | |||
|- | |||
| Watch videos 10, 11, and 12 | |||
| [https://docs.google.com/document/d/1qMBqum4kdgLBK3dlzXncZJm42NKdp8pnsp9jxG_dKXA Quiz] | |||
| [https://docs.google.com/document/d/19mCY3qCpVKLzvOBpa-c1t7JDzLTl0xagnivRpslnTD0 Solution] | |||
|- | |||
| Watch videos 13, 14 | |||
| [https://docs.google.com/document/d/1zxOUn30wTeg8xiJfIgfZ3WKgiUk3hqrnCV-sDTrlFiA Quiz] | |||
| [https://docs.google.com/document/d/19RozNWVpmQo5oJOfw2SDhWZrJ2Sbpq2LuWFq8NsLN6g Solution] | |||
|- | |||
| Watch videos 15, 16 | |||
| [hhttps://docs.google.com/document/d/1ZJ1aZ-hvS11-k5btkBqb1X16RAihbFTDjX-fNX8iQm8 Quiz] | |||
| [https://docs.google.com/document/d/10h2qavaw08AYC5qGAWyemXrgro0F-ARBaDnoE52S77Y Solution] | |||
|} | |||
* Complete steps 10, 11, 12 and 13 of the [http://learnche.mcmaster.ca/4C3/Software_tutorial software tutorial] | * Complete steps 10, 11, 12 and 13 of the [http://learnche.mcmaster.ca/4C3/Software_tutorial software tutorial] | ||
* A [http://www.r-fiddle.org/#/fiddle?id=49eUbRMk demonstration of R] | * A [http://www.r-fiddle.org/#/fiddle?id=49eUbRMk demonstration of R] |
Revision as of 18:53, 2 January 2016
Learning outcomes
- The study of variability important to help answer: "what happened?"
- Univariate tools such as the histogram, median, MAD, standard deviation, quartiles will be reviewed from prior courses (as a refresher)
- The normal and t-distribution will be important in our work: what are they, how to interpret them, and use tables of these distributions
- The central limit theorem will be explained conceptually: you cannot finish a course on stats without knowing the key result from this theorem.
- Using and interpreting confidence intervals will be crucial in all the modules that follow.
Extended readings
- New Boeing planes will generate 0.5 TB of data per flight. Read about this, and other sources of data: "every piece of that plane has an internet connection, from the engines to the flaps to the landing gear".
- All students, but especially the 600-level students should read the article by Peter J. Rousseeuw, Tutorial to Robust Statistics it is easy to read, and contains so much useful content.
Resources
- Class notes 2015
- Class notes 2014
- Textbook, chapter 2
- Quizzes (with solutions): attempt these after you have watched the videos
Videos to watch first Quiz Solution Watch videos 1, 2 and 3 Quiz Solution Watch videos 4, 5 and 6 Quiz Solution Watch videos 7, 8 and 9 Quiz Solution Watch videos 10, 11, and 12 Quiz Solution Watch videos 13, 14 Quiz Solution Watch videos 15, 16 [hhttps://docs.google.com/document/d/1ZJ1aZ-hvS11-k5btkBqb1X16RAihbFTDjX-fNX8iQm8 Quiz] Solution
- Complete steps 10, 11, 12 and 13 of the software tutorial
- A demonstration of R
- Using tables of the normal distribution
Class videos from prior years
Videos from 2015
- Introduction [05:59]
- Histograms [04:50]
- Basic terminology [06:41]
- Outliers, medians and MAD [04:42]
- The central limit theorem [06:56]
- The normal distribution, and standardizing variables [05:54]
- Normal distribution notation and using tables and R [05:48]
- Checking if data are normally distributed [05:57]
- Introducing the idea of a confidence interval [covered in class]
- Confidence intervals when we don't know the variance [07:59]
- Interpreting the confidence interval [07:52]
- A worked example: calculating and interpreting the CI [03:37]
- A motivating example to see why tests for differences are important [08:29]
- The mathematical derivation for a confidence interval for differences [covered in class]
- Using the confidence interval to test for differences to solve the motivating example [covered in class]
- Confidence intervals for paired tests: theory and an example [11:59]
05:59 | Download video | Download captions | Script |
04:50 | Download video | Download captions | Script |
06:41 | Download video | Download captions | Script |
04:42 | Download video | Download captions | Script |
06:56 | Download video | Download captions | Script |
05:54 | Download video | Download captions | Script |
05:48 | Download video | Download captions | Script |
05:57 | Download video | Download captions | Script |
Covered in class | No video | Script |
07:59 | Download video | Download captions | Script |
07:52 | Download video | Download captions | Script |
03:37 | Download video | Download captions | Script |
08:29 | Download video | Download captions | Script |
Audio only | No video | Script |
Audio only | No video | Script |
11:59 | Download video | Download captions | Script |
Videos from 2014
Videos from 2013