This textbook, Process Improvement using Data (freely available, Creative Commons), is the basis for an undergraduate course in data analysis at McMaster University. Chemical Engineering students in their final year and graduate students learn how to design experiments properly, analyze data using least squares and classical statistical tools, and develop good visualizations. You can access all the course materials, including lecture notes and class video recordings. All assignments and full solutions are available. The course material is used at several universities for undergraduate engineering statistics courses.

Videos of all the class materials are available from this YouTube playlist, to supplement the above textbook.


A collection of datasets is available that may be freely used in any way you like. I use these datasets in teaching assignments, but many of them are from real applications, and so contain problems such as missing values, strong collinearity and noise.

Latent Variable Methods

A graduate course in latent variable modelling and data analysis for engineering applications is available: full course notes, videos and data sets. All the literature references on latent variable methods are collected on a separate website.

R tutorial

An R tutorial is provided for students for self-study.