Course project
Revision as of 01:02, 31 October 2011 by Kevin Dunn (talk | contribs)
Project outline
Please email me by 4 November a 1 page outline for your course project. Include
- a paragraph describing your objective (e.g. optimize a process, investigate the effect of robust preprocessing on several datasets)
- the data you have available (see some potential data sources below),
- if you are simulating the data, let me know what the columns will contain, and how your simulation works in general
- how you plan to use latent variables methods (and possibly other methods) in your data analysis plan
You can always change your project topic after this date, so don't feel locked into a particular area.
Data sources/ideas
These data there are plenty of freely available data sets
- http://kaggle.com contains some excellent, real-world data sets
- http://data.gov
- http://toronto.ca/open
- Landsat image data
- I collect some smaller teaching datasets on my own website: http://datasets.connectmv.com
- I have other industrial-scale data sets available, which I can offer, depending on your topic.
Project presentations
You will present your project to the class on either 9 or 16 December. More details to come.
Project report
The project report is due in earlier January 2012. More details to come.