Difference between revisions of "Course project"
Jump to navigation
Jump to search
Kevin Dunn (talk | contribs) m |
Kevin Dunn (talk | contribs) m |
||
Line 9: | Line 9: | ||
These data there are plenty of freely available data sets | These data there are plenty of freely available data sets | ||
* http://kaggle.com | * http://kaggle.com contains some excellent, real-world data sets | ||
* http://data.gov | * http://data.gov | ||
* http://toronto.ca/open | * http://toronto.ca/open | ||
* [http://landsat.gsfc.nasa.gov/data/where.html Landsat image data] | * [http://landsat.gsfc.nasa.gov/data/where.html Landsat image data] | ||
* I collect some smaller teaching datasets on my own website: http://datasets.connectmv.com | * I collect some smaller teaching datasets on my own website: http://datasets.connectmv.com | ||
* I have other industrial-scale data sets available |
Revision as of 00:33, 31 October 2011
Project outline
Please email me by 4 November a 1 page outline for your course project. Include
- a paragraph describing your objective (e.g. optimize a process, investigate the effect of robust preprocessing on several datasets)
- the data you have available (see some potential data sources below)
- how you plan to use latent variables methods (and possibly other methods) in your data analysis plan
Data sources/ideas
These data there are plenty of freely available data sets
- http://kaggle.com contains some excellent, real-world data sets
- http://data.gov
- http://toronto.ca/open
- Landsat image data
- I collect some smaller teaching datasets on my own website: http://datasets.connectmv.com
- I have other industrial-scale data sets available