Difference between revisions of "Software tutorial/Basic plots in R"

From Statistics for Engineering
Jump to navigation Jump to search
(Created page with "{{Navigation|Book=Software tutorial|previous=Basic data manipulation in R|current=Tutorial index|next=Plots with multiple series, colour, and legends}} __NOTOC__ <rst> <rst-o...")
 
 
(2 intermediate revisions by the same user not shown)
Line 6: Line 6:
<rst-options: 'reset-figures' = False/>
<rst-options: 'reset-figures' = False/>
A simple sequence plot
A simple sequence plot
------------------------
-------------------------


We will continue on with the data set described in the previous section.  Load the dataset and let's plot the column called ``Visits`` - there are 4 columns in the dataset, so we must be specific on which one to plot.
We will continue on with the data set described in the previous section.  Load the dataset and let's plot the column called ``Visits`` - there are 4 columns in the dataset, so we must be specific on which one to plot.
Line 12: Line 12:
.. code-block:: s
.. code-block:: s


website <- read.csv('http://datasets.connectmv.com/file/website-traffic.csv')
website <- read.csv('http://openmv.net/file/website-traffic.csv')
plot(website$Visits)
plot(website$Visits)
Line 42: Line 42:
and ``type="o"`` will connect (overplot) the lines and points.   
and ``type="o"`` will connect (overplot) the lines and points.   


Box plots
Box plots
Line 58: Line 57:


boxplot(website$Visits ~ website$DayOfWeek)
boxplot(website$Visits ~ website$DayOfWeek)
</rst>
</rst>
[[Image:website-traffic-boxplot-default.jpg|500px|center]]
[[Image:website-traffic-boxplot-default.jpg|500px|center]]
{{Navigation|Book=Software tutorial|previous=Basic data manipulation in R|current=Tutorial index|next=Plots with multiple series, colour, and legends}}

Latest revision as of 09:34, 13 January 2016

← Basic data manipulation in R (previous step) Tutorial index Next step: Plots with multiple series, colour, and legends →


<rst> <rst-options: 'toc' = False/> <rst-options: 'reset-figures' = False/> A simple sequence plot


We will continue on with the data set described in the previous section. Load the dataset and let's plot the column called ``Visits`` - there are 4 columns in the dataset, so we must be specific on which one to plot.

.. code-block:: s

website <- read.csv('http://openmv.net/file/website-traffic.csv') plot(website$Visits) Which produces this figure. Note that the defaults in R are to leave a lot of white space around the figure. We will show later how to remove that. </rst> [[Image: website-traffic-base.jpg|500px|center]] <rst> <rst-options: 'toc' = False/> <rst-options: 'reset-figures' = False/> That plot shows only the points (markers), in the order of the data set. What if you want lines between the points? Type ``help(plot)`` to find out more about the ``plot`` command. For example, it tells you there that ``type="p"`` will just show the points (the default setting for ``plot``): .. code-block:: s plot(website$Visits, type="p")


If you use ``type="l"`` you get a line plot:

.. code-block:: s

plot(website$Visits, type="l") and ``type="b"`` will show both lines and points, leaving a space between the point and the line connections .. code-block:: s plot(website$Visits, type="b")

and ``type="o"`` will connect (overplot) the lines and points.

Box plots


The basic boxplot syntax is:

.. code-block:: s

boxplot(website$Visits) To get boxplots for each day of the week, side-by-side we rely on the fact the column, ``DayOfWeek``, is a categorical variable. R calls these ``factor`` variables, and you can confirm this: ``is.factor(website$DayOfWeek)`` returns ``TRUE``. We can then tell the ``boxplot`` command to group the boxplots by a factor variable. Read the help text, ``help(boxplot)``, and ``help(factor)`` in order to understand more clearly how this plot was generated.

.. code-block:: s

boxplot(website$Visits ~ website$DayOfWeek)

</rst>

Website-traffic-boxplot-default.jpg
← Basic data manipulation in R (previous step) Tutorial index Next step: Plots with multiple series, colour, and legends →