Raymond Raymond

Plotting with R (Part II)

event 2020-09-23 visibility 366 comment 0 insights toc
more_vert
insights Stats
Plotting with R (Part II)

In Plotting with R (Part I), I summarized the functions that can be used in R plotting. In this part, we continue the journey to plot more rich and complex charts like Pie Chart, Bar Chart, BoxPlot, Histogram, Line and Scatterplot using those functions. 

Pie chart

Pie chart can be drawn using the following function:

pie(x, labels, radius, main, col, clockwise)

Pie chart with legends

The following code snippet (script R28.PieChart.R) draws a pie chart with legends:

# Draw pie charts
(
  data <- data.frame(Quarter=c('1st Qtr','2nd Qtr', '3rd Qtr','4th Qtr'), Sales=c(8.2,3.2,1.4,1.2))
)

# create image device
#png(file="R28.PieChart.png")

# percentage calculation
percentage <- paste(as.character( round(data$Sales/sum(data$Sales)*100, 1)), "%", sep="")

# colors
colors <- terrain.colors(length(data$Sales))

# draw 
pie(data$Sales, labels = percentage, main="Sales", col = colors)

# legends
legend("topright", legend = data$Quarter, cex=0.8, fill = colors)

The output looks like the following screenshot:

20200923110823-image.png

3D pie chart

We can use package plotrix to draw 3D pie chart:

# 3D Chart

if("plotrix" %in% installed.packages())
{
  print("plotrix is installed.")
}else
{
  print("plotrix is not installed.")
  install.packages("plotrix")
}

require(plotrix)

pie3D(data$Sales, labels = percentage, main="Sales", col = colors)

# legends
legend("topright", legend = data$Quarter, cex=0.8, fill = colors)
The chart looks like this:

20200923111429-image.png

Bar chart

Bar chart can be drawn using the following function:

barplot(H, xlab, ylab, main, names.arg, col)

The following code snippets (script R29.BarChart.R) draws bar charts using this function with different parameter values.

Plot bar chart

# Draw bar charts

Quarter <- c('1st Qtr','2nd Qtr', '3rd Qtr','4th Qtr')

(
  data <- data.frame(Quarter=Quarter, Sales=c(8.2,3.2,1.4,1.2))
)

# colors
colors <- terrain.colors(length(data$Sales))

# draw 
barplot(data$Sales, names.arg = data$Quarter, main="Sales", col = colors, xlab="Quarter", ylab="Sales ($m)")

# legends
legend("topright", legend = data$Quarter, cex=0.8, fill = colors)

The chart looks like the following:

20200923112311-image.png

Plot stacked bar chart

Matrix can be used as input for drawing stacked bar chart:

# Stacked bar chart
Year <- c(2015,2016,2017)

# Use matrix 
(
data <- matrix(c(8.2,3.2,1.4,1.2,9.2,1.2,5.4,3.2,1,3,4,6), nrow=4, ncol = 3)
)

# draw
barplot(data, names.arg = Year, main="Sales", col = colors, xlab="Year", ylab="Sales ($m)")

# legends
legend("topright", legend = Quarter, cex=0.8, fill = colors)

The chart looks like the following screenshot:

20200923112506-image.png

Box chart

Boxplots are a measure of how well distributed is the data in a data set. It divides the data set into three quartiles. This graph represents the minimum, maximum, median, first quartile and third quartile in the data set. It is also useful in comparing the distribution of data across data sets by drawing boxplots for each of them.

BoxPlot can be drawn using the following function:
boxplot(x, data, notch, varwidth, names, main)

The following code snippet (script R30.BoxPlot.R) draw a boxplot chart with test data set:

# Box Plot

#A data frame with 32 observations on 11 variables.
#
#[, 1]	 mpg	 Miles/(US) gallon
#[, 2]	 cyl	 Number of cylinders
#[, 3]	 disp	 Displacement (cu.in.)
#[, 4]	 hp	 Gross horsepower
#[, 5]	 drat	 Rear axle ratio
#[, 6]	 wt	 Weight (1000 lbs)
#[, 7]	 qsec	 1/4 mile time
#[, 8]	 vs	 V/S
#[, 9]	 am	 Transmission (0 = automatic, 1 = manual)
#[,10]	 gear	 Number of forward gears
#[,11]	 carb	 Number of carburetors

# Plot the chart.
boxplot(wt ~ cyl, data = mtcars, xlab = "Cylinders #",
        ylab = "Weight (1000 lbs)", main = "Weight and Cylinders Data", col=terrain.colors( length(unique(mtcars$cyl))))
The chart looks like this screenshot:
20200923111918-image.png

Histogram chart

A histogram represents the frequencies of values of a variable bucketed into ranges. Histogram is similar to bar chat but the difference is it groups the values into continuous ranges. Each bar in histogram represents the height of the number of values present in that range.

Histogram can be drawn using the following function:
hist(v,main,xlab,xlim,ylim,breaks,col,border) 
The following are two examples (script R31.Histogram.R) of plotting histogram using generated data (one via rnorm and another via sample):
# Histogram
# prepare data
(
  data1 <- rnorm(200,mean=40,sd=10)
)
hist(data1, col=c("green"))

(
  data2 <- sample.int(500,200)
)
hist(data2, col=c("green"))
The charts look like the following two screenshots:

20200923112930-image.png

infoWe can tell from the chart that rnorm does generate a sequence of data elements (variable) that aligns with normal distribution.
20200923112940-image.png

Line chart

Line can be drawn using the following functions:

plot() 
lines()
The following code snippet (script R32.Line.R) draws a simple line chart:
# Line chart
Quarter <- c('1st Qtr','2nd Qtr', '3rd Qtr','4th Qtr')
(
  data <- data.frame(Quarter=Quarter, Sales=c(8.2,3.2,1.4,1.2))
)
# colors
colors <- "green"

# draw 
plot(data$Sales, main="Sales", col = colors, xlab="Quarter", ylab="Sales ($m)", type="o")

#another line
lines((1:4),rnorm(4,2,1), type="b", col="blue")
20200923113651-image.png

Scatter chart

Scatterplots show many points plotted in the Cartesian plane. Each point represents the values of two variables. One variable is chosen in the horizontal axis and another in the vertical axis.

Scatterplot can be drawn using the following function:
plot(x, y, main, xlab, ylab, xlim, ylim, axes) 
pairs() # create matrix of scatterplots.

The following code snippet (script R33.Scatterplot.R) shows how to use these two functions to draw Scatterplot:

# Scatterplot
plot(y = mtcars$wt,x = mtcars$cyl,
     xlab = "Cylinder",
     ylab = "Weight",
     ylim = range(mtcars$wt),
     xlim = range(mtcars$cyl),		 
     main = "Weight & Cylinder #",
     col="green"
)


# pairs 
pairs(~mpg+disp+cyl,data = mtcars,
      main = "Scatterplot Matrix", col="green")

The charts look like the following screenshots:

20200923114035-image.png

20200923114105-image.png

Summary

There are many other charts that can be drawn in R. Refer to official documentation or help documents for more details. 

More from Kontext
comment Comments
No comments yet.

Please log in or register to comment.

account_circle Log in person_add Register

Log in with external accounts