Matrices in R Programming

A matrix is a table of numbers. In math text, it is conventional to denote matrices with bold letters. For example, consider a matrix D of the prices of the securities on first three days of the week.

1D = 
2     64    31 
3     65    28 
4     66    35
5

This matrix is {3\*2} matrix (pronounced "3 by 2") . The number of rows is given first, followed by the number of columns. The Matrix D shows that on the first day, the bond was worth $64 and the stock was worth $31. On the second day the bond was worth $65 and the stock $28. On the third day the bond was worth $66 and the stock $35.

With this data in place, we can answer many analytics questions considering someone was holding these assets in their portfolio. Let's learn how to use matrices in R and then how to perform statistical analysis on them.

Defining a Matrix in R

Let's say we have the above data in the form of a vector:

> price_data <- c(64,31,65,28,66,35)

We know that this data represents three rows with each row containing the bond price and the stock price. In R, we can use this vector to create a matrix using the matrix() function as shown below:

1#price data each number pair represents the bond an dstock price
2price_data <- c(64,31,65,28,66,35)
3#Create matrix of prices
4price_matrix <- matrix(price_data, nrow=3,byrow=TRUE)
5#print the matrix
6price_matrix
7

Matrix function parameters:

The first argument is the data that the matrix function will convert into rows and columns.
The second argument nrow indicates that the matrix will have three rows. A similar argument ncol could be used to indicate the number of columns.
The third argument byrow indicates how the data should be processed. byrow=TRUE indicates that it should be processed by row, i.e., as data comes in it will first fill the first row, then second and so on. To process data by columns, byrow should be set to FALSE.

When this matrix is printed, it will look as follows:

1> price_matrix
2     [,1] [,2]
3[1,]   64   31
4[2,]   65   28
5[3,]   66   35
6>
7

Naming Rows and Columns

Currently our matrix doesn't have any names for the rows and columns and it's difficult to understand the data. Our first column represents the prices of the bond and the second column represents the prices of stock. The three rows represent three days, namely, Mon, Tue and Wed.

In R, we can assign names to the matrix using the rownames() and colnames() functions as shown below:

1#set row names and column names for price_matrix
2rownames(price_matrix) <- c("Mon","Tue", "Wed")
3colnames(price_matrix) <- c("Bond","Stock")
4#print the matrix
5price_matrix
6

We can now print the matrix:

1> price_matrix
2    Bond Stock
3Mon   64    31
4Tue   65    28
5Wed   66    35
6>
7

Another easy way to assign names to a matrix is to use the dimnames() function, as shown below. Note the use of list() function which we will discuss in the upcoming lessons. This list has two objects, the first is the vector for row names and the second is the vector for column names.

1#Use the dimnames function to assign names to matrix elements
2dimnames(price_matrix) <-list(c("Mon","Tue", "Wed"),c("Bond","Stock"))
3#print the matrix
4price_matrix
5

Selecting the Matrix Elements

We can select specific elements of a matrix by using the expression D[m, n], i.e., mth row and nth column of matrix D.

Below R script shows some different ways of selecting the matrix elements:

1> #print the matrix
2> price_matrix
3    Bond Stock
4Mon   64    31
5Tue   65    28
6Wed   66    35
7> 
8> #What is the bond price on 3rd day? (3rd row and 1st column)
9> price_matrix[3,1]
10[1] 66
11> 
12> #Get me just the stock prices for all days
13> #In this case we want to access all rows, so we will just omit supplying the row
14> #stock prices are in 2nd column
15> price_matrix[,2]
16Mon Tue Wed 
17 31  28  35 
18> 
19> #Get stock and bond price on Tuesday
20> #Tuesday is the 2nd row
21> price_matrix[2,]
22 Bond Stock 
23   65    28 
24> 
25> #Get all prices for Monday and Wednesday
26> #Monday and Wednesday data is in row 1 and 3
27> price_matrix[c(1,3),]
28    Bond Stock
29Mon   64    31
30Wed   66    35
31> 
32> #Get stock prices for Tuesday and Wednesday
33> #We can use colon for continuing elements
34> price_matrix[2:3,2]
35Tue Wed 
36 28  35 
37>
38

Arithmetic Operations on Matrices

Just like vectors, we can use standard operators +, -, /, * with matrices. Let's take a few examples to understand the matrix arithmetic and also understand a few other matrix operations along the way.

Let's say that you hold 5 quantity each of this bond and stock. We can multiple the price_matrix to get the dollar holding value of your assets. We store this in the matrix portfolio_value.
Now that we have the portfolio values, we can calculate the total portfolio value on each day by adding the value of stocks and bonds. This can be done using the rowSums() function. We store the results in a vector called days_total.
We now have a new vector which contains daily portfolio value totals. But it is not a part of the portfolio_value matrix. We can add the days_total vector to the main matrix using cbind() function.
Since we have the portfolio values over a three day period, we can calculate the average portfolio value during this period. The values are in columns so we can use colMeans() function to calculate the column means. We store these values in a vector called days_average.
Finaly we can add the days_average vector as a new row to our main matrix. Since we are adding a row, we will use rbind() function.
The resulting matrix contains portfolio values along with totals and average over a period of three days.

1> #print the matrix
2> price_matrix
3    Bond Stock
4Mon   64    31
5Tue   65    28
6Wed   66    35
7> 
8> #You hold 5 quantity ech of bond and stock. What is the value?
9> portfolio_value <- price_matrix * 5
10> 
11> #What is the total portfolio value on each day
12> days_total <- rowSums(portfolio_value)
13> 
14> #print days total
15> days_total
16Mon Tue Wed 
17475 465 505 
18> 
19> #We can added the days_total vector to the main matrix using cbind()
20> portfolio_value_totals <- cbind(portfolio_value,days_total)
21> 
22> #Calculate average value per day
23> days_average <- colMeans(portfolio_value_totals)
24> 
25> #Add the averages row to the portfolio_value_totals matrix
26> final_matrix <-rbind(portfolio_value_totals,days_average)
27> 
28> #print final_matrix
29> final_matrix
30             Bond    Stock days_total
31Mon           320 155.0000   475.0000
32Tue           325 140.0000   465.0000
33Wed           330 175.0000   505.0000
34days_average  325 156.6667   481.6667
35>
36

Multiplying Matrix with a Vector

In the above example, we assumed the same quantity 5 for both stock and bond and simply multiplied it with the matrix. R took care of multiplying each matrix element with 5 to get us the values. However, what if we have different quantities of stock and bond. In such a case we can store the quantities in a new vector and then do standard matrix multiplication to achieve our results. Note that what we did earlier (multiply by *) is not the standard matrix multiplication for which you should use %*% in R. The calculation and nuances are demonstrated below:

1#print the matrix
2price_matrix
3# Assign quantities - 5 bonds and 3 stocks
4quantities <-c(5,3)
5# Multiply prices with quantities to get the values.
6portfolio_value <- price_matrix %*% diag(quantities)
7

%*% represents standard matrix multiplication.
diag() create a diagnal matrix from the vector so that we can then multiply the two matrices.
Multiplying the price matrix with the quantity vector will look as follows:
Note: If you did not understand this calculation, don't worry about it at this stage. We will learn a lot more about matrix multiplication in future courses. The objective of bringing this up here was to just familiarize you with the possibilities of the advance calculations that can be done.

The resulting matrix will look as follows:

1> portfolio_value
2    [,1] [,2]
3Mon  320   93
4Tue  325   84
5Wed  330  105
6>
7

Workspace

When you create a new vector, matrix or any other R object, it gets saved into the workspace and is available for you to use in your calculations. These variables can be seen in the Global Environment, i.e., the top-right window in RStudio. You can also access all objects available in the workspace using the ls() command in R console.

Using ls() command:

1> ls()
2[1] "days_average"           "days_total"             "final_matrix"
3[4] "portfolio_value"        "portfolio_value_totals" "price_data"
4[7] "price_matrix"          
5>
6

Learn

Resources

Matrices in R Programming

Defining a Matrix in R

Naming Rows and Columns

Selecting the Matrix Elements

Arithmetic Operations on Matrices

Multiplying Matrix with a Vector

Workspace

Creating and Using Vectors in R

Factors in R Programming

Getting Started with R Programming

Data Science for Finance Bundle

Topics