Data Types in R
In R programming, there are five basic data types. Also, everything in R is an object, so we will refer to these data types as classes of objects. So, these five types of objects are:
- Numeric
- Integer
- Complex
- Logical
- Character
Let's briefly look at each of these.
Numeric
In R, decimal values such as 8.5
are numeric. You can assign a numeric value to a variable and then that variable will be of numeric data type.
For any variable or object, you can check it's data type by using the class
command.
1> # Assign a decimal value to x
2>
3> x <- 8.5
4>
5> # Print the value of x
6> x
7[1] 8.5
8>
9> # Print the class name of x
10>
11> class(x)
12[1] "numeric"
13>
14
Integer
Integers are the natural numbers such as 1, 2, 3, etc. In R, when you assign a number to a variable, it is by default of the numeric class type. In order to create an integer variable, we use a function called as.integer()
. This ensures that the variable created is indeed an integer.
1> #Assign a value to a variable
2> x <- 8
3>
4> #check the class type of x
5> class(x)
6[1] "numeric"
7>
8> # You will notice that it is numeric.
9>
10> # Use the as.integer() function to create an integer
11>
12> y <- as.integer(8)
13>
14> #check the class type of yx
15> class(y)
16[1] "integer"
17>
18> > # It is an integer now
19>
20
Complex
In R, a complex value is defined using a pure imaginary value <em>i</em>
.
1> #Assign a complex value to a variable
2> x = 1 + 2i
3>
4> #Print the complex value x
5> x
6[1] 1+2i
7>
8> #Print the class of x
9> class(x)
10[1] "complex"
11>
12
Logical
Logical values are boolen values (TRUE
or FALSE
) often created by comparison between variables.
1> #Create sample variables
2> x <- 5
3> y <- 7
4>
5> #Check if y is less than x
6> z <- y < x
7>
8> # z will have a logical value. Let's print it
9>
10> z
11[1] FALSE
12>
13> #Print the class of z
14> class(z)
15[1] "logical"
16>
17
Character
Character values are string or text values. We can create a character type variable by assigning a string to the variable. Notice the use of double quotes.
1> #Create a character variable
2>
3> x <-"John Doe"
4>
5> #Print the character string
6>
7> x
8[1] "John Doe"
9>
10
If we have an object of some other data type (e.g., numeric), we can convert it into character type using the as.character()
function.
1> #Create a numeric variable
2>
3> x <- 8.75
4>
5> #Print the variable x
6>
7> x
8[1] 8.75
9>
10> #Print the class of x
11>
12> class(x)
13[1] "numeric"
14>
15> #convert x into a character value
16>
17> y <- as.character(x)
18>
19> #Print the variable y
20>
21> y
22[1] "8.75"
23>
24> #Print the class of y
25>
26> class(y)
27[1] "character"
28>
29
Standard Logical Operations
We can perform standard logical operations such as "&" (and), "|" (or), and "!" (negation) on logical values in R.
1> x = TRUE; y = FALSE
2> x & y # Checks that both x AND y are true
3[1] FALSE
4> x | y # Checks that either x OR y is true
5[1] TRUE
6> !x # Returns the negation of x
7[1] FALSE
8>
9
Standard String Operations
We can easily perform basic string operations in R such as concatenation, or creating readable strings.
1> #Define sample character variables
2> fname <- "John"
3> lname <- 'Doe'
4>
5> # Concatenate strings using paste function
6> paste(fname, lname)
7[1] "John Doe"
8>
9> #Create a readable string using sprintf() function
10> #Use %s for strings and %d for numeric values
11>
12> sprintf("His first name is %s and last name is %s",fname,lname)
13[1] "His first name is John and last name is Doe"
14>
15> # Extract a substr from a long string. In the following example, we
16> # are extracting stock symbol from text
17>
18> substr("Google's stock symbol is GOOG", start=26, stop=29)
19[1] "GOOG"
20>
21> #Replace the first occurance of a word in a string by another word.
22>
23> sub("John","Sam","His first name is John")
24[1] "His first name is Sam"
25>
26
The sprintf() function allows you to create strings as output using formatted data. For example:
1sprintf("I go for a walk at %s:%s%s a.m.", 7, 0, 5)
2
This will output:
#> [1] "I go for a walk at 7:05 a.m."
See the special syntax %s
. Each %
is referred to as a slot, which is basically a placeholder for a variable that will be formatted. The values after the comma are the variables or values that will be filled in each slot.
Getting Help
We can find many more functions for these data types in the R documentation. For each function, the help also provides details about how to use these functions. For example, you can get help for sprintf()
function by using the command help('sprintf')
. The documentation will open in the help window (bottom right).
Create Your Free Account
Create a free account to access this content and join our community of learners.
You'll get access to:
- Access the full tutorial
- Join our learning community
- Track your progress
- Bookmark content for later