Handling Date and Time in R
By: Karthik Janar in data-science Tutorials on 2018-05-07
R has a special way of representing dates and times, which can be helpful if you're working with data that show how something changes over time (i.e. time-series data) or if your data contain some other temporal information, like dates of birth.
Dates are represented by the 'Date" class and times are represented by the 'POSIXct" and 'POSIXlt" classes. Internally, dates are stored as the number of days since 1970-01-01 and times are stored as either the number of seconds since 1970-01-01 (for 'POSIXct") or a list of seconds, minutes, hours, etc. (for 'POSIXlt").
Dates in R
Let's start by using d1 <- Sys.Date() to get the current date and store it in the variable d1. (That's the letter 'd" and the number 1.)
d1 <- Sys.Date()
Use the class() function to confirm d1 is a Date object.
class(d1)
## [1] "Date"
We can use the unclass() function to see what d1 looks like internally.
unclass(d1)
## [1] 17658
That's the exact number of days since 1970-01-01!
However, if you print d1 to the console, you'll get today's date - YEAR-MONTH-DAY.
d1
## [1] "2018-05-07"
What if we need to reference a date prior to 1970-01-01? Create a variable d2 containing as.Date("1969-01-01").
d2 <- as.Date("1969-01-01")
Now use unclass() again to see what d2 looks like internally.
unclass(d2)
## [1] -365
As you may have anticipated, you get a negative number. In this case, it's -365, since 1969-01-01 is exactly one calendar year (i.e. 365 days) BEFORE 1970-01-01.
Time in R
Now, let's take a look at how R stores times. You can access the current date and time using the Sys.time()
t1 <- Sys.time()
View the contents of t1.
t1
## [1] "2018-05-07 08:08:06 +08"
And check the class() of t1.
class(t1)
## [1] "POSIXct" "POSIXt"
As mentioned earlier, POSIXct is just one of two ways that R represents time information. (You can ignore the second value above, POSIXt, which just functions as a common language between POSIXct and POSIXlt.) Use unclass() to see what t1 looks like internally - the (large) number of seconds since the beginning of 1970.
unclass(t1)
## [1] 1525651687
By default, Sys.time() returns an object of class POSIXct, but we can coerce the result to POSIXlt with as.POSIXlt(Sys.time()). Give it a try and store the result in t2.
t2 <- as.POSIXlt(Sys.time())
Check the class of t2.
class(t2)
## [1] "POSIXlt" "POSIXt"
Now view its contents.
t2
## [1] "2018-05-07 08:08:06 +08"
The printed format of t2 is identical to that of t1. Now unclass() t2 to see how it is different internally.
unclass(t2)
## $sec
## [1] 6.724409
##
## $min
## [1] 8
##
## $hour
## [1] 8
##
## $mday
## [1] 7
##
## $mon
## [1] 4
##
## $year
## [1] 118
##
## $wday
## [1] 1
##
## $yday
## [1] 126
##
## $isdst
## [1] 0
##
## $zone
## [1] "+08"
##
## $gmtoff
## [1] 28800
##
## attr(,"tzone")
## [1] "" "+08" "+0720"
t2, like all POSIXlt objects, is just a list of values that make up the date and time. Use str(unclass(t2)) to have a more compact view.
str(unclass(t2))
## List of 11
## $ sec : num 6.72
## $ min : int 8
## $ hour : int 8
## $ mday : int 7
## $ mon : int 4
## $ year : int 118
## $ wday : int 1
## $ yday : int 126
## $ isdst : int 0
## $ zone : chr "+08"
## $ gmtoff: int 28800
## - attr(*, "tzone")= chr [1:3] "" "+08" "+0720"
If, for example, we want just the minutes from the time stored in t2, we can access them with t2$min.
t2$min
## [1] 8
Extracing values from a date/time object in R
Now that we have explored all three types of date and time objects, let's look at a few functions that extract useful information from any of these objects - weekdays(), months(), and quarters().
The weekdays() function will return the day of week from any date or time object. Try it out on d1, which is the Date object that contains today's date.
weekdays(d1)
## [1] "Monday"
The months() function also works on any date or time object. Try it on t1, which is the POSIXct object that contains the current time (well, it was the current time when you created it).
months(t1)
## [1] "May"
The quarters() function returns the quarter of the year (Q1-Q4) from any date or time object. Try it on t2, which is the POSIXlt object that contains the time at which you created it.
quarters(t2)
## [1] "Q2"
Often, the dates and times in a dataset will be in a format that R does not recognize. The strptime() function can be helpful in this situation.
Converting Character Strings to Dates
strptime() converts character vectors to POSIXlt. In that sense, it is similar to as.POSIXlt(), except that the input doesn't have to be in a particular format (YYYY-MM-DD).
To see how it works, store the following character string in a variable called t3: "October 17, 1986 08:24" (with the quotes).
t3 <- "October 17, 1986 08:24"
Now, use strptime(t3, "%B %d, %Y %H:%M") to help R convert our date/time object to a format that it understands. Assign the result to a new variable called t4. (You should pull up the documentation for strptime() if you"d like to know more about how it works.)
t4 <- strptime(t3, "%B %d, %Y %H:%M")
Print the contents of t4.
t4
## [1] "1986-10-17 08:24:00 +08"
Now, let's check its class().
class(t4)
## [1] "POSIXlt" "POSIXt"
Comparing and manipulating dates
Finally, there are a number of operations that you can perform on dates and times, including arithmetic operations (+ and -) and comparisons (<, ==, etc.)
The variable t1 contains the time at which you created it (recall you used Sys.time()). Confirm that some time has passed since you created t1 by using the 'greater than" operator to compare it to the current time:
Sys.time() > t1
## [1] TRUE
So we know that some time has passed, but how much? Try subtracting t1 from the current time using Sys.time() - t1. Don't forget the parentheses at the end of Sys.time(), since it is a function.
Sys.time() - t1
## Time difference of 0.666158 secs
The same line of thinking applies to addition and the other comparison operators. If you want more control over the units when finding the above difference in times, you can use difftime(), which allows you to specify a 'units" parameter.
Use difftime(Sys.time(), t1, units = 'days") to find the amount of time in DAYS that has passed since you created t1.
difftime(Sys.time(), t1, units = 'days')
## Time difference of 7.958785e-06 days
In this tutorial, you learned how to work with dates and times in R. While it is important to understand the basics, if you find yourself working with dates and times often, you may want to check out the lubridate package by Hadley Wickham.
Add Comment
This policy contains information about your privacy. By posting, you are declaring that you understand this policy:
- Your name, rating, website address, town, country, state and comment will be publicly displayed if entered.
- Aside from the data entered into these form fields, other stored data about your comment will include:
- Your IP address (not displayed)
- The time/date of your submission (displayed)
- Your email address will not be shared. It is collected for only two reasons:
- Administrative purposes, should a need to contact you arise.
- To inform you of new comments, should you subscribe to receive notifications.
- A cookie may be set on your computer. This is used to remember your inputs. It will expire by itself.
This policy is subject to change at any time and without notice.
These terms and conditions contain rules about posting comments. By submitting a comment, you are declaring that you agree with these rules:
- Although the administrator will attempt to moderate comments, it is impossible for every comment to have been moderated at any given time.
- You acknowledge that all comments express the views and opinions of the original author and not those of the administrator.
- You agree not to post any material which is knowingly false, obscene, hateful, threatening, harassing or invasive of a person's privacy.
- The administrator has the right to edit, move or remove any comment for any reason and without notice.
Failure to comply with these rules may result in being banned from submitting further comments.
These terms and conditions are subject to change at any time and without notice.
Most Viewed Articles (in data-science ) Introduction to logical operations in R Functions in R - Creating your first R function Logical and Character Vectors in R Generating Sequence numbers in R - seq(), rep() c() etc. Types of Analysis - Data Science Questions? Data Analytics - Which programming language to learn. R vs Python |
Latest Articles (in data-science) |
- Data Science
- Android
- React Native
- AJAX
- ASP.net
- C
- C++
- C#
- Cocoa
- Cloud Computing
- HTML5
- Java
- Javascript
- JSF
- JSP
- J2ME
- Java Beans
- EJB
- JDBC
- Linux
- Mac OS X
- iPhone
- MySQL
- Office 365
- Perl
- PHP
- Python
- Ruby
- VB.net
- Hibernate
- Struts
- SAP
- Trends
- Tech Reviews
- WebServices
- XML
- Certification
- Interview
Comments