I have built a dataframe with R reading some events from my email. Basically, the structure of the final dataframe is as follows:
'data.frame': 74 obs. of 7 variables:
$ process_name : Factor w/ 2 levels : 1 1 1 1 2 2 2 2 2 2 ...
$ job_code : chr "TRB1619825404" "TRB1619825404" "TRB1619825404" "TRB1619825404" ...
$ phase : Factor w/ 7 levels ,..: 4 4 6 6 4 5 7 1 3 2 ...
$ stage : Factor w/ 2 levels "End","Start": 2 1 2 1 2 1 2 2 2 2 ...
$ date : POSIXct, format: "2021-04-30 23:30:04" "2021-05-01 01:57:26" "2021-05-01 01:57:26" "2021-05-01 02:25:26" ...
$ execution_date: Date, format: "2021-04-30" "2021-05-01" "2021-05-01" "2021-05-01" ...
$ execution_time : 'hms' num 23:30:04 01:57:26 01:57:26 02:25:26 ...
..- attr(*, "units")= chr "secs"
Every event has an associated date and time for its start and end. What I want to do (in order to compute the duration of the event) is to cast the dataframe in something like this:
'data.frame'
$ process_name
$ job_code
$ phase
$ start_date
$ end_date
$ duration
I tried to use dcast, but it uses a default aggregation function and I just want to reshape the dataframe. Any ideas?
First, I'm guessing your data looks like this:
(If not, please provide your data in something we can use.)
From here:
I had to re-
as.POSIXct
it becausedcast
cast them tonumeric
(not sure how to fix that).