dplyr: how to modify column names based on a pattern?

5k Views Asked by At

consider this simple example:

data <- data_frame('data::col1' = c(1,2,3), 'data::col2' = c(1,2,3))
> data
# A tibble: 3 × 2
  `data::col1` `data::col2`
         <dbl>        <dbl>
1            1            1
2            2            2
3            3            3

This kind of dataframe is the output one would get by using Apache Pig. Here, I am able to load it using dplyr, but as you can see the names of the columns are cumbersome.

How can I use the tidyverse suite to get rid of the part before the ::? Also, assume I have many columns with the pattern data::mycol so an ideal solution needs not typing manually each affected column.

output expected:

# A tibble: 3 × 2
   col1  col2
  <dbl> <dbl>
1     1     1
2     2     2
3     3     3

Thanks!

1

There are 1 best solutions below

6
On
library(dplyr)
library(purrr)

data <- data.frame('data::col1' = c(1,2,3), 'data::col2' = c(1,2,3))
names(data) <- names(data) %>%
  gsub("data..", "", .)