Rselenium - Save page as

1.8k Views Asked by At

My goal is to download an image from a URL. In my case I can't use download.file because my picture is in a web page requiring login and it has some java scripts running in the background before the real image gets visible. This is why I need to do it using RSelenium package.

As suggested here, I've built a docker container with a standalone-chrome tag. Output from Docker terminal:

$ docker-machine ip
192.168.99.100
$ docker ps
CONTAINER ID  IMAGE                              COMMAND                CREATED             STATUS              PORTS                    NAMES
c651dab3a948  selenium/standalone-chrome:3.4.0  "/opt/bin/entry_po..."  24 hours ago        Up 24 hours         0.0.0.0:4445->4444/tcp   cranky_kalam

Here's what I've tried:

require(RSelenium)

# Avoid download prompt to pop up and parsing default download folder
eCaps <- list(
  chromeOptions = 
    list(prefs = list(
      "profile.default_content_settings.popups" = 0L,
      "download.prompt_for_download" = FALSE,
      "download.default_directory" = "C:/temp/Pictures"
    )
    )
)

# Open connection
remDr <- remoteDriver(remoteServerAddr = "192.168.99.100",port = 4445L,browserName="chrome",extraCapabilities = eCaps)
remDr$open()

# Navigate to desired URL with picture
url <- "https://www.google.be/images/branding/googlelogo/2x/googlelogo_color_272x92dp.png"
remDr$navigate(url)
remDr$screenshot(display = TRUE) # Everything looks fine here

# Move mouse to the page's center
webElem <- remDr$findElement(using = 'xpath',value = '/html/body')
remDr$mouseMoveToLocation(webElement = webElem)

# Right click and 
remDr$click(2)
remDr$screenshot(display = TRUE) # I don't see the right-click dialog!
# Try to move right-click dialog to 'Save as' or 'Save image as'
remDr$sendKeysToActiveElement(list(key = 'down_arrow',
                                   key = 'down_arrow',
                                   key = 'enter'))
### NOTHING HAPPENS

I've tried to play around with the amount of key = 'down_arrow' and every time I look into C:/temp/Pictures nothing has been saved.

Please note that this is just an example and I know I could have downloaded this picture with download.file. I need a solution with RSelenium for my real case.

1

There are 1 best solutions below

0
On

I tried using remDr$click(buttonId = 2) to perform Right click but to no avail. Thus, one workaround to save the image would be extracting links from the webpage and using download.file to download it.

#navigate 
url <- "https://www.google.be/images/branding/googlelogo/2x/googlelogo_color_272x92dp.png"
remDr$navigate(url)

#get the link of image
link = remDr$getPageSource()[[1]] %>%
  read_html() %>% html_nodes('img') %>% 
  html_attr('src')
[1] "https://www.google.be/images/branding/googlelogo/2x/googlelogo_color_272x92dp.png"

#download using download.file in your current working directory. 
download.file(link, basename(url), method = 'curl')