How do I extract the URL from the following HTML?
That is, extract:
http://media.somesite.com.au/img-101x76.jpg
from:
<div class="media-img">
<div class=" searched-img" style="background-image: url(http://media.somesite.com.au/img-101x76.jpg);"></div>
</div>
In XPath 1.0 in general, you can use a combination of the substring-after() and substring-before() functions to extract part of a text. But HAP's SelectNodes() and SelectSingleNode() can only return nodes, so those XPath functions won't help here.
One possible approach is to get the entire value of the style attribute using XPath and HAP, then process that value further in .NET, for example with a regex.
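A minimal sketch of this approach, assuming the HtmlAgilityPack NuGet package is referenced. The names Program and ExtractBackgroundUrl and the exact regex pattern are illustrative choices, not part of HAP:

```csharp
using System;
using System.Text.RegularExpressions;
using HtmlAgilityPack; // assumes the HtmlAgilityPack NuGet package is installed

public static class Program
{
    // Hypothetical helper: returns the first url(...) value found in the style
    // attribute of the first div whose class contains "searched-img",
    // or an empty string if nothing matches.
    public static string ExtractBackgroundUrl(string html)
    {
        var doc = new HtmlDocument();
        doc.LoadHtml(html);

        // XPath is used only to locate the node; HAP hands back nodes, not strings.
        var div = doc.DocumentNode.SelectSingleNode("//div[contains(@class, 'searched-img')]");
        if (div == null) return "";

        // Read the whole style attribute, falling back to an empty string.
        var style = div.GetAttributeValue("style", "");

        // Capture the text between "url(" and ")", tolerating optional quotes.
        var match = Regex.Match(style, @"url\(\s*['""]?(.*?)['""]?\s*\)");
        return match.Success ? match.Groups[1].Value : "";
    }

    public static void Main()
    {
        var html = @"<div class=""media-img"">
<div class="" searched-img"" style=""background-image: url(http://media.somesite.com.au/img-101x76.jpg);""></div>
</div>";
        Console.WriteLine(ExtractBackgroundUrl(html));
    }
}
```

contains(@class, 'searched-img') is used rather than an exact match because the class attribute in the sample has a leading space. If several elements can match, SelectNodes() with the same XPath and a loop over the results would be the natural variation.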
Output for the sample HTML above:
http://media.somesite.com.au/img-101x76.jpg