Parsing a whole tag by lxml.html

202 Views Asked by nazmus saif At 26 June 2025 at 16:05

I'm new to lxml and want to parse a page retrieved with "requests". My HTML is this:

<html>
<body>
<h1 class="entry-title">
    <a href="http://a.com" rel="bookmark">
    bla bla bla
    </a>
</h1>
</body>
</html>

and I want to have a string that looks like this:

"""<h1 class="entry-title">
    <a href="http://a.com" rel="bookmark">
    bla bla bla
    </a>
</h1>"""

What would the code be in Python 3.4?


There is 1 best solution below

Valeriy Gaydar On 08 December 2014 at 12:27 BEST ANSWER

Try something like this:

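from lxml.html import document_fromstring, tostring

# YOUR_HTML_STRING is the page source as a str (for example, the .text of a requests response)
doc = document_fromstring(YOUR_HTML_STRING)
# xpath() returns a list, so take the first <h1>; encoding="unicode" makes tostring
# return a str instead of bytes, and with_tail=False drops the text after </h1>
h1 = tostring(doc.xpath("//h1")[0], encoding="unicode", with_tail=False)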
