I want to get all data-js
attribute values from the content by BeautifulSoup.
Input:
<p data-js="1, 2, 3">some text..</p><p data-js="5">some 1 text</p><p data-js="4"> some 2 text. </p>
Output:
['1, 2, 3', '5', '4']
I've done it with lxml:
>>> content = """<p data-js="1, 2, 3">some text..</p><p data-js="5">some 1 text</p><p data-js="4"> some 2 text. </p>"""
>>> import lxml.html as PARSER
>>> root = PARSER.fromstring(content)
>>> root.xpath("//*/@data-js")
['1, 2, 3', '5', '4']
I want the above result via BeautifulSoup.
The idea would to find all elements having
data-js
attributes and collect them in a list:Prints
['1, 2, 3', '5', '4']
.