I have a question regarding XML parsing. I have tags with spaces in them, e.g.
<item1 id=rt name ="th">
<point1>1254</point1>
<point2>1254</point2>
</item>
How do I extract the id and name from these tags?
I'm currently using R, as I need it for the rest of my analysis, but I can also do the file parsing in Perl or Python. What is the best solution?
You can do this, for example, using the XML package:
EDIT
In case your data is not well formed, you should either reformat it as I did above, or read it line by line and extract the information with regular expressions (although using regex on XML tags is generally not recommended).
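As a rough illustration of the line-by-line approach, here is a minimal sketch in Python (which the question mentions as an option), using only the standard `re` module. The names `TAG_RE`, `ATTR_RE` and `extract_attributes` are made up for this example, and it assumes each opening tag sits on a single line and that attribute values are either quoted or contain no whitespace. It is a fallback for malformed input, not a replacement for a real XML parser.

```python
import re

# Opening tag: captures the tag name and the raw attribute text.
# Closing tags such as </item> do not match because of the leading "/".
TAG_RE = re.compile(r"<([A-Za-z_][\w.-]*)\b([^<>]*)>")

# name=value pairs; the value may be double-quoted, single-quoted, or bare,
# and spaces around "=" are tolerated.
ATTR_RE = re.compile(r"""([\w:.-]+)\s*=\s*(?:"([^"]*)"|'([^']*)'|([^\s"'>/]+))""")


def extract_attributes(line):
    """Return (tag_name, {attribute: value}) for the first opening tag
    found on the line, or None if the line has no opening tag."""
    match = TAG_RE.search(line)
    if match is None:
        return None
    attrs = {}
    for name, double_quoted, single_quoted, bare in ATTR_RE.findall(match.group(2)):
        # Exactly one of the three value groups is non-empty (or all are
        # empty for an empty quoted value).
        attrs[name] = double_quoted or single_quoted or bare
    return match.group(1), attrs


# The lines from the question, processed one at a time.
lines = [
    '<item1 id=rt name ="th">',
    "<point1>1254</point1>",
    "<point2>1254</point2>",
    "</item>",
]
for line in lines:
    result = extract_attributes(line)
    if result and result[1]:  # only report tags that carry attributes
        print(result)  # ('item1', {'id': 'rt', 'name': 'th'})
```

This tolerates the unquoted `id=rt` and the space in `name ="th"`, but it will break on things a parser handles correctly (attributes spanning several lines, `>` inside a quoted value, comments, CDATA), so repairing the file and parsing it properly remains the safer route when that is possible.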