I am trying to get information from a website (html) into MATLAB. I am able to get the html from online into a string using:
urlread('http://www.websiteNameHere.com...');
Once I have the string I have a very LONG string variable, containing the entire html file contents. From this variable, I am looking for the value/characters in very specific classes. For example, the html/website will have a bunch of lines, and then will have the classes of interest in the following form:
...
<h4 class="price">
<span class="priceSort">$39,991</span>
</h4>
<div class="mileage">
<span class="milesSort">19,570 mi.</span>
</div>
...
<h4 class="price">
<span class="priceSort">$49,999</span>
</h4>
<div class="mileage">
<span class="milesSort">9,000 mi.</span>
</div>
...
I need to be able to get the information between <span class="priceSort">
and </span>
; ie $39,991 and $49,999 in the above example. What is the best way to go about this? If the tags were specific beginning and ends that were also the same (such as <price>
and </price>
), I would have no problem...
I also need to know the most robust method, since I would like to be able to find <span class="milesSort">
and other information of this sort too. Thanks!
Try this and let us know if it works for you -