Hi everyone this is my first question here and im not a programmer.
I would like to generate a sitemap. I am crawling a website with webcrawler (crawler.dev.java.net). Is there any way to use a sax parser for the data i get?
I also used jtidy and i got the homepage html data converted in an xml file.
im very confused there are so many sax parsers, idont know the difference between them and which one to choose.
I want to have access to the attributes of html tags and i cant do that with webcrawler or i dont know how to do it
Whats the difference between org.xml.sax and all the other packages?
Java provides a standard way of interacting with SAX parsers through JAXP (see code below). To switch between SAX parsers typically you just need to add the parser jar to your class path the code stays the same.
You can do sax parsing as follows: