Can the go-colly library crawl all HTML tags and text content under a div tag? If so, how? I can get all texts under a div tag. Like this:
c.OnHTML("body .post-topic-main .post-topic-des", func(e *colly.HTMLElement) {
text = strings.TrimSpace(e.Text)
})
But I dont'know how to get HTML tags under the div tag.
If you looking for
innerHTMLit is accessible byDOMand usingHtmlmethod (e.DOM.Html()).If you looking for a special tag under the founded element,
ForEachcould use for this purpose. The first argument is the selector and the second parameter is the callback function. The callback function will iterate for each element that matches the selector and also is a member of theeelement.More information: https://pkg.go.dev/github.com/gocolly/[email protected]#HTMLElement.ForEach