Cheerio on Node: crazy recursion that breaks the server

335 Views Asked by At

I've an array of HTML string, I am trying to convert it to an array of Cheerio Object while changing the relative links to absolute links. While doing that, the node app run into a crazy recursion, probably due to a stack overflow.

For instance: the recursion will call _htmlToCheerio for infinite times when handling the 5th element of the "links" array.

Any idea how to solve this?

async.map(links, _htmlToCheerio, function(err, results) {
    if (err) {
        return cb(null, null);
    }
    return cb(null, results);
});

function _htmlToCheerio(html) {
        var $ = cheerio.load(html);
        return _updateLinksToAbsolute($);
}


function _updateLinksToAbsolute($) {

    var elms = $('a');
    elms.each(function(i, e) {

        var tagName = e.name;

        // <a href="">
        var href =  e.attribs['href'];
        if (tagName == 'a' && href && !isAbsoluteUrl(href)) {
            var newHref = URI.resolve(baseUrl, href);
            $(e).attr('href', newHref);
        }

    });

    return $;
}
0

There are 0 best solutions below