Scrapy returning an empty JSON file


I am trying to get data from a website. Everything seems to be correct, and the XPath expressions were tested in the Scrapy shell.
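For reference, a selector check in the Scrapy shell looks roughly like this (the exact session is not shown here, but these are the same expressions the spider below uses):

scrapy shell "https://www.kabum.com.br"
>>> response.xpath('//p[@class = "bot-categoria"]/a/text()').extract()
>>> response.xpath('//p[@class = "bot-categoria"]/a/@href').extract()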

# -*- coding: utf-8 -*-

from scrapy.contrib.spiders import CrawlSpider


class KabumspiderSpider(CrawlSpider):
    name = "kabumspider"
    allowed_domain = ["www.kabum.com.br"]
    start_urls = ["https://www.kabum.com.br"]


def parse(self, response):
        categorias = response.xpath('//p[@class = "bot-categoria"]/a/text()').extract()
        links = response.xpath('//p[@class = "bot-categoria"]/a/@href').extract()

        for categoria in zip(categorias, links):

            info = {
                'categoria': categoria[0],
                'link': categoria[1],
            }
            yield info

However, the output JSON file only contains:

[

What is wrong with my code?

Accepted answer:

I ran the spider and the only real issue I found is that your parse method is defined outside the class, so Scrapy never calls it and the exported file stays empty. With the indentation fixed (and allowed_domain corrected to allowed_domains, the attribute name Scrapy actually reads), it works for me:

# -*- coding: utf-8 -*-

# On recent Scrapy versions the import path is "from scrapy.spiders import CrawlSpider";
# scrapy.contrib is the old, deprecated location.
from scrapy.contrib.spiders import CrawlSpider


class KabumspiderSpider(CrawlSpider):
    name = "kabumspider"
    # Scrapy reads "allowed_domains" (plural); "allowed_domain" is silently ignored.
    allowed_domains = ["www.kabum.com.br"]
    start_urls = ["https://www.kabum.com.br"]

    # parse() must be indented inside the class so Scrapy uses it as the default callback.
    def parse(self, response):
        # Category names and their links from the homepage menu.
        categorias = response.xpath('//p[@class = "bot-categoria"]/a/text()').extract()
        links = response.xpath('//p[@class = "bot-categoria"]/a/@href').extract()

        for categoria, link in zip(categorias, links):
            yield {
                'categoria': categoria,
                'link': link,
            }
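
As a quick sanity check, the spider can then be run with JSON feed export from the project directory; the output filename here is just an example:

    scrapy crawl kabumspider -o categorias.json

Note that -o appends to an existing file, so deleting the earlier, empty JSON file before re-running keeps the output clean.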