Django googlebot crawling ajax url

325 Views Asked by At

I have setup a Django site and receiving this alert from GoogleBot that I think it put in a test paramater (zero) at my URL. Zero is nonexistant primary key in my database

<WSGIRequest
path:/for_sale_detail/0/,
GET:<QueryDict: {}>,
POST:<QueryDict: {}>,
COOKIES:{},
META:{u'CSRF_COOKIE': u'xxxxxxxxxxxxxxxxx',
'HTTP_ACCEPT': 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
'HTTP_ACCEPT_ENCODING': 'gzip,deflate',
'HTTP_CONNECTION': 'close',
'HTTP_FROM': 'googlebot(at)googlebot.com',
'HTTP_HOST': 'example.com',
'HTTP_USER_AGENT': 'Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)',

This is my url.py with the first group of digits is my primary key (example below is number 2):

url(r'^for_sale_detail/(?P<slug>[-\w\d]+)/$', for_sale_detail, name='for_sale_detail'),

and in my html file, ajax URL (dynamic content populated after user interaction) is called in such way:

var slug = '2-terrace-link-house';

for (abc < 10)
{
    ...
    slug = new_url;
    call_detail_page(); 
}

function call_detail_page()
{
    var detail_page_url = "{% url 'for_sale_detail' '2-terrace-link-house' %}".replace ('2-terrace-link-house', slug);

     $.colorbox({
        iframe : true,      
        width : '1200px',
        height : '600px',           
        href:detail_page_url
        });     
}

I've created sitemap.xml file to point to all valid slug address but still when GoogleBot crawls, it will append "0" to my URL slug. Ideally, I would want all detail pages to be indexed and found by google thus will not want to redirect GoogleBot to a nonexistant page

a valid URL is this:

http://www.example.com/for_sale_detail/2-terrace-link-house

Thank you

1

There are 1 best solutions below

1
On BEST ANSWER

You can check on for_sale_detail if the item exists and return HttpResponseNotFound or raise Http404 exception if not.