Unable to delete large data set in Django — OperationalError: (1040, 'Too many connections')

1k Views Asked by At

I am attempting to delete 300,000+ spam comments from a Django site that is using the Zinnia blogging app. Zinnia includes a command for deleting spam called, appropriately, spam_cleanup but running this command spews thousands of the following error before being terminated by the OS.

OperationalError: (1040, 'Too many connections')

The code for the spam_cleanup command is as follows:

class Command(NoArgsCommand):
"""
Command object for removing comments
marked as non-public and removed.
"""
help = "Delete the entries's comments marked as non-public and removed."

def handle_noargs(self, **options):
    verbosity = int(options.get('verbosity', 1))

    content_type = ContentType.objects.get_for_model(Entry)
    spams = comments.get_model().objects.filter(
       #is_public=False, is_removed=True,
        content_type=content_type)
    spams_count = spams.count()
    spams.delete()

    if verbosity:
        print('%i spam comments deleted.' % spams_count)

My initial thought was just to break the query down to only delete say 80 items at at time using the limit property but Django tells me that I can't do that on delete:

AssertionError: Cannot use 'limit' or 'offset' with delete.

It's not reasonable to increase the max connections on MySQL to 300,000, right? I also read that Django emulates cascade on delete but does not set it at the DB level so a raw SQL query could orphan all the relations. I am lost as to how to perform this delete properly, please help!

0

There are 0 best solutions below