first of all forgive me for any ambiguity . i find my problem a bit hard to explain in English . basically what i want to do is , to divide a huge set of words to "N" parts .
for example read all the words in a file , then divide them between lets say N=10 parts . to be more precise , i'm working on a data mining project . there are thousands of documents which i need to sort the words of .
say n = 2 . i know i can put a-m and n-z in a file . i need an algorithm which can do this for n > 100 .
PS : my program FIRST has to create the N files ( or chunks ) then read all the words and depending on how they begin , assign them to one of the chunks .
EXAMPLE : input : N = 2 words = [....]
output : [words starting with a-m] , [words starting with n-z]
in other words i want to divide my words Lexicographically
This is a rough idea of what you want:
Obviously you will have to handle the case when n chunks does not divide evenly into the length of your list of words.