compare flatten index with own index

118 Views Asked by At

I'm creating an array within an array. (paragraphs within articles). Then i flatten this into one array. Let's call this ARRAY_COMPARISON_REFERENCE

Next to that I am making an array that gives a number to every next paragraph. The first paragraph of the first article is number one, the second paragraph is number two, and the first paragraph of the second article is number three, and so on. Let's call this ARRAY_INFORMATION_REFERENCE

Everytime that I find a 50% match of words between two paragraphs, I want to save the text of those of paragraphs into a file. I use .flatten to loop through all the articles, and reference in the index numbers. However, these do not correspond with the index number of the ARRAY_INFORMATION_REFERENCE.

How do I make the translation of a (two level).flatten to a regular (every new paragraph += 1) reference?

paragraphnumber = Array.new
paragraphs = []
Dir.glob("*.txt").each do |textfile|
    #first level: textfiles
 paragraphtext = []
 File.foreach(textfile, "\.\r") do |paragraph|
    #second level: paragraphs within the textfiles
    #here I fill the array, effectively starting the index and adding 1 to the index at every iteration through the loop.
    # THIS IS THE ARRAY_INFORMATION_REFERENCE
    paragraphtext << paragraph
  end
  paragraphs << paragraphtext
 end

#here i make the second index:
paragraphs.flatten.each_with_index do |x, indexx|

    paragraphs.flatten.each_with_index do |y, indexy|
    count = x.count { |k,v| y.include?(k) }
    if count > 20
        #these are the reference numbers
        index_paragraph1 = "#{indexx}"
        index_paragraph2 = "#{indexy}"
        #And here i try to use the reference of the second Array, to find the information out the first Array, which is not working
            #THIS IS THE ARRAY_COMPARISON_INDEX
        information_paragraph1 = paragraphtext.at(indexx)
        information_paragraph2 = paragraphtext.at(indexy)
        end
    end
end

The problem is: the reference number of a paragraph in the ARRAY_COMPARISON_REFERENCE does not correspond with that of the ARRAY_INFORMATION REFERENCE. Using .flatten the index is made in a different way apparently.. How do I make the translation between both indexes?

1

There are 1 best solutions below

3
On

Since you're not providing all the relevant code and we can't see what is going in paragraphs, we can't know the effect of flatten.

However, we can say that the flattened indices will only match the unflattened indices if flatten has no effect (i.e. if there are no arrays within paragraphs.)