Does anyone know how to count the number of unique phrases in a document, or have some code that does it? (Single words, two-word phrases, three-word phrases.)
Thanks
Example of what I'm looking for:
What I mean is I have a text document, and I need to see what the most popular word phrases are. Example text:
I took the car to the car wash.
I : 1, took : 1, the : 2, car : 2, to : 1, wash : 1
I took : 1, took the : 1, the car : 2, car to : 1, to the : 1, car wash : 1
I took the : 1, took the car : 1, the car to : 1, car to the : 1, to the car : 1, the car wash : 1
I took the car to : 1, took the car to the : 1, the car to the car : 1, car to the car wash : 1
I need each phrase and the number of times it appears.
Any help would be appreciated. The closest thing I found to this was a PHP script from http://tools.seobook.com/general/keyword-density/source.php
I used to have some code for this, but I cannot find it.
Here is some initial code that solves your problem.
You will need to extend this code for real-world use, for example by recognizing more word delimiters (not only blanks) and by implementing some form of case insensitivity.
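As a separate, language-neutral illustration of those two refinements (this is not the code from this post), here is a minimal Python sketch. The function name `count_phrases`, the `max_words` parameter, and the rule that a word is any run of letters, digits, or apostrophes are all assumptions made for the example. It also ignores sentence boundaries, so a phrase can span the end of one sentence and the start of the next.

```python
import re
from collections import Counter


def count_phrases(text, max_words=3):
    """Count every phrase of 1..max_words consecutive words in text."""
    # Lowercasing gives case insensitivity; the regex treats anything that
    # is not a letter, digit, or apostrophe as a delimiter (assumed rule).
    words = re.findall(r"[a-z0-9']+", text.lower())

    counts = Counter()
    for n in range(1, max_words + 1):
        # Slide a window of n words across the word list.
        for i in range(len(words) - n + 1):
            counts[" ".join(words[i:i + n])] += 1
    return counts


counts = count_phrases("I took the car to the car wash.")
# List phrases from most to least frequent.
for phrase, count in counts.most_common():
    print(f"{phrase} : {count}")
```

Because everything is lowercased, "I" is reported as "i", and the trailing period is dropped instead of being glued to "wash".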
To test the code from this post, put a Button, an EntryField, and a Memo on a Form, and add the following code.
A first try with
I took the car to the car wash
gives