I have read many times around the web about this question:
How do you extract the meaning of a page.
And I know that I am not experience enough to even try to suggest any solution. To me this is the holy grail of web programming or maybe even computer technology as a whole.
But through the power of imagination let us assume that I have written the ultimate script that does exactly that. For example I enter this text:
Imagination has brought mankind through the dark ages to its present state of civilization. Imagination led Columbus to discover America. Imagination led Franklin to discover electricity.
and my powerful script extracts the meaning and says this:
The ability of human beings to think leads them to discover new things.
For the purpose of this example, I used a "String" to explain the meaning the text. But if I had to store this in a database, or an array or any sort of storage, what will be the datatype I will be using?
Note that I can have another text that uses a different analogy but still has the same meaning worded differently, for example:
Imagination helps human kind advance.
Now I can enter a search query about the importance of imagination and these 2 results appear. But how will they be matched? Will it be a String comparison? Some integers, floating points? Maybe even binary?
What will the meaning be saved under? I would like to hear from you.
Update: Let me restate the question simply.
How do you represent Meaning in data?
Semantics is a wide and deep field, and there are many models, all of them with advantages and problems from an AI implementation point of view. With this scarce amount of background, one can hardly make a recommendation, beyond "study the literature, and pick a theory which resonates with your intuition (and if you are at all successful in this, replace it with a better theory of your own, and score academic points)". Having said that, the freshman course material I can vaguely recollect used to have nice things to say about a recursive structure called a "frame", but this must have been 15 years ago.