In the NLTK senseval module, senses are of the form HARD1, HARD2, etc. (see source here). However, there doesn't seem to be a way to get the actual definition. I'm trying to implement the Lesk algorithm, and I'm now attempting to check whether the sense predicted by the Lesk algorithm is correct (using a definition from WordNet).
The problem I'm running into is how to unify the WordNet definition with the senseval answer (HARD1, HARD2). Does anybody know how to translate the SENSEVAL sense into a definition, or look it up somewhere?
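For context, here is a rough sketch of how the labels can be inspected. It assumes the senseval corpus has already been fetched with nltk.download('senseval'); the helper names count_sense_labels and load_hard_instances are made up for illustration. Each instance exposes a senses tuple, but nothing in it carries a gloss.

```python
from collections import Counter


def count_sense_labels(instances):
    """Count how often each sense label occurs across senseval instances."""
    counts = Counter()
    for inst in instances:
        # inst.senses is a tuple of labels such as ('HARD1',)
        counts.update(inst.senses)
    return counts


def load_hard_instances():
    """Load the instances for the word 'hard' (assumes the corpus is installed)."""
    # Imported lazily so count_sense_labels can be used without NLTK present.
    from nltk.corpus import senseval
    return senseval.instances('hard.pos')


# Typical use, once the corpus is available:
#   print(count_sense_labels(load_hard_instances()))
```

The output is only a tally of labels, which is exactly the gap: there is no definition attached to HARD1 or HARD2 to compare against a WordNet gloss.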
I ended up finding out that these correspond to the senses in WordNet 1.7, which is pretty archaic (it doesn't seem to be easily installable on Mac OS X or Ubuntu 11.04).
There are no online versions of WordNet 1.7 that I could find.
This site also has some useful information about these three corpora. For example, it says that the six senses of interest were taken from the Longman English Dictionary Online (circa 2001). See here. It describes the source of HARD as WordNet 1.7.
Ultimately, I ended up manually mapping the definitions to those in WordNet 3.0. If you're interested, here's the dictionary. Note, however, that I'm not an expert on linguistics, and the mappings are not exact.
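To show how a mapping like this can be used to score Lesk, here is a minimal sketch. The names SENSE_MAP and lesk_is_correct are hypothetical, and the single entry shown is an illustrative guess rather than a copy of the linked dictionary, so check it against your own WordNet 3.0 install before relying on it. Each senseval label maps to a set of synset names because the correspondence is not always one-to-one.

```python
from typing import Dict, Iterable, Optional, Set

# Hypothetical hand-built mapping: senseval label -> acceptable WordNet 3.0 synset names.
# The entry below is an illustrative guess; fill in and verify the real mapping yourself.
SENSE_MAP: Dict[str, Set[str]] = {
    "HARD1": {"difficult.a.01"},
}


def lesk_is_correct(
    predicted: str,
    gold_labels: Iterable[str],
    sense_map: Optional[Dict[str, Set[str]]] = None,
) -> bool:
    """Return True if the predicted synset name is mapped from any gold senseval label."""
    if sense_map is None:
        sense_map = SENSE_MAP
    # Labels missing from the mapping simply never match.
    return any(predicted in sense_map.get(label, set()) for label in gold_labels)
```

With NLTK 3, the predicted name would typically come from nltk.wsd.lesk(inst_tokens, 'hard'), which returns a Synset (or None) whose name() gives a string like the ones above, and wordnet.synset(name).definition() returns the gloss if you want to eyeball the match.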