I am trying to write a simple lexer that recognises words such as prepositions. I have lists of these words in CSV format. At the moment I have a lexer that works but I am having to type out each string from my list individually e.g.:
...
("before",rest) -> TokenPreposition : lexer rest
("behind",rest) -> TokenPreposition : lexer rest
...
Is it possible to read the words in from the CSV files? I know there is a library for parsing CSV files but I wouldn't know how to continue after this?
You can use a
Set String
to store a word list and the use themember
function to determine if a word is in a set.Here is some example code. The input to
lexer
are lists of verbs, nouns and prepositions and a list of words which it then classifies according to which list the word is in.